Data Engineer
Role Summary:
The Data Engineer is the foundation of the AI CoE. You build and maintain the robust, scalable, and efficient data pipelines that feed clean, reliable data to the data scientists and ML models. You ensure that data is accessible, organized, and of high quality.
Key Responsibilities:
- Design, construct, install, and maintain large-scale data processing systems and data lakes/warehouses.
- Build scalable and reliable ETL/ELT pipelines to ingest data from diverse sources.
- Ensure data quality and implement data governance and security standards.
- Collaborate with Data Scientists to understand data requirements for modeling.
- Optimize data systems for performance and cost.
- Work with data architecture on data modeling and schema design.
Qualifications:
- Bachelor’s or Master’s in Computer Science or a related field.
- 3+ years of experience in a Data Engineer role.
- Expert-level proficiency in SQL and experience with big data technologies (e.g., Spark, Hadoop, Kafka).
- Strong programming skills in Python, Scala, or Java.
- Experience with cloud data services (e.g., AWS Redshift/S3, Azure Synapse/Data Lake, GCP BigQuery).
- Knowledge of data modeling and data warehousing concepts.
