Description
Position Overview:
As a Data Engineer, you will be responsible for designing, developing, and maintaining our data architecture and infrastructure. You will collaborate with cross-functional teams to understand data requirements, implement data pipelines, and support data-driven decision-making across the organization.
Key Responsibilities:
-
Data Pipeline Development:
- Design and implement robust, scalable, and maintainable data pipelines for the collection, processing, and storage of large volumes of data.
- Collaborate with data scientists and analysts to ensure seamless integration of data pipelines with analytical tools.
-
Database Management:
- Optimize and manage databases for performance, scalability, and reliability.
- Implement and maintain ETL processes to extract, transform, and load data from various sources.
-
Data Quality and Governance:
- Establish and enforce data quality standards and governance practices.
- Implement data monitoring and validation processes to ensure the accuracy and completeness of data.
-
Collaboration:
- Collaborate with cross-functional teams to understand data requirements and provide technical solutions.
- Work closely with data scientists and analysts to support their data needs and ensure the availability of clean and relevant data.
-
Continuous Improvement:
- Stay current with emerging trends and technologies in data engineering.
- Identify opportunities for process improvements and implement best practices in data engineering.
Qualifications:
- Bachelor’s or Master’s degree in Computer Science, Information Technology, or a related field.
- Proven experience as a Data Engineer or in a similar role.
- Proficiency in programming languages such as Python, Java, or Scala.
- Experience with data warehousing technologies (e.g., Redshift, Snowflake) and big data frameworks (e.g., Apache Spark).
Preferred Skills:
- Familiarity with cloud platforms such as AWS, Azure, or Google Cloud.
- Experience with containerization and orchestration tools (e.g., Docker, Kubernetes).
- Knowledge of data modeling and schema design principles.