Job Description
The Data Engineer exercises judgment when following general instructions and works with minimal instruction to support the integration and automation of data solutions. This role focuses on data massaging, reconciliation, and analysis, resolving routine to semi-routine issues. Responsibilities include creating optimized SQL queries, managing data pipelines, and collaborating with cross-functional teams to ensure data accuracy and availability.
Note: This role may come into contact with confidential or sensitive customer information requiring special treatment in accordance with Red Hat policies and applicable privacy laws.
- Write optimized and scalable complex SQL queries.
- Automate data processing tasks using Python, focusing on cleaning and merging datasets.
- Manage data pipelines, including scheduling, monitoring, and debugging workflows.
- Collaborate with data engineers and IT teams to maintain data accessibility for stakeholders.
- Assist in developing automated tests to ensure the accuracy and integrity of data.
- Participate in version control and CI/CD processes for deploying and testing pipeline changes across environments.
- Work cross-functionally with analysts, engineers, and operations.
- Data stewardship, including data governance, data compliance, data transformation, data cleanliness, data validation, and data audit/maintenance.
Primary Job Responsibilities
- Write complex, highly optimized SQL queries across large datasets; tune SQL queries and provide tuning recommendations.
- Apply data analytics experience, including hands-on use of Python libraries such as NumPy and Pandas.
- Use Python to massage and clean data and to automate data extracts and loads.
- Convert raw data to processed data by merging datasets and identifying outliers, errors, trends, missing values, and distributions.
- Create, debug, schedule, and monitor jobs using Airflow; resolve performance tuning issues and queries.
- Foster collaboration among data engineers, IT, and other business groups to ensure data is accessible to the FP&A team.
- Schedule regular hot backups and take part in backup activities.
- Apply strong analytical and problem-solving skills, including the ability to represent complex algorithms in software.
- Develop automated unit tests, end-to-end tests, and integration tests to support quality assurance (QA) procedures.
Required Skills
- Bachelor's or Master's degree in Computer Science, IT, Engineering, or equivalent.
- 5+ years of experience as a Data Engineer, BI Engineer, or Systems Analyst in a company with large, complex data sources.
- Working knowledge of DBT, Snowflake, Fivetran, and Git, plus SQL or Python programming skills for data querying, cleaning, and presentation.
- Ability to build highly available, reliable, and secure API solutions; experience with REST API design and implementation.
- Working knowledge of relational databases (PostgreSQL, MSSQL, etc.) and experience with AWS services including S3, Redshift, EMR, and RDS.
- Ability to manage multiple projects at the same time in a fast-paced team environment, across time zones and cultures, while continuing to work well as part of a team.
- Good troubleshooting skills and the ability to think through issues and problems logically; planning knowledge is an added advantage.
- Detail-oriented, enthusiastic, focused, and diligent about delivering results.