Senior Data Engineer

5 years

0 Lacs

Posted:3 days ago| Platform: Linkedin logo

Apply

Work Mode

On-site

Job Type

Full Time

Job Description

As a Senior Data Engineer, you will architect, build, and maintain our data infrastructure that powers critical business decisions. You will work closely with data scientists, analysts, and product teams to design and implement scalable solutions for data processing, storage, and retrieval. Your work will directly impact our ability to leverage data for business intelligence, machine learning initiatives, and customer insights.

Responsibilities

  • Design, build, and maintain our end-to-end data infrastructure on AWS and GCP cloud platforms.
  • Develop and optimize ETL/ELT pipelines to process large volumes of data from multiple sources.
  • Build and support data pipelines for reporting, analytics, and machine learning applications.
  • Implement and manage streaming data solutions using Kafka and other technologies.
  • Design and optimize database schemas and data models in ClickHouse and other databases.
  • Develop and maintain data workflows using Apache Airflow and similar orchestration tools.
  • Write efficient, maintainable, and scalable code using PySpark and other data processing frameworks.
  • Collaborate with data scientists to implement ML infrastructure for model training and deployment.
  • Ensure data quality, reliability, and security across all data platforms.
  • Monitor data pipelines and implement proactive alerting systems.
  • Troubleshoot and resolve data infrastructure issues.
  • Document data flows, architectures, and processes.
  • Stay current with industry trends and emerging technologies in data engineering.

Requirements

  • Bachelor's degree in Computer Science, Engineering, or related technical field (Master's preferred).
  • 5+ years of experience in data engineering roles.
  • Strong expertise in AWS and/or GCP cloud platforms and services.
  • Proficiency in building data pipelines using modern ETL/ELT tools and frameworks.
  • Experience with stream processing technologies such as Kafka.
  • Hands-on experience with ClickHouse or similar analytical databases.
  • Strong programming skills in Python and experience with PySpark.
  • Experience with workflow orchestration tools like Apache Airflow.
  • Solid understanding of data modeling, data warehousing concepts, and dimensional modeling.
  • Knowledge of SQL and NoSQL databases.
  • Strong problem-solving skills and attention to detail.
  • Excellent communication skills and ability to work in cross-functional teams.
  • Experience in D2C, e-commerce, or retail industries.
  • Knowledge of data visualization tools (Tableau, Looker, Power BI).
  • Experience with real-time analytics solutions.
  • Familiarity with CI/CD practices for data pipelines.
  • Experience with containerization technologies (Docker, Kubernetes).
  • Understanding of data governance and compliance requirements.
  • Experience with MLOps or ML engineering Technologies.
  • Cloud Platforms: AWS (S3 Redshift, EMR, Lambda), GCP (BigQuery, Dataflow, Dataproc).
  • Data Processing: Apache Spark, PySpark, Python, SQL.
  • Streaming: Apache Kafka, Kinesis.
  • Data Storage: ClickHouse, S, 3 BigQuery, PostgreSQL, MongoDB.
  • Orchestration: Apache Airflow.
  • Version Control: Git.
  • Containerization: Docker, Kubernetes (optional).
This job was posted by Sidharth Patra from Traya Health.

Mock Interview

Practice Video Interview with JobPe AI

Start PySpark Interview
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Python Skills

Practice Python coding challenges to boost your skills

Start Practicing Python Now

RecommendedJobs for You

Gurugram, Haryana, India

Bengaluru, Karnataka, India

Hyderabad, Telangana, India