Data Engineer

2 days ago


Gurugram, India LeewayHertz Full time

Job Description
This is a remote position.

Job Summary
As a Senior Data Engineer, you will be responsible for designing, building, and optimizing data pipelines and lakehouse architectures on AWS. You will ensure data availability, quality, lineage, and governance across analytical and operational platforms. Your expertise will enable scalable, secure, and cost-effective data solutions that power advanced analytics and business intelligence.

Responsibilities

  • Implement and manage S3 (raw, staging, curated zones), Glue Catalog, Lake Formation, and Iceberg/Hudi/Delta Lake for schema evolution and versioning.
  • Develop PySpark jobs on Glue/EMR, and enforce schema validation, partitioning, and scalable transformations.
  • Build workflows using Step Functions, EventBridge, or Airflow (MWAA), with CI/CD deployments via CodePipeline & CodeBuild.
  • Apply schema contracts, validations (Glue Schema Registry, Deequ, Great Expectations), and maintain lineage/metadata using Glue Catalog or third-party tools (Atlan, OpenMetadata, Collibra).
  • Enable Athena and Redshift Spectrum queries, manage operational stores (DynamoDB/Aurora), and integrate with OpenSearch for observability.
  • Design efficient partitioning/bucketing strategies, adopt columnar formats (Parquet/ORC), and implement spot instance usage and job bookmarking.
  • Enforce IAM-based access policies, and apply KMS encryption, private endpoints, and GDPR/PII data masking.
  • Prepare Gold-layer KPIs for dashboards, forecasting, and customer insights with QuickSight, Superset, or Metabase.
  • Partner with analysts, data scientists, and DevOps to enable seamless data consumption and delivery.

Requirements
Essential Skills
Job

  • Hands-on expertise with AWS data stack (S3, Glue, Lake Formation, Athena, Redshift, EMR, Lambda).
  • Strong programming skills in PySpark & Python for ETL, scripting, and automation.
  • Proficiency in SQL (CTEs, window functions, complex aggregations).
  • Experience in data governance and data quality frameworks (Deequ, Great Expectations).
  • Knowledge of data modeling, partitioning strategies, and schema enforcement.
  • Familiarity with BI integration (QuickSight, Superset, Metabase).

Personal

  • Strong problem-solving ability in complex data environments.
  • Ability to communicate technical insights to non-technical stakeholders.
  • Commitment to best practices in data governance, compliance, and security.
  • Collaborative mindset with cross-functional teams.

Preferred Skills
Job

  • Real-time ingestion experience (Kinesis, MSK, Kafka on AWS).
  • Exposure to ML feature store integration with SageMaker.
  • Infrastructure as Code (Terraform, CloudFormation, or CDK).
  • Experience with Data Mesh or domain-driven data architecture.

Personal

  • Experience mentoring junior data engineers.
  • Ability to lead data projects from design to production.
  • Proactive in learning new AWS and data ecosystem technologies.

Other Relevant Information

  • Bachelor's/Master's degree in Computer Science, Information Technology, or a related field.
  • Minimum 4 years of proven experience in data engineering with AWS.

Benefits

  • This role offers the flexibility of working remotely in India.

LeewayHertz is an equal opportunity employer and does not discriminate based on race, color, religion, sex, age, disability, national origin, sexual orientation, gender identity, or any other protected status. We encourage a diverse range of applicants.



  • Gurugram, India CoPoint Data Full time

    About CoPoint Data CoPoint Data is a specialized consulting firm focused on transforming businesses through process improvement, data insights, and technology-driven innovation. We leverage AI technologies, Microsoft cloud platforms, and modern web development frameworks to deliver intelligent, scalable solutions that drive measurable impact for our clients....


  • Gurugram, India CoPoint Data Full time

    About CoPoint AI CoPoint AI is a specialized consulting firm focused on transforming businesses through process improvement, data insights, and technology-driven innovation. We leverage AI technologies, Microsoft cloud platforms, and modern web development frameworks to deliver intelligent, scalable solutions that drive measurable impact for our clients. Our...

  • Data Engineer

    2 days ago


    Gurugram, India Obrimo Technologies Full time

    Responsibilities: Develop, implement, and maintain efficient and scalable data engineering solutions to support the company's data initiatives. Collaborate with cross-functional teams to understand business requirements and translate them into technical data engineering solutions. Build and maintain data pipelines to ingest, transform, and store...

  • Data Engineer

    2 days ago


    Gurugram, India AuxoAI Full time

    Role Summary: AuxoAI is seeking a skilled and experienced Data Engineer to join our dynamic team. The ideal candidate will have 7-10 years of prior experience in data engineering, with a strong background in Databricks. This role offers an exciting opportunity to work on diverse projects, collaborating with cross-functional teams to design, build, and optimize...

  • Data engineer

    2 days ago


    Gurugram, India Orbion Infotech Full time

    Company Description Orbion Infotech is your trusted partner for comprehensive software services and top-tier staff augmentation solutions. With a track record of success, we empower organizations to thrive in today's digital landscape. Our dedicated team of industry experts offers custom software development, staff augmentation, and strategic technology...

  • Data Engineer

    2 days ago


    Gurugram, India Healthpoint Ventures Full time

    Company Description At Healthpoint Ventures, we are dedicated to fostering innovation in healthcare through strategic collaboration with providers, payers, and stakeholders. Our mission is to harness the power of artificial intelligence to maximize value across the healthcare ecosystem. By driving impactful projects and joint ventures, we leverage AI to...

  • Data Engineer

    1 day ago


    Gurugram, India NatWest Group Full time

    Join us as a Data Engineering Lead. This is an exciting opportunity to use your technical expertise to collaborate with colleagues and build effortless, digital first customer experiences. You'll be simplifying the bank through developing innovative data driven solutions, inspiring to be commercially successful through insight, and keeping our customers' and...

  • Data Engineer

    2 days ago


    Gurugram, India Digital Business People Full time

    Job Title: Data Engineer Location: Gurgaon (On-site) Experience: 4 Years Joining: Immediate About the Role We are looking for a skilled Data Engineer with strong expertise in Python, AWS Cloud, and Redshift to design, build, and maintain scalable data pipelines and solutions. The ideal candidate will have hands-on experience with large-scale data systems, cloud...

  • Data Engineer

    1 day ago


    Gurugram, India ExcelGens, Inc. Full time

    We're Hiring: Data Engineer (Microsoft Fabric Specialist) Are you an experienced Data Engineer (5–7 years) with strong hands-on expertise in Microsoft Fabric? Join ExcelGens in Gurgaon and play a key role in building scalable, modern, and intelligent data solutions. What you'll do: Design & develop data pipelines using Microsoft Fabric (Data Factory,...

  • Data Engineer

    1 day ago


    Gurugram, India ExcelGens, Inc. Full time

    We're Hiring: Data Engineer (Microsoft Fabric Specialist) Are you an experienced Data Engineer (2-3 years) with strong hands-on expertise in Microsoft Fabric? Join ExcelGens, Inc. in Gurgaon and play a key role in building scalable, modern, and intelligent data solutions. What you'll do: Design & develop data pipelines using Microsoft Fabric (Data Factory,...