Senior Data Engineer

1 day ago


Gurugram, India | LeewayHertz | Full time

Job Description

This is a remote position.

Job Summary

We are looking for a Senior Data Quality & Governance Engineer to take ownership of enforcing data contracts, quality, and metadata standards across intelligence and analytics platforms. The role involves designing lineage models, implementing statistical validations, and establishing reusable quality dashboards, all using AWS-native and lightweight open-source tools.

Responsibilities

- Design and enforce data contracts across the Raw, Clean, and Modeled zones.
- Define schema versioning policies, KPI logic, and metadata rules across business-critical datasets.
- Implement row-level and statistical validations using Great Expectations or Deequ.
- Create audit-ready QA tables to track failed checks, schema mismatches, and data regressions.
- Track end-to-end lineage and KPI evolution using OpenMetadata (with Glue/S3/Athena).
- Auto-classify columns as PII, derived, or forecast-driving fields using AWS Glue Tags/Scripts.
- Provide CTAS-based Athena queries for building QA dashboards and UAT verifications.
- Build BI-ready, QA-approved datasets for downstream tools like Superset and Power BI.
- Establish reusable profiling and validation dashboards for data quality and business teams.
- Collaborate with engineers, QA, and business SMEs to finalize data validation logic.

Requirements

Essential Skills

Job

- Hands-on experience with AWS Glue (Jobs, Crawlers, Catalog), S3, and Athena.
- Strong foundation in data contracts, quality enforcement, and schema versioning.
- Expertise in using Deequ or Great Expectations for anomaly detection and data validation.
- Familiarity with OpenMetadata, Amundsen, or custom metadata tracking solutions.
- Ability to tag and manage sensitive data fields (e.g., PII, model inputs, derived KPIs).
- Strong SQL with Athena (CTEs, CTAS, filters, aggregations).
- Experience building QA dashboards in Superset, Streamlit, or similar BI tools.

Personal

- Excellent collaboration and communication with QA, architects, and business teams.
- Self-driven with attention to detail in schema accuracy and metadata enrichment.
- Ability to translate KPIs and quality rules into validation logic.
- Proactive in surfacing data regressions and audit issues before they reach production.
- High ownership mindset with a strong data compliance and governance attitude.

Preferred Skills

Job

- Implementation of statistical QA techniques like z-score anomalies or entropy thresholds.
- Experience handling rejected record logs, schema drift validations, and data reconciliation.
- Awareness of AWS cost optimization techniques in Glue and Athena.

Personal

- Proactive, ownership-driven mindset with a collaborative approach.
- Strong communication and collaboration skills.
- Strong analytical and problem-solving skills with attention to detail.
- Ability to work under stringent deadlines and demanding client conditions.
- Ability to work in fast-paced, delivery-focused environments.
- Strong mentoring and documentation skills.
- Ability to take end-to-end ownership of QA validation modules.

Other Relevant Information

- Bachelor's degree in Computer Science, Information Technology, or a related field.
- Minimum of 9 years of experience in data engineering and architecture.

Benefits

- This role offers the flexibility of working remotely in India.

LeewayHertz is an equal opportunity employer and does not discriminate based on race, color, religion, sex, age, disability, national origin, sexual orientation, gender identity, or any other protected status. We encourage a diverse range of applicants.



