Data Engineer

2 weeks ago


Gurgaon, Haryana, India Masin Projects Pvt. Ltd Full time

Data Engineer - Multi-source ETL & GenAI Pipelines (3+ Years)

Roles and Responsibilities :

- Build and maintain scalable, fault-tolerant data pipelines to support GenAI and analytics workloads across OCR, documents, and case data.

- Manage ingestion and transformation of semi-structured legal documents (PDF, Word, Excel) into structured formats.

- Enable RAG workflows by processing data into chunked, vectorized formats with metadata.

- Handle large-scale ingestion from multiple sources into cloud-native data lakes (S3, GCS), data warehouses (BigQuery, Snowflake), and PostgreSQL.

- Automate pipelines using orchestration tools like Airflow/Prefect, including retry logic, alerting, and metadata tracking.

- Collaborate with ML Engineers to ensure data availability, traceability, and performance for inference and training pipelines.

- Implement data validation and testing frameworks using Great Expectations or dbt.

- Integrate OCR pipelines and post-processing outputs for embedding and document search.

- Design infrastructure for streaming vs batch data needs and optimize for cost, latency, and reliability.

Qualifications :

- Bachelors or Masters degree in Computer Science, Data Engineering, or equivalent.

- 3+ years of experience in building distributed data pipelines and managing multi-source ingestion.

- Proficiency with Python, SQL, and data tools like Pandas, PySpark.

- Experience working with data orchestration tools (Airflow, Prefect), and file formats like Parquet, Avro, JSON.

- Hands-on experience with cloud storage/data warehouse systems (S3, GCS, BigQuery, Redshift).

- Understanding of GenAI and vector database ingestion pipelines is a strong plus.

- Bonus : Experience with OCR tools (Tesseract, Google Document AI), PDF parsing libraries (PyMuPDF), and API-based document processors.

(ref:hirist.tech)
  • Data Engineer Advisor

    14 hours ago


    Gurgaon, Haryana, India NTT DATA Full time ₹ 9,00,000 - ₹ 12,00,000 per year

    Req ID: 338198NTT DATA strives to hire exceptional, innovative and passionate individuals who want to grow with us. If you want to be part of an inclusive, adaptable, and forward-thinking organization, apply now. We are currently seeking a Data Engineer Advisor to join our team in Gurgaon, Haryāna (IN-HR), India (IN). Technical Support: Provide L1, L2...


  • Gurgaon, Haryana, India NTT DATA Full time

    Req ID 295081NTT DATA strives to hire exceptional innovative and passionate individuals who want to grow with us If you want to be part of an inclusive adaptable and forward-thinking organization apply now We are currently seeking a Digital Engineering Engineer to join our team in Gurgaon Hary xc4 x81na IN-HR India IN Job Duties Must have ...

  • Data Engineer Advisor

    16 hours ago


    Gurgaon, Haryana, India NTT DATA North America Full time ₹ 8,00,000 - ₹ 24,00,000 per year

    Req ID:338198NTT DATA strives to hire exceptional, innovative and passionate individuals who want to grow with us. If you want to be part of an inclusive, adaptable, and forward-thinking organization, apply now.We are currently seeking a Data Engineer Advisor to join our team in Gurgaon, Haryāna (IN-HR), India (IN).Technical Support:Provide L1, L2 and L3...

  • Data Engineer

    2 weeks ago


    Gurgaon, Haryana, India AuxoAI Full time ₹ 15,00,000 - ₹ 28,00,000 per year

    Role SummaryAuxoAI is seeking a skilled and experienced Data Engineer to join our dynamic team. The ideal candidate will have 7-10 years of prior experience in data engineering, with a strong background in Databricks. This role offers an exciting opportunity to work on diverse projects, collaborating with cross-functional teams to design, build, and optimize...

  • Data Engineer

    7 days ago


    Gurgaon, Haryana, India Strategic Talent Partner Full time ₹ 15,00,000 - ₹ 28,00,000 per year

    Role: Data Engineer for financial datasets in a quant research & trading environment.Key Responsibilities:Build Python-driven data ingestion pipelines from market & internal sources.Ensure data quality with automated validation & anomaly detection.Manage data vendors, optimize costs, & expand data pipeline.Collaborate with quants & engineers for seamless...

  • Data Engineer

    3 days ago


    Gurgaon, Haryana, India NatWest Group Full time US$ 1,50,000 - US$ 2,00,000 per year

    Join us as a Data Engineering LeadThis is an exciting opportunity to use your technical expertise to collaborate with colleagues and build effortless, digital first customer experiencesYou'll be simplifying the bank through developing innovative data driven solutions, inspiring to be commercially successful through insight, and keeping our customers' and the...

  • Data Engineer

    7 days ago


    Gurgaon, Haryana, India ExcelGens, Inc. Full time ₹ 8,00,000 - ₹ 12,00,000 per year

    We're Hiring:Data Engineer (Microsoft Fabric Specialist)Are you an experienced Data Engineer (2-3 years) with strong hands-on expertise in Microsoft Fabric?JoinExcelGens, Inc.in Gurgaon and play a key role in building scalable, modern, and intelligent data solutions.What you'll do: Design & develop data pipelines using Microsoft Fabric (Data Factory, Data...

  • Data Engineer

    5 days ago


    Gurgaon, Haryana, India Skillventory Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    Role & responsibilitiesRole PurposeThe Data Engineer will be responsible for developing and maintaining ADA (Analytical Data Architecture) solutions that support the Personal Bank Reporting teams. This role requires a strong technical foundation in data engineering, with a focus on building scalable, efficient data pipelines and enabling high-quality...

  • Data Engineer

    2 weeks ago


    Gurgaon, Haryana, India BigStep Technologies Pvt Ltd Full time ₹ 1,04,000 - ₹ 1,30,878 per year

    Key Responsibilities:  Build, scale, and maintain robust data solutions to support business objectives. Design, implement, and optimize high-performance data pipelines (extraction,loading, transformation, and orchestration) with scalability, reliability, and speed. Lead end-to-end software development projects involving large language models(LLMs),...

  • Data Engineer

    2 weeks ago


    Gurgaon, Haryana, India Strategic Talent Partner Full time

    About the Role : We are seeking a proactive and detail-oriented Data Engineer to build and maintain robust data processes for our financial datasets. You will play a pivotal role in designing data ingestion pipelines, ensuring data quality, and enabling seamless access for our quantitative research and trading Responsibilities : Python Driven Data Pipelines...