Data Engineer

16 hours ago


Delhi, Delhi, India GEMTECH PARAS SOLUTIONS PVT LTD Full time ₹ 14,00,000 - ₹ 18,00,000 per year

Job description:

We're looking for a hands-on Data Engineer to manage and scale our data scraping pipelines across 60+ websites. The job involves handling OCR-processed PDFs, ensuring data quality, and building robust, self-healing workflows that fuel AI-driven insights.

You'll Work On:

Managing and optimizing Airflow scraping DAGs

Implementing validation checks, retry logic & error alerts

Cleaning and normalizing OCR text (Tesseract / AWS Textract)

Handling deduplication, formatting, and missing data

Maintaining MySQL/PostgreSQL data integrity

Collaborating with ML engineers on downstream pipelines

What You Bring:

2–5 years of hands-on experience in Python data engineering

Experience with Airflow, Pandas, and OCR tools

Solid SQL skills and schema design (MySQL/PostgreSQL)

Comfort with CSVs and building ETL pipelines

Required:

  1. Scrapy or Selenium experience

  2. CAPTCHA handling

  3. Experience with PyMuPDF and regex

  4. AWS S3

  5. LangChain, LLMs, FastAPI

  6. Streamlit

  7. Matplotlib

Job Type: Full-time

Day shift

Work Location: In person

Pay: ₹70, ₹150,000.00 per month

Application Question(s):

  • Total years of experience in web scraping / data extraction

  • Have you worked with large-scale data pipelines?

  • Are you proficient in writing complex Regex patterns for data extraction and cleaning?

  • Have you implemented or managed data pipelines using tools like Apache Airflow?

  • Years of experience with PDF Parsing and using OCR tools (e.g., Tesseract, Google Document AI, AWS Textract, etc.)

  • Years of experience handling complex PDF tables with merged rows, rotated layouts, or inconsistent formatting

  • Are you willing to relocate to Delhi if selected?

  • Current CTC

  • Expected CTC


  • Data Engineer

    2 weeks ago


    Delhi, Delhi, India ixceed Full time ₹ 30,00,000 - ₹ 45,00,000 per year

    Role: Data Engineer (Python) Location: Delhi Mode: Permanent Type: Hybrid Job Description: We are looking for a highly motivated Data Engineer with a strong foundation in computer science and hands-on experience in Python and systems engineering. This role involves designing and developing data-driven applications, APIs, and interactive dashboards while applying...

  • Data Engineer

    7 days ago


    Delhi, Delhi, India Straive Full time ₹ 6,00,000 - ₹ 12,00,000 per year

    Company Description: Straive operationalizes Data Analytics and AI for global enterprises, including several Fortune 500 companies, to drive efficiency and enhance user experience. As a global leader in AI-driven value creation, Straive empowers diverse clients across industries such as Banking, Financial and Information Services, Retail, Media and Technology,...

  • Data Engineer

    2 weeks ago


    Delhi, Delhi, India EXL Service Full time ₹ 20,00,000 - ₹ 25,00,000 per year

    Data Engineer (SAS DQ Conversion). We're Hiring: Data Engineer (SAS DQ Conversion). Experience: 6 Years. Notice Period: Immediate Joiners only (First Come, First Serve). Join us at EXL, where you'll work on modernizing legacy systems, optimizing SQL queries, and delivering impactful data solutions. Must-Haves: Proficiency in SQL with hands-on...

  • Data Engineer

    2 weeks ago


    Delhi, Delhi, India Weekday Full time ₹ 15,00,000 - ₹ 25,00,000 per year

    This role is for one of our clients. Company Name: Industry: Technology, Information and Media Seniority level: Mid-Senior level Min Experience: 5 years Location: NCR JobType: full-time We are seeking an experienced Data Engineer with deep expertise in the AWS data ecosystem to design, build, and optimize scalable data pipelines and platforms. The ideal candidate...

  • Data Engineer

    1 week ago


    Delhi, Delhi, India Iris Software Pvt Ltd Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    Data Engineer. We're hiring for the role of Data Engineer. Mandatory skills: Databricks, PySpark, Big Data, Hadoop, Hive. Experience: 5-9 years. Notice period: immediate to 15 days joiners only. Interested candidates can send their CV.

  • Data Engineer

    3 days ago


    Delhi, Delhi, India Environmental Resources Management Full time ₹ 9,00,000 - ₹ 12,00,000 per year

    Who is ERM? ERM is a leading global sustainability consulting firm, committed for nearly 50 years to helping organizations navigate complex environmental, social, and governance (ESG) challenges. We bring together a diverse and inclusive community of experts across regions and disciplines, providing a truly multicultural environment that fosters...

  • Data Engineer

    7 days ago


    Delhi, Delhi, India Natlov Technologies Pvt Ltd Full time ₹ 9,00,000 - ₹ 12,00,000 per year

    Hiring: Data Engineer (AWS) Location: Remote | Shift: 6 PM – 3 AM IST Join Natlov Technologies Pvt. Ltd. and be part of a dynamic data engineering team. What You'll Do: Build and optimize scalable data pipelines & models. Ensure data quality, security, and performance. Collaborate across teams & mentor juniors. Work with modern tools like AWS, BigQuery, Snowflake,...

  • Data Engineer

    4 days ago


    Delhi, Delhi, India Talent Worx Full time ₹ 6,00,000 - ₹ 18,00,000 per year

    We are looking for a skilled Data Engineer / Database Administrator (DBA) to join our dynamic team at Talent Worx. This role involves designing, implementing, and maintaining database systems while ensuring data integrity and performance optimization. The ideal candidate will have a strong foundation in both data engineering and database...

  • Data Engineer

    1 week ago


    Delhi, Delhi, India ISITCA PRIVATE LIMITED Full time ₹ 8,00,000 - ₹ 12,00,000 per year

    About the Role: We are looking for a hands-on Data Engineer to join our team and take full ownership of scraping pipelines and data quality. You'll be working on data from 60+ websites involving PDFs, processed via OCR and stored in MySQL/PostgreSQL. You'll build robust, self-healing pipelines and fix common data issues (missing fields, duplication,...

  • Data Engineer

    1 week ago


    Delhi, Delhi, India Hunch Full time ₹ 8,00,000 - ₹ 24,00,000 per year

    What is Hunch? Hunch is a dating app that helps you land a date without swiping like a junkie. Designed for people tired of mindless swiping and commodified matchmaking, Hunch leverages a powerful AI-engine to help users find meaningful connections by focusing on personality over just looks. With 2M+ downloads and a 4.4-star rating, Hunch is going viral in the...