Data Engineer

4 weeks ago


New Delhi, India ISITCA PRIVATE LIMITED Full time

About the Role: We are looking for a hands-on Data Engineer to join our team and take full ownership of scraping pipelines and data quality. You'll be working on data from 60+ websites involving PDFs, processed via OCR and stored in MySQL/PostgreSQL. You’ll build robust, self-healing pipelines and fix common data issues (missing fields, duplication, formatting errors).

Responsibilities:

  • Own and optimize Airflow scraping DAGs for 60+ sites
  • Implement validation checks, retry logic, and error alerts
  • Build pre-processing routines to clean OCR'd text
  • Create data normalization and deduplication workflows
  • Maintain data integrity across MySQL and PostgreSQL
  • Collaborate with ML team for downstream AI use cases

Requirements:

  • 2–5 years of experience in Python-based data engineering
  • Experience with Airflow, Pandas, OCR (Tesseract or AWS Textract)
  • Solid SQL and schema design skills (MySQL/PostgreSQL)
  • Familiarity with CSV processing and data pipelines
  • Bonus: Experience with scraping using Scrapy or Selenium

Location: Delhi (in-office only)

Minimum 3 years experience

must be a graduate: b tech preferred / BCA/ MCA /BSc /MSc


Mandatory keywords (must have skills)


scraping

python

selenium

NumPy

Pandas


Optional Keywords: (good to have the following skills)


Beautiful soup

MySQL

Large Language Model ( LLM)

Machine Learning

Natural Language Processing (NLP)

GitHub

Django



  • New Delhi, India Eucloid Data Solutions Full time

    Job DescriptionEucloid is looking for a skilled Data Engineer to join our team and contribute to the design, development, and optimization of data frameworks pipelines using Python, Elasticsearch, SQL DB and Queue System. The ideal candidate will have a strong technical background, a passion for solving complex problems, and experience in building scalable...


  • New Delhi, India Eucloid Data Solutions Full time

    Job DescriptionEucloid is looking for a skilled Data Engineer to join our team and contribute to the design, development, and optimization of data frameworks pipelines using Python, Elasticsearch, SQL DB and Queue System. The ideal candidate will have a strong technical background, a passion for solving complex problems, and experience in building scalable...

  • Data Engineer

    2 weeks ago


    Delhi, India Data-Hat AI Full time

    Department: Data Engineering & AI SolutionsReports To: Lead Data Solutions ArchitectTravel: International travel required (up to 30–40%)Position Summary:We are hiring a senior-level Data Engineer to lead the design, development, and optimization of high-performance data infrastructure that underpins mission-critical AI systems. With 12+ years of...

  • Data engineer

    2 weeks ago


    Delhi, India Data-Hat AI Full time

    Department: Data Engineering & AI Solutions  Reports To: Lead Data Solutions ArchitectTravel: International travel required (up to 30–40%)Position Summary:  We are hiring a senior-level Data Engineer to lead the design, development, and optimization of high-performance data infrastructure that underpins mission-critical AI systems. With 12+ years of...

  • Data Engineer

    2 weeks ago


    Delhi, India Data-Hat AI Full time

    Department: Data Engineering & AI SolutionsReports To: Lead Data Solutions ArchitectTravel: International travel required (up to 30–40%)Position Summary:We are hiring a senior-level Data Engineer to lead the design, development, and optimization of high-performance data infrastructure that underpins mission-critical AI systems. With 12+ years of...

  • Data Engineer

    2 weeks ago


    Delhi, India Data-Hat AI Full time

    Department:Data Engineering & AI SolutionsReports To:Lead Data Solutions ArchitectTravel:International travel required (up to 30–40%)Position Summary: We are hiring a senior-level Data Engineer to lead the design, development, and optimization of high-performance data infrastructurethat underpins mission-critical AI systems. With 12+ years of experience,...

  • Data Engineer

    2 weeks ago


    Delhi, India Data-Hat AI Full time

    Department:Data Engineering & AI SolutionsReports To:Lead Data Solutions ArchitectTravel:International travel required (up to 30–40%)Position Summary:We are hiring a senior-level Data Engineer to lead the design, development, and optimization of high-performance data infrastructurethat underpins mission-critical AI systems. With 12+ years of experience,...

  • Technical Specialist

    2 weeks ago


    New Delhi, India NTT DATA Full time

    Job Description Make an impact with NTT DATA Join a company that is pushing the boundaries of what is possible. We are renowned for our technical excellence and leading innovations, and for making a difference to our clients and society. Our workplace embraces diversity and inclusion – it’s a place where you can grow, belong and thrive. Your day at...


  • Delhi, India Eucloid Data Solutions Full time

    About The Role:Eucloid is looking for a senior/ lead Data Engineer to join our Data Platform team supporting various business applications. The ideal candidate will support development of data infrastructure on Databricks for our clients by participating in activities which may includestarting from up- stream and down-stream technology selection to designing...


  • Delhi, India Eucloid Data Solutions Full time

    About The Role:Eucloid is looking for a senior/ lead Data Engineer to join our Data Platform team supporting various business applications. The ideal candidate will support development of data infrastructure on Databricks for our clients by participating in activities which may includestarting from up- stream and down-stream technology selection to designing...