
Data Engineer
16 hours ago
Job description:
We're looking for a hands-on Data Engineer to manage and scale our data scraping pipelines across 60+ websites. The job involves handling OCR-processed PDFs, ensuring data quality, and building robust, self-healing workflows that fuel AI-driven insights.
You'll Work On:
Managing and optimizing Airflow scraping DAGs
Implementing validation checks, retry logic & error alerts
Cleaning and normalizing OCR text (Tesseract / AWS Textract)
Handling deduplication, formatting, and missing data
Maintaining MySQL/PostgreSQL data integrity
Collaborating with ML engineers on downstream pipelines
What You Bring:
2–5 years of hands-on experience in Python data engineering
Experience with Airflow, Pandas, and OCR tools
Solid SQL skills and schema design (MySQL/PostgreSQL)
Comfort with CSVs and building ETL pipelines
Required:
Scrapy or Selenium experience
CAPTCHAs handling
Experience in PyMuPDF, Regex
AWS S3
LangChain, LLM, Fast API
Streamlit
Matplotlib
Job Type: Full-time
Day shift
Work Location: In person
Job Type: Full-time
Pay: ₹70, ₹150,000.00 per month
Application Question(s):
Total years of experience in web scraping / data extraction
Have you worked with large-scale data pipelines?
Are you proficient in writing complex Regex patterns for data extraction and cleaning?
Have you implemented or managed data pipelines using tools like Apache Airflow?
Years of experience with PDF Parsing and using OCR tools (e.g., Tesseract, Google Document AI, AWS Textract, etc.)
- Years of experience handling complex PDF tables with merged rows, rotated layouts, or inconsistent formatting
Are you willing to relocate to Delhi if selected?
Current CTC
Expected CTC
Work Location: In person
-
Data Engineer
2 weeks ago
Delhi, Delhi, India ixceed Full time ₹ 30,00,000 - ₹ 45,00,000 per yearRole: Data Engineer (Python)Location: DelhiMode: PermanentType: HybridJob Description:We are looking for a highly motivated Data Engineer with a strong foundation in computer science and hands-on experience in Python and systems engineering. This role involves designing and developing data-driven applications, APIs, and interactive dashboards while applying...
-
Data Engineer
7 days ago
Delhi, Delhi, India Straive Full time ₹ 6,00,000 - ₹ 12,00,000 per yearCompany DescriptionStraive operationalizes Data Analytics and AI for global enterprises, including several Fortune 500 companies, to drive efficiency and enhance user experience. As a global leader in AI-driven value creation, Straive empowers diverse clients across industries such as Banking, Financial and Information Services, Retail, Media and Technology,...
-
Data Engineer
2 weeks ago
Delhi, Delhi, India EXL Service Full time ₹ 20,00,000 - ₹ 25,00,000 per yearData Engineer (SAS DQ Conversion) We're Hiring: Data Engineer (SAS DQ Conversion) Experience: 6 Years Notice Period: Immediate Joiners only (First Come, First Serve) Join us at EXL, where you'll work on modernizing legacy systems, optimizing SQL queries, and delivering impactful data solutions. Must-Haves:Proficiency in SQL with hands-on...
-
Data Engineer
2 weeks ago
Delhi, Delhi, India Weekday Full time ₹ 15,00,000 - ₹ 25,00,000 per yearThis role is for one of our clientsCompany Name: Industry: Technology, Information and MediaSeniority level: Mid-Senior levelMin Experience: 5 yearsLocation: NCRJobType: full-timeWe are seeking an experienced Data Engineer with deep expertise in the AWS data ecosystem to design, build, and optimize scalable data pipelines and platforms. The ideal candidate...
-
Data Engineer
1 week ago
Delhi, Delhi, India Iris Software Pvt Ltd Full time ₹ 12,00,000 - ₹ 36,00,000 per yearData Engineer We're hiring for the Role of Data Engineer. Mandatory skills : Databricks , Pyspark , Bigdata , Hadoop , Hive Experience: 5-9 Years Notice Period: IMMEDIATE to 15Days JOINERS only Interested candidate can send your CV.
-
Data Engineer
3 days ago
Delhi, Delhi, India Environmental Resources Management Full time ₹ 9,00,000 - ₹ 12,00,000 per yearWho is ERM?ERM is a leading global sustainability consulting firm, committed for nearly 50 years to helping organizations navigate complex environmental, social, and governance (ESG) challenges. We bring together a diverse and inclusive community of experts across regions and disciplines, providing a truly multicultural environment that fosters...
-
Data Engineer
7 days ago
Delhi, Delhi, India Natlov Technologies Pvt Ltd Full time ₹ 9,00,000 - ₹ 12,00,000 per yearHiring: Data Engineer (AWS )Location:Remote |Shift:6 PM – 3 AM ISTJoinNatlov Technologies Pvt. Ltd.and be part of a dynamic data engineering teamWhat You'll DoBuild and optimize scalable data pipelines & modelsEnsure data quality, security, and performanceCollaborate across teams & mentor juniorsWork with modern tools like AWS, BigQuery, Snowflake,...
-
Data Engineer
4 days ago
Delhi, Delhi, India Talent Worx Full time ₹ 6,00,000 - ₹ 18,00,000 per yearWe are looking for a skilled Data Engineer / Database Administrator (DBA) to join our dynamic team at Talent Worx. This role involves designing, implementing, and maintaining database systems while ensuring data integrity and performance optimization. The ideal candidate will have a strong foundation in both data engineering and database...
-
Data Engineer
1 week ago
Delhi, Delhi, India ISITCA PRIVATE LIMITED Full time ₹ 8,00,000 - ₹ 12,00,000 per yearAbout the Role: We are looking for a hands-on Data Engineer to join our team and take full ownership of scraping pipelines and data quality. You'll be working on data from 60+ websites involving PDFs, processed via OCR and stored in MySQL/PostgreSQL. You'll build robust, self-healing pipelines and fix common data issues (missing fields, duplication,...
-
Data Engineer
1 week ago
Delhi, Delhi, India Hunch Full time ₹ 8,00,000 - ₹ 24,00,000 per yearWhat is Hunch?Hunch is a dating app that helps you land a date without swiping like a junkie. Designed for people tired of mindless swiping and commodified matchmaking, Hunch leverages a powerful AI-engine to help users find meaningful connections by focusing on personality over just looks. With2M+ downloadsand a4.4-star rating, Hunch is going viral in the...