Data Extractor

2 days ago


Gandhinagar, Gujarat, India beBeeData Full time ₹ 1,50,00,000 - ₹ 2,50,00,000

As a Data Engineer, you will be responsible for extracting and processing large amounts of data from various sources. This involves designing and implementing scalable web scraping systems to handle high-volume data collection without interruptions.

The ideal candidate will have experience in web scraping, crawling, or data collection, with strong proficiency in Python and familiarity with NoSQL databases and data serialization formats.

Key responsibilities include building and maintaining automated scrapers, developing multi-threaded crawlers, normalizing scraped data, and ensuring consistency before passing it to data pipelines.

You will also implement anti-bot and evasion tactics, such as proxy rotation and request throttling, to handle scraping restrictions.

Integration with pipelines is another key aspect of the role, where you will deliver clean, structured datasets into NoSQL stores and ETL pipelines for further enrichment and graph-based storage.

Additionally, you will ensure data accuracy, deduplicate records, and maintain a trust scoring system for data confidence.

Requirements:
  • Technical Skills:
    • Experience in web scraping, crawling, or data collection.
    • Strong proficiency in Python (libraries like BeautifulSoup, Scrapy, Selenium, Playwright, Requests).
    • Familiarity with NoSQL databases (MongoDB, DynamoDB) and data serialization formats (JSON, CSV, Parquet).
    • Experience in handling large-scale scraping with proxy management and rate-limiting.
    • Basic knowledge of ETL processes and integration with data pipelines.
    • Exposure to graph databases (Neo4j) is a plus.
    • Soft Skills:
      • Detail-oriented, ensuring accuracy and reliability of collected data.
      • Strong problem-solving skills, particularly in adapting scrapers to evolving web structures.
      • Curious mindset with a drive to discover new data sources.
      • Comfortable working in a fast-paced environment.

This is an exciting opportunity to leverage technology in the fight against fraud and build something impactful from day one.



  • Gandhinagar, Gujarat, India beBeeDataExtractor Full time ₹ 9,00,000 - ₹ 15,00,000

    Job Title: Data Extractor Engineer">We are seeking a skilled Data Extractor Engineer to join our team. As a key member of our data extraction team, you will be responsible for designing, developing, and maintaining web crawlers to extract valuable insights from the web.">The ideal candidate will have strong Python programming skills and experience in web...


  • Gandhinagar, Gujarat, India beBeeConsultant Full time ₹ 20,00,000 - ₹ 25,00,000

    Job DescriptionWe are seeking a skilled consultant to design and implement end-to-end data models and pipelines in SAP Datasphere.The ideal candidate will have experience in integrating SAP and non-SAP sources using Data Provisioning and Federation tools.They will also be responsible for developing views and data layers using SAP BW/4HANA and SAP HANA...


  • Gandhinagar, Gujarat, India beBeeDataIntegration Full time ₹ 3,00,00,000 - ₹ 3,50,00,000

    Job Title:SAP DataSphere ConsultantRole Overview:As a seasoned SAP Datasphere Consultant, we are seeking an expert with a proven track record of designing, building, and optimizing data integration flows using SAP DataSphere.This role involves collaborating with cross-functional teams to develop strategic data integration solutions that drive business growth...


  • Gandhinagar, Gujarat, India beBeeBlockchain Full time ₹ 1,80,00,000 - ₹ 2,50,00,000

    About the Role:As a highly skilled engineer, you will play a key role in transforming raw on-chain data into actionable insights by decoding smart contract events and implementing pricing logic from decentralized exchanges (DEXs).This is an exciting opportunity to work in a fast-paced, remote-first environment with a highly technical team.We are looking for...