
Scalable Data Architect
23 hours ago
As we build our platform from scratch, you will have the opportunity to shape our technology vision and architecture.
This role is ideal for someone who thrives on early-stage challenges and loves building scalable solutions from day zero.
Key Responsibilities:- Web Scraping & Crawling: Design and develop automated scrapers to extract structured and unstructured data from websites, APIs, and public datasets.
- Scalable Scraping Systems: Develop multi-threaded, distributed crawlers capable of handling high-volume data collection without interruptions.
- Data Parsing & Cleaning: Normalize scraped data, remove noise, and ensure consistency before passing to data pipelines.
- Anti-bot & Evasion Tactics: Implement proxy rotation, captcha solving, and request throttling techniques to handle scraping restrictions.
- Integration with Pipelines: Deliver clean, structured datasets into NoSQL stores and ETL pipelines for further enrichment and graph-based storage.
- Data Quality & Validation: Ensure data accuracy, deduplicate records, and maintain a trust scoring system for data confidence.
- Documentation & Maintenance: Keep scrapers updated when websites change, and document scraping logic for reproducibility.
- Data Quality & Validation: Ensure data accuracy, deduplicate records, and maintain a trust scoring system for data confidence.
- Integration with Pipelines: Deliver clean, structured datasets into NoSQL stores and ETL pipelines for further enrichment and graph-based storage.
- Anti-bot & Evasion Tactics: Implement proxy rotation, captcha solving, and request throttling techniques to handle scraping restrictions.
- 2+ years of experience in web scraping, crawling, or data collection.
- Strong proficiency in Python (libraries like BeautifulSoup, Scrapy, Selenium, Playwright, Requests).
- Familiarity with NoSQL databases (MongoDB, DynamoDB) and data serialization formats (JSON, CSV, Parquet).
- Experience in handling large-scale scraping with proxy management and rate-limiting.
- Basic knowledge of ETL processes and integration with data pipelines.
- Exposure to graph databases (Neo4j) is a plus.
- We're united by a single, critical mission - stopping fraud before it impacts businesses.
- Curiosity, innovation, and proactive action define our approach.
- We value transparency, collaboration, and individual ownership, creating an environment where talented people can do their best work.
- Data Parsing & Cleaning: Normalize scraped data, remove noise, and ensure consistency before passing to data pipelines.
- Scalable Scraping Systems: Develop multi-threaded, distributed crawlers capable of handling high-volume data collection without interruptions.
-
Scalable Data Solutions Engineer
4 days ago
Ghaziabad, Uttar Pradesh, India beBeeData Full time ₹ 1,80,00,000 - ₹ 2,50,00,000Cloud Data Solutions ArchitectWe are seeking a skilled and motivated individual to build scalable data pipelines and cloud-native data solutions on Google Cloud Platform.Key Responsibilities:Design, develop, and optimize robust data ingestion pipelines using GCP services such as Pub/Sub, Dataflow, and Cloud Storage.Architect and manage scalable BigQuery data...
-
Data Architect
2 weeks ago
Ghaziabad, Uttar Pradesh, India beBeeData Full time ₹ 1,00,00,000 - ₹ 2,00,00,000Data Architect PositionWe are seeking a skilled professional to design, develop and maintain large-scale data systems using distributed computing frameworks.Key Responsibilities:Evaluate existing system architectures for scalability and efficiency.Implement database performance tuning techniques to improve system speed.Collaborate with cross-functional teams...
-
Data Architect
2 weeks ago
Ghaziabad, Uttar Pradesh, India beBeeDataArchitect Full time ₹ 15,00,000 - ₹ 25,00,000Job Title: Data ArchitectWe are seeking a highly skilled and experienced Data Architect to build and design scalable data systems using Azure Databricks, PySpark, and Delta Lake.Collaborate with cross-functional teams to develop high-quality database solutions for the organization.Create and implement scalable data pipelines with Azure Databricks by...
-
Senior Cloud Developer
2 weeks ago
Ghaziabad, Uttar Pradesh, India beBeeBackend Full time ₹ 1,80,00,000 - ₹ 2,20,00,000Job Summary:We are seeking a highly skilled developer to fill the role of Senior Full Stack Backend Developer. This position requires expertise in AWS architecture, Node.js backend development, and scalable database design.Key Responsibilities:Design, build, and optimize scalable APIs using Node.js and AWS services.Manage cloud infrastructure and drive...
-
Chief Data Architect
2 weeks ago
Ghaziabad, Uttar Pradesh, India beBeeData Full time US$ 2,00,000 - US$ 2,50,000Senior Data EngineerOur mission is to make nutritious food accessible and affordable for everyone. As a critical member of our team, you will be pivotal in designing, building, and maintaining highly scalable data pipelines, optimizing data delivery, and automating data processes.About the RoleThis role is responsible for constructing and optimizing our data...
-
Lead Data Architect
5 days ago
Ghaziabad, Uttar Pradesh, India beBeeData Full time ₹ 25,00,000 - ₹ 35,00,000Senior Data Engineer PositionThis role entails leading the design and implementation of scalable, high-performance data pipelines using cloud-based data warehousing platforms. Key responsibilities include architecting modular ELT pipelines, designing layered data models, and optimizing database performance for scalability and cost-effectiveness.Develop and...
-
Data Model Architect
7 days ago
Ghaziabad, Uttar Pradesh, India beBeeMachineLearning Full time ₹ 20,00,000 - ₹ 25,00,000Job Title: Data Model ArchitectWe are seeking an experienced data model architect to join our team of consultants.As a data model architect at Dexian, you will be responsible for designing, developing, and deploying machine learning models and algorithms to solve complex business challenges with Databricks.You will work closely with data scientists, data...
-
Cloud Data Architect
7 days ago
Ghaziabad, Uttar Pradesh, India beBeeCloud Full time ₹ 18,00,000 - ₹ 22,00,000**Job Title:** Cloud Data ArchitectJob Description:We are seeking a skilled cloud data architect to join our team. As a cloud data architect, you will design and implement scalable cloud-based data architectures that meet the needs of our business.Required Skills and Qualifications:To be successful in this role, you will need to have:Hands-on experience with...
-
Data Architect
2 weeks ago
Ghaziabad, Uttar Pradesh, India Veritis Group Inc Full timePosition Title: Data ArchitectLocation: HyderabadEmployment Type: Full-TimeMode: Work from OfficeWorking Shift: UK but might change based on business needAbout the RoleWe are looking for a hands-on Data Architect who can own the end-to-end data architecture strategy while also actively engage in the design, development, and optimization of our data...
-
Big Data Solutions Architect
5 days ago
Ghaziabad, Uttar Pradesh, India beBeeData Full time ₹ 2,00,00,000 - ₹ 3,00,00,000Job Title: Strategic Data ArchitectThe role entails designing, developing, and overseeing scalable data infrastructure and pipelines to drive business growth.Key Responsibilities:Lead a team of data engineers in implementing large-scale data solutions.Design and implement architecture for data pipelines ensuring performance, reliability, and...