Data Platform Engineer
1 day ago
Job Summary:
BharatGen is on a mission to create AI that truly represents the diversity, culture, and unique context of India. At the heart of this mission lies the need for robust, scalable infrastructure to build multilingual and multimodal datasets that power foundational AI models. We're seeking a skilled Data Platform Engineer to build scalable tools, platforms, and pipelines tailored for processing large-scale, multilingual, multimodal datasets critical for foundational AI models.
In this role, you will build scalable data pipelines to ingest, transform, and prepare data from diverse sources—text, speech, images, and video—making it ready for Generative AI model training. Your work will involve developing and managing the underlying platform while addressing challenges like governance, security, observability, lineage, and scalability. The outcomes of your work will include efficient tools for data processing, a reliable data platform, and high-quality datasets tailored to the evolving needs of large-scale AI and LLM training.
Collaborating closely with researchers and ML engineers, you will play a pivotal role in enabling BharatGen to deliver state-of-the-art AI models, contributing to the advancement of India's AI ecosystem through innovative data engineering solutions.
Key Responsibilities:
- Design and Build Scalable Platforms: Develop distributed infrastructure for ingesting, processing, and transforming diverse datasets (text, speech, images, video) at terabyte to petabyte scale.
- Develop Robust Data Pipelines: Create reliable, scalable pipelines to prepare datasets for Generative AI and LLM training.
- Implement Governance and Observability: Build frameworks for data lineage, monitoring, and access control to ensure data quality and operational reliability.
- Optimize Performance and Cost: Enhance platform performance and resource utilization using cost-effective strategies, including GPU-accelerated preprocessing.
- Collaborate and Innovate: Work closely with researchers and ML engineers to adapt platforms and data pipelines to evolving LLM requirements, addressing various data challenges.
- Drive Innovation: Stay updated on emerging tools, frameworks, and best practices to implement cutting-edge solutions for large-scale dataset creation.
Minimum Qualifications and Experience:
- Bachelor's or Master's degree in Computer Science, Data Engineering, or a related field with 3+ years of industry experience.
Required Skills:
- Proficiency in distributed systems and frameworks (e.g., Kafka, Ray, PySpark) for scalable data workflows.
- Exposure to end-to-end data lifecycle management, including DataOps.
- Strong programming skills in Python, Scala, or Go, with a focus on high-performance pipeline development.
- Experience with building and optimizing data pipelines, including ETL processes, data modeling, and integration into scalable workflows.
- Expertise in data scraping, crawling frameworks, and modern dataset development techniques such as synthetic data generation techniques.
- Experience with cloud platforms (AWS, GCP, Azure) and container orchestration (Docker, Kubernetes).
- Deep understanding of data platform design, including data architecture, metadata tracking, data lineage, observability, monitoring, and scalability best practices.
- Familiarity with Infrastructure-as-Code tools (e.g., Terraform, CloudFormation), CI/CD pipelines, relational/NoSQL databases, and GPU-accelerated workflows.
- Familiarity with visualization and monitoring tools for lifecycle management and pipeline performance tracking.
- Expertise in managing unstructured data (text, speech, or multimodal datasets) for high-performance use cases, ideally in the context of LLM/AI datasets.
- Understanding of challenges in scalable data engineering, including ingestion, transformation, and storage optimization for large-scale accelerated workflows.
-
Data Platform Engineer
7 days ago
Mumbai Metropolitan Region, India Russell Investments Full time ₹ 6,00,000 - ₹ 12,00,000 per yearBusiness Unit:Global TechnologyReporting To:Senior Manager, Application DevelopmentShift:EMEA (1:30 pm - 10:30 pm IST) (India)About Russell Investments, Mumbai:Russell Investments is a leading outsourced financial partner and global investment solutions firm providing a wide range of investment capabilities to institutional investors, financial...
-
Data Engineer
1 day ago
Mumbai Metropolitan Region, India Adsremedy Full time ₹ 6,00,000 - ₹ 12,00,000 per yearAbout AdsremedyAtAdsremedy, we are revolutionizing the digital advertising landscape with cutting-edgeAdTech solutions. We offer services likeProgrammatic Advertising,Campaign Optimization,Advanced Analytics,Cross-Platform Campaigns, and specialize inIn App,CTVandDOOH ads. Our mission is to empower businesses to connect with audiences and drive measurable...
-
Senior Data Engineer
1 week ago
Mumbai Metropolitan Region, India Scouto AI Full time ₹ 20,00,000 - ₹ 25,00,000 per yearAbout The OpportunityA high-growth enterprise in the Cloud Data & Analytics (SaaS) sector delivering scalable analytics, reporting, and real-time insights to global customers. The team builds robust data platforms that enable product analytics, BI, ML, and operational reporting across high-throughput data flows.Location: Mumbai, Maharashtra, India —...
-
Senior Data Engineer
7 days ago
Mumbai Metropolitan Region, India Kroll Full time ₹ 12,00,000 - ₹ 36,00,000 per yearWe're seeking a Senior Data Analyst who combines strong analytical insight with hands-on data engineering skills. You will design and maintain data pipelines, optimize data models, and develop reporting solutions that enable reliable analytics and governance at scale. This is an individual contributor role where technical depth, analytical thinking, and...
-
DevOps/Infrastructure Engineer
1 week ago
Mumbai Metropolitan Region, India NTT DATA North America Full time ₹ 20,00,000 - ₹ 25,00,000 per yearReq ID:341953NTT DATA strives to hire exceptional, innovative and passionate individuals who want to grow with us. If you want to be part of an inclusive, adaptable, and forward-thinking organization, apply now.We are currently seeking a DevOps/Infrastructure Engineer to join our team in Mumbai, Mahārāshtra (IN-MH), India (IN).Job Duties: This is a DevOps...
-
Associate Platform Reliability Engineer
3 days ago
Mumbai Metropolitan Region, India Jefferies Full time ₹ 9,00,000 - ₹ 12,00,000 per yearOverviewJOB DESCRIPTIONWe are seeking a hands-on, technically skilled professional to join our global team as an Associate Platform Reliability Engineer. This role is critical to ensuring the stability, reliability, and resilience of Jefferies' front-to-back technology infrastructure, with a focus on post-trade processing, operations, and regulatory...
-
Platform Engineer
2 weeks ago
Mumbai Metropolitan Region, India Interactive Brokers Full time ₹ 5,00,000 - ₹ 15,00,000 per yearCompany OverviewInteractive Brokers Group, Inc. (Nasdaq: IBKR) is a global financial services company headquartered in Greenwich, CT, USA, with offices in over 15 countries. We have been at the forefront of financial innovation for over four decades, known for our cutting-edge technology and client commitment.IBKR affiliates provide global electronic...
-
Data Engineer Intern
1 day ago
Mumbai Metropolitan Region, India NextGen Digital Solutions - NDS Full time ₹ 9,00,000 - ₹ 12,00,000 per yearJob Title:Data Engineer Intern (Fabric & Power BI)Location:Navi MumbaiCompany:NextGen Digital Solutions (NDS)Internship Duration:3 MonthsCompany DescriptionNextGen Digital Solutions (NDS) is a Microsoft Solution Partner specializing in consulting, implementation, and support services for digital transformation with Automation, Low Code, Analytics & AI. NDS...
-
Senior Data Engineer
1 week ago
Mumbai Metropolitan Region, India Albatronix Full time ₹ 15,00,000 - ₹ 25,00,000 per yearWe are seeking a Senior Data Engineer to join an on-site engineering team in Goregaon, Mumbai. This role is ideal for hands-on engineers who design, build, and operate scalable, production-grade data pipelines and analytical platforms that serve machine learning, BI, and real-time streaming use cases.Role & ResponsibilitiesDesign and implement tailored data...
-
Senior Data Engineer
2 weeks ago
Mumbai Metropolitan Region, India Golden Legand Leasing and Finance Ltd. Full time ₹ 12,00,000 - ₹ 36,00,000 per yearAbout The RoleWe are building an independent AI team to power next-generation data-driven solutions for AashaPurti (Personal & Gold Loans) and India Online Pay (Payment Gateway). As a Data Engineer, you will play a key role in designing and implementing scalable data pipelines, data lakes, and ETL workflows to support AI models, real-time analytics, and BI...