TIH-IoT | Data Platform Engineer
4 weeks ago
Key Responsibilities:Design and Build Scalable Platforms: Develop distributed infrastructure for ingesting, processing, and transforming diverse datasets (text, speech, images, video) at terabyte to petabyte scale.Develop Robust Data Pipelines: Create reliable, scalable pipelines to prepare datasets for Generative AI and LLM training.Implement Governance and Observability: Build frameworks for data lineage, monitoring, and access control to ensure data quality and operational reliability.Optimize Performance and Cost: Enhance platform performance and resource utilization using cost-effective strategies, including GPU-accelerated preprocessing.Collaborate and Innovate: Work closely with researchers and ML engineers to adapt platforms and data pipelines to evolving LLM requirements, addressing various data challenges.Drive Innovation: Stay updated on emerging tools, frameworks, and best practices to implement cutting-edge solutions for large-scale dataset creation.
Minimum Qualifications and Experience:Education:Bachelor’s or Master’s degree in Computer Science, Data Engineering, or a related field.(Preferred) Advanced degrees or certifications in Distributed Systems, Data Engineering, or Big Data technologies
Experience and Expertise:3+ years of overall industry experience in engineering roles, demonstrating strong foundations in software development, systems engineering, or related disciplines.1+ years of specific hands-on experience in developing large-scale, distributed data pipelines and platforms, preferably in high-performance AI or ML environments.Expertise in managing unstructured data (text, speech, or multimodal datasets) for high-performance use cases, ideally in the context of LLM/AI datasets.Understanding of challenges in scalable data engineering, including ingestion, transformation, and storage optimization for large-scale accelerated workflows.
Skills:1.TechnicalProficiency in distributed systems and frameworks (e.g., Kafka, Ray, PySpark) for scalable data workflows.Exposure to end-to-end data lifecycle management, including DataOps.Strong programming skills in Python, Scala, or Go, with a focus on high-performance pipeline development.Experience with building and optimizing data pipelines, including ETL processes, data modeling, and integration into scalable workflows.Expertise in data scraping, crawling frameworks, and modern dataset development techniques such as synthetic data generation techniques.Experience with cloud platforms (AWS, GCP, Azure) and container orchestration (Docker, Kubernetes).Deep understanding of data platform design, including data architecture, metadata tracking, data lineage, observability, monitoring, and scalability best practices.Familiarity with Infrastructure-as-Code tools (e.g., Terraform, CloudFormation), CI/CD pipelines, relational/NoSQL databases, and GPU-accelerated workflows.Familiarity with visualization and monitoring tools for lifecycle management and pipeline performance tracking.
2.Soft SkillsAdaptability and innovation in fast-paced, dynamic environments.Strong collaboration skills for interdisciplinary teamwork.Proactive problem-solving and a growth mindset to thrive in a mission-driven organization.
-
AI Data Infrastructure Specialist
4 weeks ago
Mumbai, Maharashtra, India TIH-IoT Full timeAbout the RoleWe are seeking an experienced AI Data Infrastructure Specialist to join our team at TIH-IoT. As a key member of our engineering department, you will be responsible for designing and building scalable data platforms to support the development of cutting-edge AI models.This is a unique opportunity to work on high-profile projects and collaborate...
-
TIH-IoT | Data Platform Engineer
4 weeks ago
mumbai, India TIH-IoT Full timeJob Summary:BharatGen is on a mission to create AI that truly represents the diversity, culture, and unique context of India. At the heart of this mission lies the need for robust, scalable infrastructure to build multilingual and multimodal datasets that power foundational AI models. We’re seeking a skilled Data Platform Engineer to build scalable tools,...
-
TIH-IoT | Data Platform Engineer
4 weeks ago
mumbai, India TIH-IoT Full timeJob Summary:BharatGen is on a mission to create AI that truly represents the diversity, culture, and unique context of India. At the heart of this mission lies the need for robust, scalable infrastructure to build multilingual and multimodal datasets that power foundational AI models. We’re seeking a skilled Data Platform Engineer to build scalable tools,...
-
TIH-IoT | Data Platform Engineer
4 weeks ago
mumbai, India TIH-IoT Full timeJob Summary: BharatGen is on a mission to create AI that truly represents the diversity, culture, and unique context of India. At the heart of this mission lies the need for robust, scalable infrastructure to build multilingual and multimodal datasets that power foundational AI models. We’re seeking a skilled Data Platform Engineer to build scalable...
-
Senior AI Architect
4 weeks ago
Mumbai, Maharashtra, India TIH-IoT Full timeJob SummaryWe are seeking a highly experienced Senior AI Architect to lead the development of our text and speech systems. The ideal candidate will have a strong background in AI, machine learning, and software engineering, with a proven track record of designing and deploying large-scale AI models.About the RoleAs a Senior AI Architect at TIH-IoT, you will...
-
Intern - Business Operations
6 months ago
Mumbai, India TIH-IoT, IIT Bombay Full time**Job Description -** - Data compiling in excel. - Document checking/verification. - Document print, scan, file, courier and organise. - Collecting data as and when required. - Assist for an offline course or course related activity. **Job Duration - **3 months **Shift -** Day Shift **Working Days - **Monday to Friday **Location of work - **TIH-IoT, IIT...
-
TIH-IoT | Senior Executive
1 month ago
Mumbai, India TIH-IoT Full timeJob Description:Manage the general accounting functions, including, but not limited to: accounts payable, accounts receivable, general ledger, Income tax and GST related mattersMonitoring and analyzing accounting data and produce financial reports or statementsEstablishing and enforcing proper accounting methods, policies and principlesOrganising and...
-
TIH-IoT | Senior Executive
1 month ago
mumbai, India TIH-IoT Full timeJob Description:Manage the general accounting functions, including, but not limited to: accounts payable, accounts receivable, general ledger, Income tax and GST related mattersMonitoring and analyzing accounting data and produce financial reports or statementsEstablishing and enforcing proper accounting methods, policies and principlesOrganising and...
-
Data platform engineer
2 weeks ago
Mumbai, India TIH-IoT Full timeJob Summary: Bharat Gen is on a mission to create AI that truly represents the diversity, culture, and unique context of India. At the heart of this mission lies the need for robust, scalable infrastructure to build multilingual and multimodal datasets that power foundational AI models. We’re seeking a skilled Data Platform Engineer to build scalable...
-
Data platform engineer
4 weeks ago
Mumbai, India TIH-IoT Full timeJob Summary:Bharat Gen is on a mission to create AI that truly represents the diversity, culture, and unique context of India. At the heart of this mission lies the need for robust, scalable infrastructure to build multilingual and multimodal datasets that power foundational AI models. We’re seeking a skilled Data Platform Engineer to build scalable tools,...
-
Data Platform Engineer
4 weeks ago
Mumbai, India TIH-IoT Full timeJob Summary:BharatGen is on a mission to create AI that truly represents the diversity, culture, and unique context of India. At the heart of this mission lies the need for robust, scalable infrastructure to build multilingual and multimodal datasets that power foundational AI models. We’re seeking a skilled Data Platform Engineer to build scalable tools,...
-
Data Platform Engineer
4 weeks ago
Mumbai, India TIH-IoT Full timeJob Summary:BharatGen is on a mission to create AI that truly represents the diversity, culture, and unique context of India. At the heart of this mission lies the need for robust, scalable infrastructure to build multilingual and multimodal datasets that power foundational AI models. We’re seeking a skilled Data Platform Engineer to build scalable tools,...
-
Data Platform Engineer
4 weeks ago
Mumbai, India TIH-IoT Full timeJob Summary: BharatGen is on a mission to create AI that truly represents the diversity, culture, and unique context of India. At the heart of this mission lies the need for robust, scalable infrastructure to build multilingual and multimodal datasets that power foundational AI models. We’re seeking a skilled Data Platform Engineer to build scalable...
-
[15h Left] Data Platform Engineer
2 weeks ago
Mumbai, India TIH-IoT Full timeJob Summary:BharatGen is on a mission to create AI that truly represents the diversity, culture, and unique context of India. At the heart of this mission lies the need for robust, scalable infrastructure to build multilingual and multimodal datasets that power foundational AI models. We’re seeking a skilled Data Platform Engineer to build scalable tools,...
-
Large Scale Data Platform Architect
1 day ago
Mumbai, Maharashtra, India TIH | IIT Bombay Full timeJob SummaryBharatGen is committed to developing AI that represents the diversity and culture of India. To achieve this mission, we need a robust infrastructure for building multilingual and multimodal datasets that power foundational AI models. We're seeking an experienced Large Scale Data Platform Architect to design scalable tools, platforms, and pipelines...
-
TIH-IoT | Generative AI – Tech Lead
4 weeks ago
mumbai, India TIH-IoT Full timeJob Description:• Technical Leadership:- Lead, mentor, and inspire a team of AI researchers and engineers, fostering a culture of innovation and technical excellence.- Define and execute the technical roadmap and strategy for generative AI projects, ensuring alignment with organizational goals.- Architect, develop, and optimize large-scale generative AI...
-
TIH-IoT | Generative AI – Tech Lead
4 weeks ago
mumbai, India TIH-IoT Full timeJob Description: • Technical Leadership: - Lead, mentor, and inspire a team of AI researchers and engineers, fostering a culture of innovation and technical excellence. - Define and execute the technical roadmap and strategy for generative AI projects, ensuring alignment with organizational goals. - Architect, develop, and optimize large-scale generative...
-
AI Innovator
1 month ago
Mumbai, Maharashtra, India TIH Foundation for IoT & IoE, IIT Bombay Full timeAbout Us:TIH Foundation for IoT & IoE, IIT Bombay is at the forefront of Generative AI innovation, dedicated to addressing India's unique challenges through cutting-edge technology. Our mission is to develop a suite of generative AI technology and solutions that capture and reflect the rich linguistic, cultural, socio-economic, and industry-specific...
-
Lead AI Engineer
16 hours ago
Mumbai, Maharashtra, India TIH | IIT Bombay Full timeJob OverviewWe are seeking a highly skilled Lead AI Engineer to join our team at TIH | IIT Bombay. This role will involve leading the development of large-scale generative AI models, particularly in the text and speech domains.
-
TIH | IIT Bombay | Data Platform Engineer
3 days ago
mumbai, India TIH | IIT Bombay Full timeJob Summary: BharatGen is on a mission to create AI that truly represents the diversity, culture, and unique context of India. At the heart of this mission lies the need for robust, scalable infrastructure to build multilingual and multimodal datasets that power foundational AI models. We’re seeking a skilled Data Platform Engineer to build scalable...