Data Curation Associate-west Zone

1 week ago


Bengaluru Karnataka, India ArtPark - I-Hub for Robotics and Autonomous Systems Innovation Foundation Full time

**Location**: Bangalore

**Type**:Full time Consultant

**About the Team**:
You will be a part of the Language Data & AI team at ARTPARK and IISc.

You will be part of this core team at ARTPARK.

**Overview of the project**:
As part of an ambitious nation-wide program, you will help create unique, high-quality open-source speech and text datasets spanning every district to accelerate the state-of-the-art in NLP (Natural Language Processing) in Indian languages.

With several large projects under BhashaSetu, ARTPARK’s vision is to spearhead the creation of an inclusive digital-India through propel the AI advancements in Indic languages spanning projects in speech data collection, curation and advanced language modelling.

You will be part of the operations team which drives data collection and curation across all projects in BhashaSetu program, working closely with the ARTPARK team.

**Role & key responsibilities**:
You will be responsible for the data from one or more of the following states:
Gujarat

Rajasthan

Maharashtra

Goa

Understand requirements for data curation and its implications on the AI models built. Understanding the requirements thoroughly as and when guideline documentation is received.

Search and recruit the correct curation experts as required by the project through searching and contacts (e.g., NGOs, local institutes etc.) in the districts/area that you are managing

Design task flyers and find out all ways to reach the individuals and language experts(local in a district) who could be interested in data curation and quality checking

Contact (through phone call and WhatsApp) to applicants as well as those who did not

Host project awareness calls with potential experts to drive understanding of their tasks

Manage day-to-day curation operations for audio and transcription data

Training the recruited experts in required tasks. Provide relevant documentation for training

Assigning of daily workload basis their availability and closely coordinating to get the work done

Supervising their daily performance and review their work on a daily basis.

**Skills and background**:
Should be a native and local language speaker of the local language of the following districts:
Marathi
- (primarily from Washim, Gondia, Mumbai suburban)

Gujarati
- (primarily from Navsari, Valsad, Devbhoomi Dwarka, Gandhinagar)

Hindi
- (primarily from Umaria, Dhar, Katni, Bhopal)

Hindi, Rajasthani, Marwari
- (primarily from Barmer, Jaisalmer, Jaipur)

Should be good at verbal and written communication both in English and local language

Should be good with handling multiple people (remotely working) and get the task done by them.

Skills: Microsoft Office (Excel, Word, PowerPoint) and Google workspace (Docs, Sheets, Slides)

**Good-to-haves**:
Experience in data curation

Experience in speech data annotation and labelling

Experience in working with data sourcing and annotation companies

ARTPARK at IISc drives impact through innovations in AI & Robotics, by harnessing the best of research/academia, startups/industry, and government/nonprofits.

Our pioneering platform initiatives in language data & AI and health data & AI are driving national-scale impact with stakeholders such as MeitY’s Bhashini, Office of PSA, ICMR, States and Cities.

These platforms are in pursuit of our vision - AI for All.



  • Bengaluru, Karnataka, India ArtPark - I-Hub for Robotics and Autonomous Systems Innovation Foundation Full time

    **Location**: Bangalore **Type**:Full time Consultant **About the Team**: You will be a part of the Language Data & AI team at ARTPARK and IISc. You will be part of this core team at ARTPARK. **Overview of the project**: As part of an ambitious nation-wide program, you will help create unique, high-quality open-source speech and text datasets spanning...


  • Bengaluru, Karnataka, India ArtPark - I-Hub for Robotics and Autonomous Systems Innovation Foundation Full time

    **Location**: Bangalore **Type**:Full time Consultant **About the Team**: You will be a part of the Language Data & AI team at ARTPARK and IISc. You will be part of this core team at ARTPARK. **Overview of the project**: As part of an ambitious nation-wide program, you will help create unique, high-quality open-source speech and text datasets spanning...


  • Bengaluru, Karnataka, India ArtPark - I-Hub for Robotics and Autonomous Systems Innovation Foundation Full time

    **Location**: Bangalore **Type**:Full time Consultant **About the Team**: You will be a part of the Language Data & AI team at ARTPARK and IISc. You will be part of this core team at ARTPARK. **Overview of the project**: As part of an ambitious nation-wide program, you will help create unique, high-quality open-source speech and text datasets spanning...

  • Data Curation SME

    4 weeks ago


    Bengaluru, Karnataka, India Triomics Full time

    About Triomics:Triomics is building the modern technology stack for oncology trial sites and investigators that unifies the workflows of clinical care and clinical research, moving the healthcare industry closer to the vision of Clinical Research as a Care Option. Our platform, which is based on our proprietary oncology-focused large language model (OncoLLM)...

  • Data Sourcing

    1 week ago


    Bengaluru, Karnataka, India ArtPark - I-Hub for Robotics and Autonomous Systems Innovation Foundation Full time

    **Data Sourcing & Quality Associate***: **Bhasha Setu ( ** ** pan-India language data and AI initiatives)** ***: **ARTPARK (AI & Robotics Technology Park), IISc, Bangalore***: **As part of an ambitious India-wide program, you will help create unique, high-quality ** ** open-source ** ** speech and text datasets spanning every district to accelerate the...


  • Bengaluru, Karnataka, India West Pharmaceutical Services Full time

    At West we re a dedicated team that is connected by a purpose to improve patient lives that has been at the center of our Company for more than a century Our story began when Herman O West solved the problem of supplying penicillin in mass quantities to the US Government during World War 2 Through our work to deliver thousands of life-saving and...


  • Bengaluru, Karnataka, India beBeeExecutive Full time ₹ 7,50,000 - ₹ 10,00,000

    Job Overview">Catering to the ever-evolving demands of the culinary world, we seek an exceptional Category Executive to spearhead our restaurant data curation efforts. With a proven track record of delivering high-quality results and collaborating with cross-functional teams, this role promises a challenging yet rewarding experience.">Key...

  • Helpdesk Associate

    6 days ago


    Bengaluru, Karnataka, India NTT DATA Full time

    At NTT DATA, we know that with the right people on board, anything is possible. The quality, integrity, and commitment of our employees are key factors in our company's growth, market presence and our ability to help our clients stay a step ahead of the competition. By hiring the best people and helping them grow both professionally and personally, we ensure...


  • Bengaluru, Karnataka, India Turbostart Full time

    IF YOU ARE INTERESTED IN THIS JOB, PLEASE APPLY HERE: will not be looking at applicants via LinkedIn, but only applications submitted via the form above.Who are we:Turbostart is an early-stage venture fund backing high-potential startups from Pre-seed to Series A. Beyond being a VC fund, we're an innovative ecosystem that blends the functions of an...

  • Knowledge Curator

    6 days ago


    Bengaluru, India Mott MacDonald Full time

    **Job Profile** As a company we relentlessly focus on excellence and digital innovation. Managing our data, information and knowledge is key to improve outcomes and create better solutions for our clients. As a key member of the Knowledge, Information and Data management team, this role is critical to ensure that our project information is curated and shared...