Data Scientist

1 week ago


Bengaluru Karnataka, India Waayslive Solution Full time

DS (Vector Search + GCP )
- Bangalore

Bangalore

Data/Applied scientist (Search)
- Strong in Python and experience with Jupyter notebooks, Python packages like

polars, pandas, numpy, scikit-learn, matplotlib, etc.
- Must have: Experience with machine learning lifecycle, including data

preparation, training, evaluation, and deployment
- Must have: Hands-on experience with GCP services for ML & data science
- Must have: Experience with Vector Search, Hybrid Search techniques, Query preprocessing
- Must have: Experience with embeddings generation using models like BERT, Sentence

Transformers, or custom models
- Must have: Experience in embedding indexing and retrieval (e.g.,

Elastic, FAISS, ScaNN, Annoy)
- Must have: Experience with LLMs and use cases like RAG (Retrieval-Augmented Generation)
- Must have: Understanding of semantic vs lexical search paradigms
- Must have: Experience with Learning to Rank (LTR) techniques and libraries (e.g., XGBoost,

LightGBM with LTR support)
- Should be proficient in SQL and BigQuery for analytics and feature generation
- Should have experience with Dataproc clusters for distributed data processing using Apache

Spark or PySpark
- Should have experience deploying models and services using Vertex AI, Cloud Run, or Cloud

Functions
- Should be comfortable working with BM25 ranking (via Elasticsearch or OpenSearch) and

blending with vector-based approaches
- Good to have: Familiarity with Vertex AI Matching Engine for scalable vector retrieval
- Good to have: Familiarity with TensorFlow Hub, Hugging Face, or other model repositories
- Good to have: Experience with prompt engineering, context windowing, and embedding

optimization for LLM-based systems
- Must have: Awareness of evaluation metrics for search relevance
- Should have exposure to CI/CD pipelines and model versioning practices

GCP Tools Experience:
ML & AI: Vertex AI, Vertex AI Matching Engine, AutoML, AI Platform

Storage: BigQuery, Cloud Storage, Firestore

Ingestion: Pub/Sub, Cloud Functions, Cloud Run

Search: Vector Databases (e.g., Matching Engine, Qdrant on GKE), Elasticsearch/OpenSearch

Compute: Cloud Run, Cloud Functions, Vertex Pipelines, Cloud Dataproc (Spark/PySpark)

CI/CD & IaC: GitLab/GitHub Actions

EXPERTISE AND QUALIFICATIONS

Data/Applied scientist (Search)
- Strong in Python and experience with Jupyter notebooks, Python packages like

polars, pandas, numpy, scikit-learn, matplotlib, etc.
- Must have: Experience with machine learning lifecycle, including data

preparation, training, evaluation, and deployment
- Must have: Hands-on experience with GCP services for ML & data science
- Must have: Experience with Vector Search, Hybrid Search techniques, Query preprocessing
- Must have: Experience with embeddings generation using models like BERT, Sentence

Transformers, or custom models
- Must have: Experience in embedding indexing and retrieval (e.g.,

Elastic, FAISS, ScaNN, Annoy)
- Must have: Experience with LLMs and use cases like RAG (Retrieval-Augmented Generation)
- Must have: Understanding of semantic vs lexical search paradigms
- Must have: Experience with Learning to Rank (LTR) techniques and libraries (e.g., XGBoost,

LightGBM with LTR support)
- Should be proficient in SQL and BigQuery for analytics and feature generation
- Should have experience with Dataproc clusters for distributed data processing using Apache

Spark or PySpark
- Should have experience deploying models and services using Vertex AI, Cloud Run, or Cloud

Functions
- Should be comfortable working with BM25 ranking (via Elasticsearch or OpenSearch) and

blending with vector-based approaches
- Good to have: Familiarity with Vertex AI Matching Engine for scalable vector retrieval
- Good to have: Familiarity with TensorFlow Hub, Hugging Face, or other model repositories
- Good to have: Experience with prompt engineering, context windowing, and embedding

optimization for LLM-based systems
- Must have: Awareness of evaluation metrics for search relevance
- Should have exposure to CI/CD pipelines and model versioning practices

GCP Tools Experience:
ML & AI: Vertex AI, Vertex AI Matching Engine, AutoML, AI Platform

Storage: BigQuery, Cloud Storage, Firestore

Ingestion: Pub/Sub, Cloud Functions, Cloud Run

Search: Vector Databases (e.g., Matching Engine, Qdrant on GKE), Elasticsearch/OpenSearch

Compute: Cloud Run, Cloud Functions, Vertex Pipelines, Cloud Dataproc (Spark/PySpark)

CI/CD & IaC: GitLab/GitHub Actions

Pay: Up to ₹1,700,000.00 per year

Work Location: In person


  • Data Scientists

    5 days ago


    Bengaluru, Karnataka, India NTT DATA Full time

    **Req ID**: 276519 We are currently seeking a Data Scientists to join our team in Bangalore, Karnātaka (IN-KA), India (IN). Data Scientists Responsibilities - Programming Languages (Pyton, R), Data Manipulation and Analysis Tools, Machine Learning Libraries, Data Visualization tools, Cloud Platforms Programming Languages (Pyton, R), Data Manipulation and...

  • Data Scientist

    2 weeks ago


    Bengaluru, India NTT DATA Full time

    **Req ID**: 240734 We are currently seeking a Data Scientist to join our team in bangalore, Karnātaka (IN-KA), India (IN). Profile Data Scientist - Ability to work with various database types and connectors - Expert level skills in leveraging Alteryx to fetch, curate and publish data - Expert level hands-on experience building Tableau dashboards for...

  • Data Scientist

    1 week ago


    Bengaluru, Karnataka, India Gethired Global Full time

    Prompt Support Needed: Immediate Job Openings for Data Science Professionals Gethired Global is a leading business services provider, delivering a wide range of technology-enabled staffing and managed outsourcing solutions globally. We are currently actively recruiting Data Science professionals or experts for immediate positions at our client's...

  • Data Scientist

    2 weeks ago


    Bengaluru, Karnataka, India Affine Full time

    Job Description - **Expertise in Object-Oriented Python Programming**: Proficiency in Python programming is essential for Data Scientists, and specifically, expertise in object-oriented programming is crucial for developing efficient and scalable code. - **Pyspark**: Data Scientists should be proficient in using PySpark, which is a powerful tool for...

  • Data Scientist

    2 weeks ago


    Bengaluru, Karnataka, India Swedium Global Services Full time

    Data Scientist SwediumGlobal is seeking for experienced Data Scientists Location: Bangalore, India. Job Description: Data Scientist Role: - Work with data scientists and engineers to develop end-to-end machine learning solutions, including data pipelines, model training, and deployment via APIs. - Collaborate with key stakeholder to understand...

  • Lead Data Scientist

    2 weeks ago


    Bengaluru, Karnataka, India Enable Data Incorporated Full time ₹ 15,00,000 - ₹ 20,00,000 per year

    Enable Data Incorporated is looking for a talented and experienced Lead Data Scientist to join our innovative team. In this role, you will be responsible for driving data science initiatives, developing predictive models, and providing insights that guide key business decisions. The ideal candidate has a strong background in statistical analysis, machine...

  • Data Scientist

    2 weeks ago


    Bengaluru, Karnataka, India MiQ Digital Full time

    Location: Bengaluru **What you’ll do** We’re MiQ, a global programmatic media partner for marketers and agencies. Our people are at the heart of everything we do, so you will be too. No matter the role or the location, we’re all united in the vision to lead the programmatic industry and make it better. As part of the Data Scientist team under DnA,...

  • Data Scientist

    2 weeks ago


    Bengaluru, Karnataka, India Sciens Technologies Full time

    We are hiring for Core Data Scientist Work Mode - Hybrid(3days/week to office) Work Location -Onsite, Bangalore Experience: 8 + years Core **#DataScientist** and **#Algorithm** with 6/10 Python (core data scientist mínimal knowledge with python) Required Skills - **#MLEngineer**, **#Python**, **#OOP**, **#Optimization**,...

  • Lead Data Scientist

    5 days ago


    Bengaluru, Karnataka, India NTT DATA Full time

    NTT DATA strives to hire exceptional innovative and passionate individuals who want to grow with us If you want to be part of an inclusive adaptable and forward-thinking organization apply now We are currently seeking a Lead Data Scientist - Computer Vision Generative AI to join our team in Bangalore Karn taka IN-KA India IN We are seeking a...

  • Lead Data Scientist

    7 days ago


    Bengaluru, Karnataka, India NTT DATA Full time US$ 1,04,000 - US$ 1,30,878 per year

    NTT DATA strives to hire exceptional, innovative and passionate individuals who want to grow with us. If you want to be part of an inclusive, adaptable, and forward-thinking organization, apply now. We are currently seeking a Lead Data Scientist - Computer Vision & Generative AI to join our team in Bangalore, Karnātaka (IN-KA), India (IN). We are seeking...