AI Systems Engineer

15 hours ago


Gurgaon, Haryana, India | Shunya Labs | Full time | ₹ 12,00,000 - ₹ 36,00,000 per year

About Us

Shunya Labs is building the Voice AI Infrastructure Layer for Enterprises, powering speech intelligence, conversational agents, and domain-specific voice applications across industries. Born from deep work in mental-health AI and built for global enterprise scale, our stack combines state-of-the-art ASR/TTS models with an open-weights philosophy, driving accuracy, privacy, and scalability.

About the Role

We're seeking an AI Systems Engineer who thrives at the intersection of AI model optimization, infrastructure engineering, and applied research.

You will evaluate, host, and optimize a wide range of AI models, spanning ASR, LLMs, and multimodal systems, and build the orchestration layer that powers scalable, low-latency deployments.

This is a role for someone who's comfortable navigating ambiguity, researching emerging AI methods, and translating client requirements into robust, production-ready solutions.

You'll work across the full stack, from GPU inference tuning to React-based control dashboards, building a resilient and scalable AI delivery platform.

Key Responsibilities

AI Model Evaluation & Optimization

·      Evaluate, benchmark, and optimize AI models (speech, text, vision, multimodal) for latency, throughput, and accuracy.

·      Implement advanced inference optimizations using ONNX Runtime, TensorRT, quantization, and GPU batching.

·      Continuously research and experiment with the latest AI runtimes, serving frameworks, and model architectures.

·      Develop efficient caching and model loading strategies for multi-tenant serving.

AI Infrastructure & Orchestration

·      Design and develop a central orchestration layer to manage multi-model inference, load balancing, and intelligent routing.

·      Build scalable, fault-tolerant deployments using AWS ECS/EKS, Lambda, and Terraform.

·      Use Kubernetes autoscaling and GPU node optimization to minimize latency under dynamic load.

·      Implement observability and monitoring (Prometheus, Grafana, CloudWatch) across the model-serving ecosystem.

DevOps, CI/CD & Automation

·      Build and maintain CI/CD pipelines for model integration, updates, and deployment (GitHub Actions, CodePipeline, etc.).

·      Manage Dockerized environments, version control, and GPU-enabled build pipelines.

·      Ensure reproducibility and resilience through infrastructure-as-code and automated testing.

Frontend & Developer Tools

·      Create React-based dashboards for performance visualization, latency tracking, and configuration control.

·      Build intuitive internal tools for model comparison, experiment management, and deployment control.

·      Utilize Cursor, VS Code, and other AI-powered development tools to accelerate iteration.

Client Interaction & Solutioning

·      Work closely with clients and internal stakeholders to gather functional and performance requirements.

·      Translate abstract business needs into deployable AI systems with measurable KPIs.

·      Prototype quickly, iterate with feedback, and deliver robust production systems.

Research & Continuous Innovation

·      Stay on top of the latest AI research and model releases (OpenAI, Anthropic, Hugging Face, Meta, etc.).

·      Evaluate emerging frameworks for model serving, fine-tuning, and retrieval (LangChain, LlamaIndex, GraphRAG, etc.).

·      Proactively identify and implement performance or cost improvements in the model serving stack.

·      Share learnings and contribute to the internal AI knowledge base.

Ambiguous Problem Solving

·      Work effectively in undefined problem spaces, identifying optimal paths forward through experimentation.

·      Break down high-level goals into actionable technical strategies.

·      Balance trade-offs between accuracy, latency, and cost while innovating under uncertainty.

Required Skills

·      Strong proficiency in Python, TypeScript/JavaScript, Bash, and modern software development practices.

·      Deep understanding of Docker, Kubernetes, Terraform, and AWS (ECS, Lambda, S3, CloudFront).

·      Experience with inference optimization (ONNX, TensorRT, quantization, batching).

·      Proven ability to design and scale real-time inference pipelines.

·      Experience building and maintaining CI/CD pipelines and monitoring systems.

·      Hands-on experience with React or similar frameworks for dashboard/UI development.

·      Strong grasp of API design, load balancing, and GPU resource management.

Nice to Have

·      Experience with LangChain, LlamaIndex, GraphRAG, or vector databases (FAISS, Neo4j).

·      Familiarity with speech processing models (Whisper, Silero, NeMo, etc.).

·      Prior work with serverless inference or edge AI architectures.

·      Knowledge of data pipelines, model versioning, and MLOps best practices.

Soft Skills

·      Excellent problem-solving in ambiguous, evolving environments.

·      Strong ability to research, self-learn, and prototype emerging AI technologies.

·      Confident communicator who can translate technical findings into business impact.

·      Ownership mindset with a collaborative, solution-oriented approach.


