AI Engineer

5 days ago

Chennai, Tamil Nadu, India Mako IT Lab Full time ₹ 12,00,000 - ₹ 36,00,000 per year

About MAKO
Founded in 2013, Mako IT Lab is a global software development company with a strong presence across the USA, UK, India, and Nepal. Over the years, we've partnered with companies globally helping them solve complex challenges and build meaningful digital experiences.

What truly defines Mako is our culture. We believe in creating an environment where people feel empowered to take ownership, exercise freedom in their ideas, and contribute to solutions that genuinely make an impact. Learning is at the heart of who we are—our teams constantly grow through hands-on exposure, real-world problem solving, and continuous knowledge sharing across functions and geographies.

We don't just build long-term partnerships with clients—we build long-term careers for our people. At Mako, you'll be part of a collaborative, supportive, and fast-growing global team where curiosity is encouraged, initiative is celebrated, and every individual plays a meaningful role in shaping the company's journey.

Role Overview
We are seeking an experienced AI Engineer with deep expertise in LLM-driven architectures, RAG systems, agentic workflows, and multimodal AI development. The ideal candidate will be skilled in building scalable AI pipelines using FastAPI, Kafka, FastMCP, and Tavily Web Search, while also having hands-on experience with vllm-based inference and Stable Diffusion pipelines.You will architect and implement intelligent systems leveraging Large Language Models, vision models, and autonomous agents, with a strong focus on observability, performance, and production reliability.

Key Responsibilities

LLM, VLLM & Agentic System Development
Build autonomous LLM agents using LangChain, LangGraph, and FastMCP.
Develop RAG workflows using embeddings, vector stores, and knowledge-grounded reasoning.
Integrate VLLM / SGLang / other high-throughput inference backends for low-latency model serving.
Implement Tavily web-search integrations for real-time knowledge augmentation.
Optimize inference using quantized GGUF, tensorized formats, and GPU-accelerated pipelines.
Multimodal & Image Generation Systems
Build and deploy Stable Diffusion (SDXL/SD 1.5/ControlNet/T2I) pipelines for image generation tasks.
Integrate LoRAs, control modules, and diffusion-based fine-tuning for custom domains.
Develop multimodal agents that combine LLM reasoning with vision tasks such as classification, captioning, or image prompts.
Backend & Infrastructure Engineering
Build robust FastAPI services for orchestrating LLMs, Stable Diffusion, retrieval, and agentic tasks.
Develop event-driven workflows using Kafka for distributed AI systems.
Implement auditing, agent-output monitoring, and API-layer logging for end-to-end traceability.
High-level API & Third-party Integrations
Integrate third-party services: authentication, analytics, search APIs, cloud inference APIs, and enterprise data sources.
Build secure and scalable API layers for production deployments.
Fine-tuning & Model Lifecycle Management
Fine-tune LLaMA, Mistral, Phi-3, and diffusion models for domain-specific tasks.
Use MLflow for tracking experiments, hyperparameters, metrics, and versioning.
Conduct evaluation on hallucinations, retrieval consistency, reasoning depth, and multimodal accuracy.

Required Skills & Qualifications
Core AI/LLM Skills

Experience with LLMs, RAG systems, LangChain, LangGraph, LlamaIndex
Hands-on with VLLM, SGLang, or similar inference engines
Model quantization (GGUF), optimization, and GPU memory tuning
Agent frameworks & tool calling (FastMCP, Groq, Hugging Face)

Multimodal & Image Generation

Stable Diffusion, ControlNet, LoRA fine-tuning, custom pipelines
Diffusers, ComfyUI, or InvokeAI experience (bonus)

Engineering & Systems

Kafka-based event-driven systems
backend development
Third-party API integrations
Docker, CI/CD, and cloud platforms (GCP/Azure)

Databases & Retrieval

MongoDB, DuckDB,
Embedding stores, vector databases (Pinecone / Qdrant), retrieval optimization

Observability & MLOps

MLflow for experiment tracking and model lifecycle
Performance monitoring, logging, auditing, API observability

Frontend (Good to have)

React, Redux, , for dashboards and AI interfaces

Ai engineer- Genrative Ai

2 days ago

Chennai, Tamil Nadu, India Weekday AI Full time ₹ 20,00,000 - ₹ 35,00,000

This role is for one of the Weekday's clientsSalary range: Rs Rs ie INR 20-35 LPA)Min Experience: 5 yearsLocation: Hyderabad, ChennaiJobType: full-timeAs an AI/ML Engineer, you will be responsible for creating end-to-end machine learning solutions—from data exploration to model deployment. You will work closely with cross-functional teams to understand...
Ai engineer- Genrative Ai

2 days ago

Chennai, Tamil Nadu, India Weekday AI (YC W21) Full time ₹ 2,00,00,000 - ₹ 6,00,00,000 per year

This role is for one of the Weekday's clientsSalary range: Rs Rs ie INR 20-35 LPA)Min Experience: 5 yearsLocation: Hyderabad, ChennaiJobType: full-timeAs an AI/ML Engineer, you will be responsible for creating end-to-end machine learning solutions—from data exploration to model deployment. You will work closely with cross-functional teams to understand...
AI & Data Science Intern

1 week ago

Chennai, Tamil Nadu, India Twite AI Technologies Full time ₹ 1,00,000 - ₹ 3,00,000 per year

Internship Opportunity at Twite AI TechnologiesPosition: AI & Data Science InternDuration: 3 Months (Internship)Location: Chennai (On-site)Type: Internship (Unpaid for 3 months – Full-Time Employment based on performance)Experience: FreshersAbout Twite AI Technologies:Twite AI Technologies is a fast-growing AI-powered solutions provider working with top...
AI Engineer

2 days ago

Chennai, Tamil Nadu, India SWITS DIGITAL Private Limited Full time ₹ 8,00,000 - ₹ 16,00,000 per year

Job Title: AI EngineerExperience:3+ YearsLocation:chennaiJob DescriptionWe are seeking a talentedMid-Level AI Engineerwith hands-on experience in developing AI/ML models, building POCs, and deploying solutions onGoogle Cloud Platform (GCP). The ideal candidate will have strong Python skills, experience with ML frameworks, and a passion for innovation in...
AI Engineer

7 days ago

Chennai, Tamil Nadu, India LatentView Full time ₹ 20,00,000 - ₹ 25,00,000 per year

About the JobLatentView Analytics is a leading global analytics and decision sciences provider, delivering solutions that help companies drive digital transformation and use data to gain a competitive advantage. With analytics solutions that provide a 360-degree view of the digital consumer, fuel machine learning capabilities, and support artificial...
AI Engineer

7 days ago

Chennai, Tamil Nadu, India LatentView Analytics Full time ₹ 15,00,000 - ₹ 25,00,000 per year

We are seeking a Full Stack AI Engineer to design and delivery of innovative AI-driven prototypes and solutions for clients. This role requires a mix of hands-on engineering, project leadership, and stakeholder management. You will architect and implement AI/ML systems, manage client expectations, and mentor junior engineers while ensuring projects are...
DevOps Engineer

2 weeks ago

Chennai, Tamil Nadu, India LuMay AI Full time ₹ 5,00,000 - ₹ 15,00,000 per year

Company DescriptionLuMay AI is a forward-thinking technology company specializing in cutting-edge artificial intelligence solutions that revolutionize industries through intelligent automation, predictive analytics, and next-generation AI applications. Our mission is to build scalable, secure, and ethical AI systems that empower businesses and communities...
Gen AI Inference Engineer

5 days ago

Chennai, Tamil Nadu, India artcube (Artcube AI Pvt. Ltd.) Full time ₹ 5,00,000 - ₹ 12,00,000 per year

Job Title: GenAI Inference Engineer (1–2 Years Experience)Location: Chennai, IndiaCompany: Artcube AI – Pioneers in GenAI for Virtual Product PlacementAbout UsWe are a next-generation AI company building proprietary models and intelligent algorithms for post-production product placement in TV/OTT and movies. Our GenAI models allow us to seamlessly insert...
AI Engineer

1 week ago

Chennai, Tamil Nadu, India Evnek Full time ₹ 15,00,000 - ₹ 25,00,000 per year

We are seeking a Mid-Level AI Engineer with 3-8 years of hands-on experience in designing, developing, and deploying AI/ML models and solutionsThe ideal candidate will focus on building Proof of Concept (POC) applications that demonstrate the feasibility and business value of AI-driven solutionsThis role involves working with machine learning frameworks such...
AI Engineer

4 days ago

Chennai, Tamil Nadu, India Aegan Technologies Private Limited Full time ₹ 12,00,000 - ₹ 36,00,000 per year

Location: ChennaiEmployment Type: Contract to HireExperience - 3yrs to 6yrsAre you passionate about building intelligent systems and exploring cutting-edge AI solutions? We're looking for a Mid-Level AI Engineer to design, develop, and implement AI/ML models that drive innovation and business impact.Key ResponsibilitiesDesign, develop, and implement AI/ML...

Americas

Europe

Asia / Oceania

Africa

AI Engineer