AI Engineer
5 days ago
About MAKO
Founded in 2013, Mako IT Lab is a global software development company with a strong presence across the USA, UK, India, and Nepal. Over the years, we've partnered with companies globally helping them solve complex challenges and build meaningful digital experiences.
What truly defines Mako is our culture. We believe in creating an environment where people feel empowered to take ownership, exercise freedom in their ideas, and contribute to solutions that genuinely make an impact. Learning is at the heart of who we are—our teams constantly grow through hands-on exposure, real-world problem solving, and continuous knowledge sharing across functions and geographies.
We don't just build long-term partnerships with clients—we build long-term careers for our people. At Mako, you'll be part of a collaborative, supportive, and fast-growing global team where curiosity is encouraged, initiative is celebrated, and every individual plays a meaningful role in shaping the company's journey.
Role Overview
We are seeking an experienced AI Engineer with deep expertise in LLM-driven architectures, RAG systems, agentic workflows, and multimodal AI development. The ideal candidate will be skilled in building scalable AI pipelines using FastAPI, Kafka, FastMCP, and Tavily Web Search, while also having hands-on experience with vllm-based inference and Stable Diffusion pipelines.You will architect and implement intelligent systems leveraging Large Language Models, vision models, and autonomous agents, with a strong focus on observability, performance, and production reliability.
Key Responsibilities
- LLM, VLLM & Agentic System Development
- Build autonomous LLM agents using LangChain, LangGraph, and FastMCP.
- Develop RAG workflows using embeddings, vector stores, and knowledge-grounded reasoning.
- Integrate VLLM / SGLang / other high-throughput inference backends for low-latency model serving.
- Implement Tavily web-search integrations for real-time knowledge augmentation.
- Optimize inference using quantized GGUF, tensorized formats, and GPU-accelerated pipelines.
- Multimodal & Image Generation Systems
- Build and deploy Stable Diffusion (SDXL/SD 1.5/ControlNet/T2I) pipelines for image generation tasks.
- Integrate LoRAs, control modules, and diffusion-based fine-tuning for custom domains.
- Develop multimodal agents that combine LLM reasoning with vision tasks such as classification, captioning, or image prompts.
- Backend & Infrastructure Engineering
- Build robust FastAPI services for orchestrating LLMs, Stable Diffusion, retrieval, and agentic tasks.
- Develop event-driven workflows using Kafka for distributed AI systems.
- Implement auditing, agent-output monitoring, and API-layer logging for end-to-end traceability.
- High-level API & Third-party Integrations
- Integrate third-party services: authentication, analytics, search APIs, cloud inference APIs, and enterprise data sources.
- Build secure and scalable API layers for production deployments.
- Fine-tuning & Model Lifecycle Management
- Fine-tune LLaMA, Mistral, Phi-3, and diffusion models for domain-specific tasks.
- Use MLflow for tracking experiments, hyperparameters, metrics, and versioning.
- Conduct evaluation on hallucinations, retrieval consistency, reasoning depth, and multimodal accuracy.
Required Skills & Qualifications
Core AI/LLM Skills
- Experience with LLMs, RAG systems, LangChain, LangGraph, LlamaIndex
- Hands-on with VLLM, SGLang, or similar inference engines
- Model quantization (GGUF), optimization, and GPU memory tuning
- Agent frameworks & tool calling (FastMCP, Groq, Hugging Face)
Multimodal & Image Generation
- Stable Diffusion, ControlNet, LoRA fine-tuning, custom pipelines
- Diffusers, ComfyUI, or InvokeAI experience (bonus)
Engineering & Systems
- Kafka-based event-driven systems
- backend development
- Third-party API integrations
- Docker, CI/CD, and cloud platforms (GCP/Azure)
Databases & Retrieval
- MongoDB, DuckDB,
- Embedding stores, vector databases (Pinecone / Qdrant), retrieval optimization
Observability & MLOps
- MLflow for experiment tracking and model lifecycle
- Performance monitoring, logging, auditing, API observability
Frontend (Good to have)
- React, Redux, , for dashboards and AI interfaces
-
Ai engineer- Genrative Ai
2 days ago
Chennai, Tamil Nadu, India Weekday AI Full time ₹ 20,00,000 - ₹ 35,00,000This role is for one of the Weekday's clientsSalary range: Rs Rs ie INR 20-35 LPA)Min Experience: 5 yearsLocation: Hyderabad, ChennaiJobType: full-timeAs an AI/ML Engineer, you will be responsible for creating end-to-end machine learning solutions—from data exploration to model deployment. You will work closely with cross-functional teams to understand...
-
Ai engineer- Genrative Ai
2 days ago
Chennai, Tamil Nadu, India Weekday AI (YC W21) Full time ₹ 2,00,00,000 - ₹ 6,00,00,000 per yearThis role is for one of the Weekday's clientsSalary range: Rs Rs ie INR 20-35 LPA)Min Experience: 5 yearsLocation: Hyderabad, ChennaiJobType: full-timeAs an AI/ML Engineer, you will be responsible for creating end-to-end machine learning solutions—from data exploration to model deployment. You will work closely with cross-functional teams to understand...
-
AI & Data Science Intern
1 week ago
Chennai, Tamil Nadu, India Twite AI Technologies Full time ₹ 1,00,000 - ₹ 3,00,000 per yearInternship Opportunity at Twite AI TechnologiesPosition: AI & Data Science InternDuration: 3 Months (Internship)Location: Chennai (On-site)Type: Internship (Unpaid for 3 months – Full-Time Employment based on performance)Experience: FreshersAbout Twite AI Technologies:Twite AI Technologies is a fast-growing AI-powered solutions provider working with top...
-
AI Engineer
2 days ago
Chennai, Tamil Nadu, India SWITS DIGITAL Private Limited Full time ₹ 8,00,000 - ₹ 16,00,000 per yearJob Title: AI EngineerExperience:3+ YearsLocation:chennaiJob DescriptionWe are seeking a talentedMid-Level AI Engineerwith hands-on experience in developing AI/ML models, building POCs, and deploying solutions onGoogle Cloud Platform (GCP). The ideal candidate will have strong Python skills, experience with ML frameworks, and a passion for innovation in...
-
AI Engineer
7 days ago
Chennai, Tamil Nadu, India LatentView Full time ₹ 20,00,000 - ₹ 25,00,000 per yearAbout the JobLatentView Analytics is a leading global analytics and decision sciences provider, delivering solutions that help companies drive digital transformation and use data to gain a competitive advantage. With analytics solutions that provide a 360-degree view of the digital consumer, fuel machine learning capabilities, and support artificial...
-
AI Engineer
7 days ago
Chennai, Tamil Nadu, India LatentView Analytics Full time ₹ 15,00,000 - ₹ 25,00,000 per yearWe are seeking a Full Stack AI Engineer to design and delivery of innovative AI-driven prototypes and solutions for clients. This role requires a mix of hands-on engineering, project leadership, and stakeholder management. You will architect and implement AI/ML systems, manage client expectations, and mentor junior engineers while ensuring projects are...
-
DevOps Engineer
2 weeks ago
Chennai, Tamil Nadu, India LuMay AI Full time ₹ 5,00,000 - ₹ 15,00,000 per yearCompany DescriptionLuMay AI is a forward-thinking technology company specializing in cutting-edge artificial intelligence solutions that revolutionize industries through intelligent automation, predictive analytics, and next-generation AI applications. Our mission is to build scalable, secure, and ethical AI systems that empower businesses and communities...
-
Gen AI Inference Engineer
5 days ago
Chennai, Tamil Nadu, India artcube (Artcube AI Pvt. Ltd.) Full time ₹ 5,00,000 - ₹ 12,00,000 per yearJob Title: GenAI Inference Engineer (1–2 Years Experience)Location: Chennai, IndiaCompany: Artcube AI – Pioneers in GenAI for Virtual Product PlacementAbout UsWe are a next-generation AI company building proprietary models and intelligent algorithms for post-production product placement in TV/OTT and movies. Our GenAI models allow us to seamlessly insert...
-
AI Engineer
1 week ago
Chennai, Tamil Nadu, India Evnek Full time ₹ 15,00,000 - ₹ 25,00,000 per yearWe are seeking a Mid-Level AI Engineer with 3-8 years of hands-on experience in designing, developing, and deploying AI/ML models and solutionsThe ideal candidate will focus on building Proof of Concept (POC) applications that demonstrate the feasibility and business value of AI-driven solutionsThis role involves working with machine learning frameworks such...
-
AI Engineer
4 days ago
Chennai, Tamil Nadu, India Aegan Technologies Private Limited Full time ₹ 12,00,000 - ₹ 36,00,000 per yearLocation: ChennaiEmployment Type: Contract to HireExperience - 3yrs to 6yrsAre you passionate about building intelligent systems and exploring cutting-edge AI solutions? We're looking for a Mid-Level AI Engineer to design, develop, and implement AI/ML models that drive innovation and business impact.Key ResponsibilitiesDesign, develop, and implement AI/ML...