Generative AI Engineer

Gurugram, Haryana, India, IN Recro Full time

About the job


Role Overview

As the AI Systems Architect, you’ll own the end-to-end design and delivery of production-grade agentic and Generative AI systems. This is a highly hands-on role requiring deep architectural insight, coding proficiency, and an obsession with performance, scalability, and reliability. You’ll architect secure, cost-efficient AI platforms on AWS, guide developers through complex debugging and optimization, and ensure all systems are observable, governed, and production-ready.


Key Responsibilities

  • Architect Production AI Systems: Design robust architectures for agentic systems (planning, reasoning, tool-calling), GenAI/RAG pipelines, and evaluation workflows. Create detailed design documents, including flow/UML/sequence diagrams and AWS deployment topologies.
  • Optimize for Cost & Performance: Model throughput, latency, concurrency, autoscaling, CPU/GPU sizing, and vector index performance to ensure scalable, efficient deployments.
  • Lead Debugging & Stability Efforts: Conduct deep-dive debugging, fix critical defects, and resolve production incidents; pair-program with developers to improve code quality and performance.
  • Standardize Agentic Frameworks: Build reference implementations using Semantic Kernel (preferred), LangGraph, AutoGen, or CrewAI with strong schema validation, grounding, and memory management.
  • Engineer Retrieval & Search Systems: Architect hybrid retrieval solutions including ingestion, chunking, embeddings, ranking, caching, and freshness management while minimizing hallucination risk.
  • Productionize on AWS: Deploy and manage systems using Amazon EKS, Bedrock, S3, SQS/SNS, RDS, and ElastiCache. Integrate IAM/Okta for access control, Secrets Manager for credentials, and Datadog for observability, enforcing SLIs/SLOs and error budgets.
  • Implement Observability & Monitoring: Set up distributed tracing, metrics, and logging via OpenTelemetry and Datadog. Standardize dashboards, alerts, and incident response workflows.
  • Govern Evaluation & Rollouts: Build test and evaluation frameworks—golden sets, A/B experiments, regression suites, and controlled rollouts—to ensure consistent quality across releases.
  • Embed Security & Safety: Enforce least privilege, PII protection, and policy compliance through threat modeling, sandboxed execution, and prompt-injection defense.
  • Establish Engineering Standards: Create reusable SDKs, connectors, CI/CD templates, and architecture review checklists to promote consistency across teams.
  • Cross-Functional Leadership: Collaborate with product, data, and SRE teams for capacity planning, DR strategies, and post-incident RCA reviews. Mentor engineers to strengthen design and reliability practices.


Must-Have Qualifications

  • 7–10 years in software/AI engineering, including 4+ years in GenAI application development and 2+ years architecting agentic AI systems.
  • Expert in Python 3.11+ (asyncio, typing, packaging, profiling, pytest).
  • Hands-on experience with Semantic Kernel, LangGraph, AutoGen, or CrewAI.
  • Proven delivery of GenAI/RAG systems on AWS Bedrock or an equivalent platform, backed by a vector store (OpenSearch Serverless, Pinecone, Redis).
  • Deep understanding of the AWS ecosystem (EKS, Bedrock, S3, SQS/SNS, RDS, ElastiCache, Secrets Manager, IAM) and the surrounding tooling (Okta, Kong API Gateway, Datadog).
  • Expertise in observability and incident management using OpenTelemetry and Datadog.
  • Strong focus on cost, performance, and security engineering—FinOps mindset, autoscaling, caching, and policy enforcement.
  • Exceptional communication—clear diagrams, ADRs, and peer review practices.


Nice-to-Have Skills

  • Multi-agent orchestration (task decomposition, coordinator-worker, graph-based planning).
  • Expertise with vector databases (OpenSearch, Pinecone, pgvector, Redis).
  • Experience with AI evaluation, guardrails, and rollout gating.
  • Familiarity with frontend agent interfaces, secure APIs, and AuthN/Z best practices.
  • Exposure to policy-as-code, multi-tenant architectures, and feature management (Kong, LaunchDarkly, Flipt).
  • Experience with CI/CD via GitHub Actions and IaC (Terraform/AWS CloudFormation).

