Ai architect

3 weeks ago


India Recro Full time

Role Overview As the AI Systems Architect , you’ll own the end-to-end design and delivery of production-grade agentic and Generative AI systems. This is a highly hands-on role requiring deep architectural insight, coding proficiency, and an obsession with performance, scalability, and reliability. You’ll architect secure, cost-efficient AI platforms on AWS, guide developers through complex debugging and optimization, and ensure all systems are observable, governed, and production-ready. Key Responsibilities Architect Production AI Systems: Design robust architectures for agentic systems (planning, reasoning, tool-calling), Gen AI/RAG pipelines, and evaluation workflows. Create detailed design documents including flow/UML/sequence diagrams and AWS deployment topologies. Optimize for Cost & Performance: Model throughput, latency, concurrency, autoscaling, CPU/GPU sizing, and vector index performance to ensure scalable, efficient deployments. Lead Debugging & Stability Efforts: Conduct deep-dive debugging, fix critical defects, and resolve production incidents; pair-program with developers to improve code quality and performance. Standardize Agentic Frameworks: Build reference implementations using Semantic Kernel (preferred), Lang Graph, Auto Gen, or Crew AI with strong schema validation, grounding, and memory management. Engineer Retrieval & Search Systems: Architect hybrid retrieval solutions including ingestion, chunking, embeddings, ranking, caching, and freshness management while minimizing hallucination risk. Productionize on AWS: Deploy and manage systems using Amazon EKS, Bedrock, S3, SQS/SNS, RDS, and Elasti Cache. Integrate IAM/Okta, Secrets Manager, and Datadog for observability, enforcing SLIs/SLOs and error budgets. Implement Observability & Monitoring: Set up distributed tracing, metrics, and logging via Open Telemetry and Datadog. Standardize dashboards, alerts, and incident response workflows. Govern Evaluation & Rollouts: Build test and evaluation frameworks—golden sets, A/B experiments, regression suites, and controlled rollouts—to ensure consistent quality across releases. Embed Security & Safety: Enforce least privilege, PII protection, and policy compliance through threat modeling, sandboxed execution, and prompt-injection defense. Establish Engineering Standards: Create reusable SDKs, connectors, CI/CD templates, and architecture review checklists to promote consistency across teams. Cross-Functional Leadership: Collaborate with product, data, and SRE teams for capacity planning, DR strategies, and post-incident RCA reviews. Mentor engineers to strengthen design and reliability practices. Must-Have Qualifications 7–10 years in software/AI engineering, including 4+ years in Gen AI application development and 2+ years architecting agentic AI systems. Expert in Python 3.11+ (asyncio, typing, packaging, profiling, pytest). Hands-on experience with Semantic Kernel , Lang Graph , Auto Gen , or Crew AI . Proven delivery of Gen AI/RAG systems on AWS Bedrock or equivalent vector-based platforms (Open Search Serverless, Pinecone, Redis). Deep understanding of AWS ecosystem : EKS, Bedrock, S3, SQS/SNS, RDS, Elasti Cache, Secrets Manager, IAM/Okta, Kong API Gateway, Datadog. Expertise in observability and incident management using Open Telemetry and Datadog. Strong focus on cost, performance, and security engineering —Fin Ops mindset, autoscaling, caching, and policy enforcement. Exceptional communication—clear diagrams, ADRs, and peer review practices. Nice-to-Have Skills Multi-agent orchestration (task decomposition, coordinator-worker, graph-based planning). Expertise with vector databases (Open Search, Pinecone, pgvector, Redis). Experience with AI evaluation, guardrails, and rollout gating. Familiarity with frontend agent interfaces, secure APIs, and Auth N/Z best practices. Exposure to policy-as-code , multi-tenant architectures, and feature management (Kong, Launch Darkly, Flipt). Experience with CI/CD via Git Hub Actions and Ia C (Terraform/AWS Cloud Formation).



  • India STEM AI Studio Full time

    Job Description STEM AI Studio is seeking a passionate and knowledgeable AI Learning & Teaching Architect to deliver personalized 1:1 online tutoring in Computer Science, with a strong emphasis on data structures and foundational AI/ML concepts. Key Responsibilities - Deliver engaging and personalized 1:1 online sessions to students. - Teach core concepts in...


  • India Ironbook AI Full time

    Job Title: Presales Azure Data & AI Architect (Microsoft)Location: Remote (India)Employment Type: Full-timeAbout Ironbook AIIronbook AI helps enterprises accelerate their AI adoption journey through cloud-native data solutions and intelligent automation. Our mission is to build scalable, secure, and AI-ready infrastructures for our clients across diverse...

  • AI Architect

    1 week ago


    India Mulya Technologies Full time

    AI Architect We are a US based Stealth mode Start-up location: Hyderabad / Bangalore / Remote ( any where in India ) We unify the processes used in Semiconductor and Hardware Systems design - thus reducing bugs, improving efficiency and productivity Our breakthrough technology has drawn investment from Silicon Valley’s boldest VCs and semiconductor...


  • India Ironbook AI Full time

    Job Title: Presales Azure Data & AI Architect (Microsoft) Location: Remote (India) Employment Type: Full-time About Ironbook AI Ironbook AI helps enterprises accelerate their AI adoption journey through cloud-native data solutions and intelligent automation. Our mission is to build scalable, secure, and AI-ready infrastructures for our clients across diverse...


  • India Ironbook AI Full time

    Job Title: Presales Azure Data & AI Architect (Microsoft) Location: Remote (India) Employment Type: Full-time About Ironbook AI Ironbook AI helps enterprises accelerate their AI adoption journey through cloud-native data solutions and intelligent automation. Our mission is to build scalable, secure, and AI-ready infrastructures for our clients across diverse...

  • AI Architect

    16 hours ago


    Bengaluru, India EvoluteIQ Full time

    Job Description Life at EvoluteIQ We at EvoluteIQ believe in the power of transformation. We are committed to building an industry leading technology that will revolutionize the way enterprises conduct business. To make that happen, we need people who are generous, genuine, self-driven, and collaborative. People who not only want to be a part of a...


  • india Braind AI Full time

    Company DescriptionBraindAI is an AI Consultancy transforming how enterprises leverage artificial intelligence. We bridge the gap between strategy and execution by delivering sophisticated automation solutions for companies and high-growth organisations across Ireland and the UK. Our team builds enterprise-grade AI systems that integrate seamlessly with...

  • AI Engineering Intern

    15 hours ago


    India Neusearch AI Full time

    About Neusearch AI At Neusearch AI, we're building a product at the intersection of AI x Marketing x Ecommerce. The applications of AI to influence the future of shopping experience is immense and we are solving some of the problems around how consumers discover, evaluate, and purchase products in an AI-first world. We are creating intelligent systems that...

  • Gen AI Architect

    3 weeks ago


    India IGT Solutions Full time

    Job Title: Architect - Gen AI, LLM, Big Data Experience: 18+ Years Location: Pune/ Gurugram Employment Type: Full-Time / Permanent Job Summary: We are looking for an experienced Architect with deep expertise in Generative AI (Gen AI), Large Language Models (LLM), and Big Data technologies. The ideal candidate will have 18+ years of experience in designing...

  • Gen AI Architect

    3 days ago


    India IGT Solutions Full time

    Job Title: Architect - Gen AI, LLM, Big Data Experience: 18+ Years Location: Pune/ Gurugram Employment Type: Full-Time / Permanent Job Summary: We are looking for an experienced Architect with deep expertise in Generative AI (Gen AI), Large Language Models (LLM), and Big Data technologies. The ideal candidate will have 18+ years of experience in designing...