AI Tech Architect

1 day ago


Vellore, India Recro Full time

AI Tech Architect (7–10 yrs): Agentic & Gen AI Platforms

Location: Bengaluru / Gurugram
Team: AI Platforms & Architecture
Employment: Full-time
Key Skills: Python, FastAPI, AWS (EKS, Bedrock, OpenSearch, S3, RDS), GenAI & RAG Architecture, Agent Frameworks (Semantic Kernel, LangGraph, AutoGen), Vector Databases, Observability (OpenTelemetry, Datadog), Security & Scalability Design.

Overview
Own the end-to-end architecture of production AI systems with a strong hands-on bias. You’ll design robust, cost-efficient, and secure agentic/GenAI solutions on AWS. Part of your job will be to unblock lead developers by debugging code, optimizing performance, and guiding best practices. Expect to turn complex requirements into scalable, observable, and well-governed platforms.

Responsibilities
• Define target architectures for agentic systems (planning/reasoning/tool-calling), GenAI/RAG pipelines, and evaluation loops; produce clear design documents with Flow/UML/sequence diagrams and AWS deployment topologies.
• Size and optimize infrastructure for cost and performance: model throughput/latency, concurrency, autoscaling policies, CPU/GPU needs, memory footprints, vector index sizing, storage/egress, and token budgets.
• Lead deep-dive debugging and incident resolution: profile bottlenecks, fix defects, stabilize services; pair-program with developers to raise the engineering bar.
• Establish reference implementations for multi-agent frameworks (Semantic Kernel preferred; LangGraph/AutoGen/CrewAI acceptable), tool/function schemas, validation, memory, grounding, and multi-step planning.
• Architect retrieval and hybrid search systems: ingestion, chunking, embeddings, ranking, caching, freshness, and grounding; evaluate recall, precision, and hallucination risk.
• Productionize on AWS using Amazon EKS, S3, SQS/SNS, and AWS Bedrock; integrate identity (Okta/IAM), secrets (AWS Secrets Manager), eventing, and observability; enforce SLIs/SLOs and error budgets.
• Make systems observable: distributed tracing, metrics, and logs using OpenTelemetry and Datadog; standardize dashboards, alerts, and tool/trace replay.
• Build evaluation and promotion workflows: prompt/flow tests, golden sets, offline batch runs, A/B experiments, regression suites, and rollout gates.
• Design security and safety controls: threat modeling, prompt-injection defense, sandboxed tools, policy enforcement, red-team testing, PII/data governance, and audit trails.
• Define platform standards: reusable SDKs, connectors, CI/CD templates, runbooks, and architecture review checklists.
• Partner with product, data, and SRE teams to plan capacity, disaster recovery, multi-region posture, and upgrade paths; lead readiness reviews and post-incident RCAs.
• Mentor engineers and review PRs with a focus on reliability, correctness, and maintainability.

Must Have
• 7–10 years in software/AI engineering, with at least 4 years building GenAI applications and at least 2 years architecting production agentic systems.
• Strong hands-on expertise in Python 3.11+ (typing, asyncio, packaging, profiling, pytest); able to dive into code, fix bugs, and optimize performance-critical paths.
• Experience with one or more agent frameworks (Semantic Kernel, LangGraph, AutoGen, CrewAI) and function/tool calling with schema and argument validation.
• Proven design of GenAI/RAG/hybrid retrieval systems using AWS Bedrock, OpenSearch Serverless, or other vector databases; grounding and retrieval evaluation experience.
• Deep knowledge of AWS architecture: Amazon EKS, Bedrock, S3, SQS/SNS, RDS (SQL Server/PostgreSQL), ElastiCache (Redis), Secrets Manager, IAM/Okta, Kong API Gateway, OpenSearch Serverless, and Datadog.
• Observability expertise: distributed tracing (OpenTelemetry), metrics, logs, correlation IDs, and service-level objectives; mature incident response and on-call practices.
• Cost and performance engineering mindset: capacity modeling, GPU/CPU sizing, autoscaling (HPA), batching/streaming, caching, and FinOps discipline.
• Security and safety fundamentals: least privilege, data isolation, policy enforcement, content moderation, jailbreak/PII defenses, and compliance awareness.
• Excellent technical communication: clear diagrams, ADRs, design docs; empathetic, structured code and architecture reviews.

Good to Have
• Multi-agent orchestration patterns: task decomposition, coordinator-worker, human-in-the-loop, graph-based planning.
• Deep expertise with vector databases and retrieval: OpenSearch Serverless, Pinecone, pgvector, Redis.
• Evaluation frameworks: red teaming, automated guardrails, regression testing, rollout gates, canary deployments.
• Frontend integration for agent UIs (streaming responses, tool traces), secure connector APIs, and AuthN/Z best practices.
• Policy-as-code (OPA) and multi-tenant architecture (RBAC, quotas, usage metering).
• Knowledge of Kong API Gateway, LaunchDarkly/Flipt for feature management, and NeMo Guardrails for runtime safety.
• CI/CD exposure (build/test with GitHub Actions, deployments via Terraform/AWS IaC templates).

Core Tech Stack (our core; equivalents welcome)
• Python 3.11+, FastAPI, Pydantic v2, SQLAlchemy 2.x, Alembic, pytest.
• Amazon EKS, AWS Bedrock, Amazon SQS/SNS, Amazon RDS (SQL Server/PostgreSQL), ElastiCache (Redis).
• Amazon S3 for storage, Amazon ECR for container images, OpenSearch Serverless for vector storage.
• AWS Secrets Manager, Okta IAM, NeMo Guardrails, Kong API Gateway.
• OpenTelemetry + Datadog for observability and monitoring.
• Custom RAG Services, Bedrock Knowledge Base, and LLM evaluation with Phoenix, Arize, and Promptfoo.


  • AI/ML engineer

    2 days ago


    Vellore, India Kresta Softech Private Limited Full time

    Job Title: Senior AI/ML Engineer - Generative AI & LLMs Industry: Healthcare / Automotive OEC Location: Hyderabad (On-Site)/ Remote We're seeking an expert AI/ML Engineer to lead development of cutting-edge Generative AI solutions for transformative projects in Healthcare and Automotive industries. Who We Need: 5-12 years core AI/ML experience in Healthcare...


  • Vellore, India Debales AI Full time

    About Debales AI  Debales AI builds autonomous AI Agents that seamlessly integrate into existing systems — no new dashboards, no added workflow overhead. With 100+ integrations and 80+ specialized AI Agents, we streamline high-volume operations across Logistics, E-Commerce, and Education, helping teams scale efficiency without scaling headcount. Role...


  • Vellore, India DigiHelic Solutions Pvt. Ltd. Full time

    Job description: Senior AI/ML Specialist/Lead. Experience Range: 7+ years of professional experience. Mandatory Skills: AI/ML, Python, Data Science, Deep Learning. Key Responsibilities: Define and execute the AI/ML strategy for content generation (Gen-AI), entity linkage and matching, image processing, and content understanding. Lead the development of next-generation content...


  • Vellore, India GoZupees — Building AI Agents for Tomorrow Full time

    Multiple Positions • Remote • Full-Time Company: GoZupees (Project Atlas – Agentic AI) Location: Remote (Preference: Gurgaon / NCR) About GoZupees GoZupees builds AI Agents for small businesses in the UK, helping them automate conversations, capture leads, and deliver human-like customer support. Role Overview What You’ll Do - Source and close...

  • Founding Engineer

    3 weeks ago


    Vellore, India Aonxi Full time

    Location: US or Remote (US overlap) Type: Full-time, founding equity About Aonxi Aonxi is building the Neural Revenue OS—AI that listens to every sales call, learns what actually sells, and turns those patterns into automated, compounding ROI. We need a hands-on founding engineer to architect and ship a CRM where calls, transcripts, actions, and results...

  • Digital marketing

    3 weeks ago


    Vellore, India Smart AI EdTech Full time

    Location: Remote. Job Type: Full-time. Salary: Base salary of ₹1.8 lakhs per annum for the first year, with a 6-month probation period; ₹2.4 lakhs per annum after successful completion of the first year, based on performance. Performance-based bonus on deal closures. About Us: Inno Cloud is an EdTech AI provider. If you are passionate about technology and thrive in a...


  • Vellore, India AnyFeast Full time

    🌍 Senior Nutrition Thought Leader (Remote – India Based). Are you an experienced Nutritionist or Dietitian ready to shape how AI meets nutrition? At AnyFeast, we’re building the future of healthy, sustainable eating, and we’re looking for visionary professionals who want to lead that conversation. About AnyFeast: AnyFeast is a UK- and India-based...

  • Tech Lead

    2 weeks ago


    Vellore, India Innovation Kite Full time

    We’re looking for a dynamic and experienced Tech Lead – PHP to lead full stack development, architect scalable solutions, and drive technical excellence across projects. If you're comfortable working across modern frameworks, eCommerce platforms, and complex integrations, we want to hear from you. About the Role: As a Tech Lead, you’ll take ownership of...

  • Operations manager

    2 weeks ago


    Vellore, India Embergrit Full time

    Founding D2C Operator (Operations & Automation). We are Embergrit, a new D2C brand on a mission to build the next iconic men's grooming brand in America. Our philosophy is "Quiet Confidence", a blend of rugged minimalism and engineered quality. We are on a trajectory for explosive growth, and you will be the first operational pillar of our empire. This is...


  • Vellore, India Alchemic Full time

    About us: Alchemic (previously Echo) is a market research platform that helps companies get customer insights in hours, not months, without sacrificing quality. We make it possible to run AI interviews over voice, video, text, and WhatsApp, and automatically generate consultancy-grade reports for business decision-makers. Every customer conversation is...