AI Tech Architect
5 days ago
Overview
Own the end-to-end architecture of production AI systems with a strong hands-on bias. You’ll design robust, cost-efficient, and secure agentic/GenAI solutions on AWS. Part of your job will be to unblock lead developers by debugging code, optimizing performance, and guiding best practices. Expect to turn complex requirements into scalable, observable, and well-governed platforms.

Responsibilities
• Define target architectures for agentic systems (planning/reasoning/tool-calling), GenAI/RAG pipelines, and evaluation loops; produce clear design documents with Flow/UML/sequence diagrams and AWS deployment topologies.
• Size and optimize infrastructure for cost and performance: model throughput/latency, concurrency, autoscaling policies, CPU/GPU needs, memory footprints, vector index sizing, storage/egress, and token budgets.
• Lead deep-dive debugging and incident resolution: profile bottlenecks, fix defects, stabilize services; pair-program with developers to raise the engineering bar.
• Establish reference implementations for multi-agent frameworks (Semantic Kernel preferred; LangGraph/AutoGen/CrewAI acceptable), tool/function schemas, validation, memory, grounding, and multi-step planning.
• Architect retrieval and hybrid search systems: ingestion, chunking, embeddings, ranking, caching, freshness, and grounding; evaluate recall, precision, and hallucination risk.
• Productionize on AWS using Amazon EKS, S3, SQS/SNS, and AWS Bedrock; integrate identity (Okta/IAM), secrets (AWS Secrets Manager), eventing, and observability; enforce SLIs/SLOs and error budgets.
• Make systems observable: distributed tracing, metrics, and logs using OpenTelemetry and Datadog; standardize dashboards, alerts, and tool/trace replay.
• Build evaluation and promotion workflows: prompt/flow tests, golden sets, offline batch runs, A/B experiments, regression suites, and rollout gates.
• Design security and safety controls: threat modeling, prompt-injection defense, sandboxed tools, policy enforcement, red-team testing, PII/data governance, and audit trails.
• Define platform standards: reusable SDKs, connectors, CI/CD templates, runbooks, and architecture review checklists.
• Partner with product, data, and SRE teams to plan capacity, disaster recovery, multi-region posture, and upgrade paths; lead readiness reviews and post-incident RCAs.
• Mentor engineers and review PRs with a focus on reliability, correctness, and maintainability.

Must Have
• 7–10 years in software/AI engineering with at least 4+ years building GenAI applications and 2+ years architecting production agentic systems.
• Strong hands-on expertise in Python 3.11+ (typing, asyncio, packaging, profiling, pytest); able to dive into code, fix bugs, and optimize performance-critical paths.
• Experience with one or more agent frameworks (Semantic Kernel, LangGraph, AutoGen, CrewAI) and function/tool calling with schema and argument validation.
• Proven design of GenAI/RAG/hybrid retrieval systems using AWS Bedrock, OpenSearch Serverless, or other vector databases; grounding and retrieval evaluation experience.
• Deep knowledge of AWS architecture: Amazon EKS, Bedrock, S3, SQS/SNS, RDS (SQL Server/PostgreSQL), ElastiCache (Redis), Secrets Manager, IAM/Okta, Kong API Gateway, OpenSearch Serverless, and Datadog.
• Observability expertise: distributed tracing (OpenTelemetry), metrics, logs, correlation IDs, and service-level objectives; mature incident response and on-call practices.
• Cost and performance engineering mindset: capacity modeling, GPU/CPU sizing, autoscaling (HPA), batching/streaming, caching, and FinOps discipline.
• Security and safety fundamentals: least privilege, data isolation, policy enforcement, content moderation, jailbreak/PII defenses, and compliance awareness.
• Excellent technical communication: clear diagrams, ADRs, design docs; empathetic, structured code and architecture reviews.

Good to Have
• Multi-agent orchestration patterns: task decomposition, coordinator-worker, human-in-the-loop, graph-based planning.
• Deep expertise with vector databases and retrieval: OpenSearch Serverless, Pinecone, pgvector, Redis.
• Evaluation frameworks: red teaming, automated guardrails, regression testing, rollout gates, canary deployments.
• Frontend integration for agent UIs (streaming responses, tool traces), secure connector APIs, and AuthN/Z best practices.
• Policy-as-code (OPA) and multi-tenant architecture (RBAC, quotas, usage metering).
• Knowledge of Kong API Gateway, LaunchDarkly/Flipt for feature management, and NeMo Guardrails for runtime safety.
• CI/CD exposure (build/test with GitHub Actions, deployments via Terraform/AWS IaC templates).

Core Tech Stack (our core; equivalents welcome)
• Python 3.11+, FastAPI, Pydantic v2, SQLAlchemy 2.x, Alembic, pytest.
• Amazon EKS, AWS Bedrock, Amazon SQS/SNS, Amazon RDS (SQL Server/PostgreSQL), ElastiCache (Redis).
• Amazon S3 for storage, Amazon ECR for container images, OpenSearch Serverless for vector storage.
• AWS Secrets Manager, Okta IAM, NeMo Guardrails, Kong API Gateway.
• OpenTelemetry + Datadog for observability and monitoring.
• Custom RAG Services, Bedrock Knowledge Base, and LLM evaluation with Phoenix, Arize, and Promptfoo.
-
Senior AI/ML Engineer
6 days ago
Haryana, India NextDimension AI Full time
Compensation: INR 12-30 LPA Base + Bonus + Equity. Location: Gurgaon. About Us: NextDimension is a US-based technology startup building AI Agents in Healthcare, established by a team of distinguished AI/ML Scientists and Engineers from Google, Amazon, and Snowflake. We're empowering Enterprises by building sophisticated, high-impact AI agents that automate...
-
Senior AI/ML Engineer
5 days ago
Haryana, India NextDimension AI Full time
Compensation: INR 12-30 LPA Base + Bonus + Equity. Location: Gurgaon. About Us: NextDimension is a US-based technology startup building AI Agents in Healthcare, established by a team of distinguished AI/ML Scientists and Engineers from Google, Amazon, and Snowflake. We're empowering Enterprises by building sophisticated, high-impact AI agents that automate sales,...
-
Senior AI Engineer
5 days ago
Haryana, India NextDimension AI Full time
Compensation: INR 12-30 LPA Base + Bonus + Equity. Location: Gurgaon. NextDimension is a US-based technology startup building AI Agents in Healthcare, established by a team of distinguished AI/ML Scientists and Engineers from Google, Amazon, and Snowflake. We're empowering Enterprises by building sophisticated, high-impact AI agents that automate sales,...
-
Tech Journalist
4 days ago
Gurgaon, Haryana, India Weekday AI Full time
This role is for one of Weekday's clients. Salary range: Rs 2000000 (i.e., INR 20 LPA). Min Experience: 10 years. Location: NCR. JobType: full-time. As a Tech Journalist, you will serve as a leading voice in technology reporting, analyzing trends, breaking news, and uncovering stories that shape the digital world. This role demands a seasoned professional with deep...
-
Agentic AI Architect
3 days ago
Gurugram, Haryana, India S&P Global Full time
About the Role. Grade Level (for internal use): 13. Location: Gurgaon, Hyderabad and Bangalore. Key Responsibilities: As an Agentic AI Architect, you will: AI Architecture and System Design: Architect and design robust, scalable, and autonomous AI systems that seamlessly integrate with enterprise workflows, cloud platforms, and advanced LLM...
-
Haryana, India beBeeLeader Full time
Job Title: We are seeking a seasoned technology leader to spearhead the development of cutting-edge SaaS platforms and AI-integrated products. As our Chief Technology Architect, you will be responsible for leading the architecture, design, and implementation of scalable digital solutions that drive business growth and innovation. About the Role: Lead the...
-
Applied AI Engineer
2 days ago
Gurgaon, Haryana, India Weekday AI Full time
This role is for one of Weekday's clients. Min Experience: 0 years. Location: Gurgaon, Gurugram. JobType: full-time. Requirements: At Shipsy, we're building proactive AI systems that anticipate and automate customer needs, reimagining customer experience beyond reactive support. This role focuses on applied AI engineering to solve real customer challenges. You'll start...
-
AI Presales Solution Architect
5 days ago
Gurugram, Haryana, India NTT DATA Full time
Req ID: 307369. We are currently seeking an AI Presales Solution Architect to join our team in Gurgaon, Haryāna (IN-HR), India (IN). Job Duties: More than 12 years of overall experience in Data and Analytics. More than 5 years of experience working as a presales solutions architect. Should have done end-to-end presales solutioning for data project...
-
Delivery Solution Architect
3 days ago
Gurugram, Haryana, India SoftwareONE Full time
Why SoftwareOne? The role: As a Delivery Solution Architect specializing in Data and Artificial Intelligence (AI) with a focus on AWS technologies, you will design and implement robust, scalable, and cutting-edge solutions leveraging the AWS ecosystem. This role involves collaborating with diverse teams, understanding business needs, and delivering...
-
GenAI / AI Platform Architect
12 hours ago
Gurgaon, Haryana, India BOSTON SCIENTIFIC Full time
Additional Locations: India-Haryana, Gurgaon
Diversity - Innovation - Caring - Global Collaboration - Winning Spirit - High Performance
At Boston Scientific, we'll give you the opportunity to harness all that's within you by working in teams of diverse and high-performing employees, tackling some of the most important health industry challenges. With access to...