Ai tech architect
11 hours ago
Overview Own the end-to-end architecture of production AI systems with a strong hands-on bias. You’ll design robust, cost-efficient, and secure agentic/Gen AI solutions on AWS. Part of your job will be to unblock lead developers by debugging code, optimizing performance, and guiding best practices. Expect to turn complex requirements into scalable, observable, and well-governed platforms. Responsibilities • Define target architectures for agentic systems (planning/reasoning/tool-calling), Gen AI/RAG pipelines, and evaluation loops; produce clear design documents with Flow/UML/sequence diagrams and AWS deployment topologies. • Size and optimize infrastructure for cost and performance: model throughput/latency, concurrency, autoscaling policies, CPU/GPU needs, memory footprints, vector index sizing, storage/egress, and token budgets. • Lead deep-dive debugging and incident resolution: profile bottlenecks, fix defects, stabilize services; pair-program with developers to raise the engineering bar. • Establish reference implementations for multi-agent frameworks (Semantic Kernel preferred; Lang Graph/Auto Gen/Crew AI acceptable), tool/function schemas, validation, memory, grounding, and multi-step planning. • Architect retrieval and hybrid search systems: ingestion, chunking, embeddings, ranking, caching, freshness, and grounding; evaluate recall, precision, and hallucination risk. • Productionize on AWS using Amazon EKS, S3, SQS/SNS, and AWS Bedrock; integrate identity (Okta/IAM), secrets (AWS Secrets Manager), eventing, and observability; enforce SLIs/SLOs and error budgets. • Make systems observable: distributed tracing, metrics, and logs using Open Telemetry and Datadog; standardize dashboards, alerts, and tool/trace replay. • Build evaluation and promotion workflows: prompt/flow tests, golden sets, offline batch runs, A/B experiments, regression suites, and rollout gates. • Design security and safety controls: threat modeling, prompt-injection defense, sandboxed tools, policy enforcement, red-team testing, PII/data governance, and audit trails. • Define platform standards: reusable SDKs, connectors, CI/CD templates, runbooks, and architecture review checklists. • Partner with product, data, and SRE teams to plan capacity, disaster recovery, multi-region posture, and upgrade paths; lead readiness reviews and post-incident RCAs. • Mentor engineers and review PRs with a focus on reliability, correctness, and maintainability. Must Have • 7–10 years in software/AI engineering with at least 4+ years building Gen AI applications and 2+ years architecting production agentic systems. • Strong hands-on expertise in Python 3.11+ (typing, asyncio, packaging, profiling, pytest); able to dive into code, fix bugs, and optimize performance-critical paths. • Experience with one or more agent frameworks (Semantic Kernel, Lang Graph, Auto Gen, Crew AI) and function/tool calling with schema and argument validation. • Proven design of Gen AI/RAG/hybrid retrieval systems using AWS Bedrock, Open Search Serverless, or other vector databases; grounding and retrieval evaluation experience. • Deep knowledge of AWS architecture: Amazon EKS, Bedrock, S3, SQS/SNS, RDS (SQL Server/Postgre SQL), Elasti Cache (Redis), Secrets Manager, IAM/Okta, Kong API Gateway, Open Search Serverless, and Datadog. • Observability expertise: distributed tracing (Open Telemetry), metrics, logs, correlation IDs, and service-level objectives; mature incident response and on-call practices. • Cost and performance engineering mindset: capacity modeling, GPU/CPU sizing, autoscaling (HPA), batching/streaming, caching, and Fin Ops discipline. • Security and safety fundamentals: least privilege, data isolation, policy enforcement, content moderation, jailbreak/PII defenses, and compliance awareness. • Excellent technical communication: clear diagrams, ADRs, design docs; empathetic, structured code and architecture reviews. Good to Have • Multi-agent orchestration patterns: task decomposition, coordinator-worker, human-in-the-loop, graph-based planning. • Deep expertise with vector databases and retrieval: Open Search Serverless, Pinecone, pgvector, Redis. • Evaluation frameworks: red teaming, automated guardrails, regression testing, rollout gates, canary deployments. • Frontend integration for agent UIs (streaming responses, tool traces), secure connector APIs, and Auth N/Z best practices. • Policy-as-code (OPA) and multi-tenant architecture (RBAC, quotas, usage metering). • Knowledge of Kong API Gateway, Launch Darkly/Flipt for feature management, and Ne Mo Guardrails for runtime safety. • CI/CD exposure (build/test with Git Hub Actions, deployments via Terraform/AWS Ia C templates). Core Tech Stack (our core; equivalents welcome) • Python 3.11+, Fast API, Pydantic v2, SQLAlchemy 2.x, Alembic, pytest. • Amazon EKS, AWS Bedrock, Amazon SQS/SNS, Amazon RDS (SQL Server/Postgre SQL), Elasti Cache (Redis). • Amazon S3 for storage, Amazon ECR for container images, Open Search Serverless for vector storage. • AWS Secrets Manager, Okta IAM, Ne Mo Guardrails, Kong API Gateway. • Open Telemetry + Datadog for observability and monitoring. • Custom RAG Services, Bedrock Knowledge Base, and LLM evaluation with Phoenix, Arize, and Promptfoo.
-
Full Stack Developer
1 day ago
Tiruchirappalli, India Debales AI Full timeJob Opportunity: Full Stack Developer (Remote | 0–1 Year Experience)Location: Work from HomeExperience: 0–1 Year (Full-time internship or junior developer experience required)About Debales AI builds autonomous AI Agents that seamlessly integrate into existing systems — no new dashboards, no added workflow overhead. With 100+ integrations and 80+...
-
Data scientist
3 weeks ago
Tiruchirappalli, India Aviso AI Full timeAbout Aviso AI: Aviso AI is revolutionizing enterprise sales intelligence with its cutting-edge AI solutions for forecasting, deal guidance, and revenue operations. By leveraging AI and machine learning, we transform how enterprise teams operate, allowing them to make data-driven decisions, optimize sales strategies, and increase productivity. We are looking...
-
Data Scientist
4 weeks ago
Tiruchirappalli, India Aviso AI Full timeAbout Aviso AI: Aviso AI is revolutionizing enterprise sales intelligence with its cutting-edge AI solutions for forecasting, deal guidance, and revenue operations. By leveraging AI and machine learning, we transform how enterprise teams operate, allowing them to make data-driven decisions, optimize sales strategies, and increase productivity. We are looking...
-
Google Cloud Specialist
4 weeks ago
Tiruchirappalli, India KV TECH SOLUTIONS PVT LTD Full timeKV Tech Solutions Pvt Ltd embodies the spirit of innovation and adaptability in today's rapidly evolving digital landscape. Our mission is to transform businesses with Google Cloud excellence. We are a team of experts dedicated to providing cutting-edge solutions in areas like VM migrations, Generative AI, data transformation, and predictive analytics. To...
-
Google Cloud Specialist
4 weeks ago
Tiruchirappalli, India KV TECH SOLUTIONS PVT LTD Full timeKV Tech Solutions Pvt Ltd embodies the spirit of innovation and adaptability in today's rapidly evolving digital landscape. Our mission is to transform businesses with Google Cloud excellence. We are a team of experts dedicated to providing cutting-edge solutions in areas like VM migrations, Generative AI, data transformation, and predictive analytics. To...
-
Microsoft Cloud Data Architect
4 weeks ago
Tiruchirappalli, India INFOC Full timeINFOC is a forward-thinking Digital Transformation and Cloud Consulting company. We help organizations modernize operations, harness AI, and maximize the value of Microsoft Cloud solutions across industries including Retail, Manufacturing, Technology, and Energy.We are seeking a Data Engineer with expertise in AI and Microsoft Cloud platforms to design,...
-
Tech Lead
3 weeks ago
Tiruchirappalli, India Innovation Kite Full timeWe’re looking for a dynamic and experienced Tech Lead – PHP to lead full stack development, architect scalable solutions, and drive technical excellence across projects. If you're comfortable working across modern frameworks, eCommerce platforms, and complex integrations, we want to hear from you.About the RoleAs a Tech Lead, you’ll take ownership of...
-
Tech Lead
3 weeks ago
Tiruchirappalli, India Alaan الآن Full timeAbout AlaanAlaan is the SuperCard™ for businesses and the most loved fintech in the Middle East. Our mission is to simplify finance for businesses so they can save time and money.Alaan provides everything businesses need to manage and control expenses, including the SuperCard™, AI-powered automation and insights, streamlined accounting, and centralized...
-
Tech Lead
3 weeks ago
Tiruchirappalli, India Alaan الآن Full timeAbout AlaanAlaan is the SuperCard™ for businesses and the most loved fintech in the Middle East. Our mission is to simplify finance for businesses so they can save time and money.Alaan provides everything businesses need to manage and control expenses, including the SuperCard™, AI-powered automation and insights, streamlined accounting, and centralized...
-
Data Engineer
4 weeks ago
Tiruchirappalli, India INFOC Full timeINFOC is a forward-thinking Digital Transformation and Cloud Consulting company. We help organizations modernize operations, harness AI, and maximize the value of Microsoft Cloud solutions across industries including Retail, Manufacturing, Technology, and Energy.We are seeking a Data Engineer with expertise in AI and Microsoft Cloud platforms to design,...