Ai Tech Architect

2 days ago


Kannur, India Whatjobs IN C2 Full time

Overview Own the end-to-end architecture of production AI systems with a strong hands-on bias. You’ll design robust, cost-efficient, and secure agentic/GenAI solutions on AWS. Part of your job will be to unblock lead developers by debugging code, optimizing performance, and guiding best practices. Expect to turn complex requirements into scalable, observable, and well-governed platforms. Responsibilities - Define target architectures for agentic systems (planning/reasoning/tool-calling), GenAI/RAG pipelines, and evaluation loops; produce clear design documents with Flow/UML/sequence diagrams and AWS deployment topologies. - Size and optimize infrastructure for cost and performance: model throughput/latency, concurrency, autoscaling policies, CPU/GPU needs, memory footprints, vector index sizing, storage/egress, and token budgets. - Lead deep-dive debugging and incident resolution: profile bottlenecks, fix defects, stabilize services; pair-program with developers to raise the engineering bar. - Establish reference implementations for multi-agent frameworks (Semantic Kernel preferred; LangGraph/AutoGen/CrewAI acceptable), tool/function schemas, validation, memory, grounding, and multi-step planning. - Architect retrieval and hybrid search systems: ingestion, chunking, embeddings, ranking, caching, freshness, and grounding; evaluate recall, precision, and hallucination risk. - Productionize on AWS using Amazon EKS, S3, SQS/SNS, and AWS Bedrock; integrate identity (Okta/IAM), secrets (AWS Secrets Manager), eventing, and observability; enforce SLIs/SLOs and error budgets. - Make systems observable: distributed tracing, metrics, and logs using OpenTelemetry and Datadog; standardize dashboards,alerts, and tool/trace replay. - Build evaluation and promotion workflows: prompt/flow tests, golden sets, offline batch runs, A/B experiments, regression suites, and rollout gates. - Design security and safety controls: threat modeling, prompt-injection defense, sandboxed tools, policy enforcement, red-team testing, PII/data governance, and audit trails. - Define platform standards: reusable SDKs, connectors, CI/CD templates, runbooks, and architecture review checklists. - Partner with product, data, and SRE teams to plan capacity, disaster recovery, multi-region posture, and upgrade paths; lead readiness reviews and post-incident RCAs. - Mentor engineers and review PRs with a focus on reliability, correctness, and maintainability. Must Have - 7–10 years in software/AI engineering with at least 4+ years building GenAI applications and 2+ years architecting production agentic systems. - Strong hands-on expertise in Python 3.11+ (typing, asyncio, packaging, profiling, pytest); able to dive into code,fix bugs, and optimize performance-critical paths. - Experience with one or more agent frameworks (Semantic Kernel, LangGraph, AutoGen, CrewAI) and function/tool calling with schema and argument validation. - Proven design of GenAI/RAG/hybrid retrieval systems using AWS Bedrock, OpenSearch Serverless, or other vector databases; grounding and retrievalevaluation experience. - Deep knowledge of AWS architecture: Amazon EKS, Bedrock, S3, SQS/SNS, RDS (SQL Server/PostgreSQL), ElastiCache (Redis), Secrets Manager, IAM/Okta, Kong API Gateway, OpenSearch Serverless, and Datadog. - Observability expertise: distributed tracing (OpenTelemetry), metrics, logs, correlation IDs, and service-level objectives; mature incident response and on-call practices. - Cost and performance engineering mindset: capacity modeling, GPU/CPU sizing, autoscaling (HPA), batching/streaming, caching, and FinOps discipline. - Security and safety fundamentals: least privilege, data isolation, policy enforcement, content moderation, jailbreak/PII defenses, and compliance awareness. - Excellent technical communication: clear diagrams, ADRs, design docs; empathetic, structured code and architecture reviews. Good to Have - Multi-agent orchestration patterns: task decomposition, coordinator-worker, human-in-the-loop, graph-based planning. - Deep expertise with vector databases and retrieval: OpenSearch Serverless, Pinecone, pgvector, Redis. - Evaluation frameworks: red teaming, automated guardrails, regression testing, rollout gates, canary deployments. - Frontend integration for agent UIs (streaming responses, tool traces), secure connector APIs, and AuthN/Z best practices. - Policy-as-code (OPA) and multi-tenant architecture (RBAC, quotas, usage metering). - Knowledge of Kong API Gateway, LaunchDarkly/Flipt for feature management, and NeMo Guardrails for runtime safety. - CI/CD exposure (build/test with GitHub Actions, deployments via Terraform/AWS IaC templates). Core Tech Stack (our core; equivalents welcome) - Python 3.11+, FastAPI, Pydantic v2, SQLAlchemy 2.X, Alembic, pytest. - Amazon EKS, AWS Bedrock, Amazon SQS/SNS, Amazon RDS (SQL Server/PostgreSQL), ElastiCache (Redis). - Amazon S3 for storage, Amazon ECR for container images, OpenSearch Serverless for vector storage. - AWS Secrets Manager, Okta IAM, NeMo Guardrails, Kong API Gateway. - OpenTelemetry + Datadog for observability and monitoring. - Custom RAG Services, Bedrock Knowledge Base, and LLM evaluation with Phoenix, Arize, and Promptfoo.


  • Senior AI Engineer

    1 day ago


    Kannur, India NextDimension AI Full time

    Compensation: INR 12-30 LPA Base + Bonus + EquityLocation: GurgaonNextDimension is a US-based technology startup building AI Agents in Healthcare, established by a team of distinguished AI/ML Scientists and Engineers from Google, Amazon, and Snowflake. We're empowering Enterprises by building sophisticated, high-impact AI agents that automate sales,...

  • AI Generalist

    4 weeks ago


    Kannur, India CEECO INTERNATIONAL CONSULTANCY Full time

    About Us CEECO International Consultancy is a trusted overseas education firm helping students secure admissions in top global universities, especially for MBBS and healthcare studies. Based in Kerala with a strong international presence, we offer expert guidance in counseling, admission, and visa processing. Our mission is to make global education...

  • AI Generalist

    4 days ago


    Kannur, Kerala, India CEECO INTERNATIONAL CONSULTANCY Full time ₹ 8,00,000 - ₹ 12,00,000 per year

    *AI Generalist**Location: Kannur | Experience: 3+ Years | Full-Time*About Us:*Ceeco International* is a leading overseas education consultancy driving digital transformation across its operations.Role:We are looking for a proactive AI Generalist to integrate AI tools across marketing, sales, HR, accounts, and operation improving efficiency and automating...


  • Kannur, India Domnic Lewis Full time

    Role : : Automation / Integration Architect Grade: Manager / Senior Manager Department / Division: IT Location: Navi Mumbai Reporting To: CIO Team Size: N/A Purpose of the Role: The Automation / Integration Architect will lead enterprise-wide automation initiatives using Microsoft Power Automate, RPA, AI/ML, and low-code platforms to enhance operational...

  • AI Consultant

    3 weeks ago


    Kannur, India Aventis Solutions Full time

    Aventis Solutions is igniting the AI revolution: Now, our tech partner is establishing a new AI Innovation Hub in Pune, India, and we are hiring AI Consultants to join this flagship build. This is a rare opportunity for ambitious graduates and early-career professionals to work at the intersection of business and technology, supporting AI and GenAI...

  • Cloud Architect

    3 weeks ago


    Kannur, India Whatjobs IN C2 Full time

    Job Summary: We are looking for an experienced GCP Cloud Architect with expertise in networking, security, and AI platforms to design and implement secure, scalable Google Cloud environments. This role involves creating GCP landing zones, private access architectures, and collaboration with AI and security teams for enterprise-scale workloads. Job...

  • Ai Engineer

    3 weeks ago


    Kannur, India Whatjobs IN C2 Full time

    QX Labs is looking for a full stack AI engineer with a strong software background to join our expanding team of AI experts. We are an early-stage London-based startup building cutting-edge workflow and data automation products for investment firms. Our flagship product ( ) is a platform that transforms how financial firms and investment companies source...

  • Sprinklr architect

    4 weeks ago


    Kannur, India Client Of Neerinfo Full time

    Project Role : Application DeveloperProject Role Description : Design, build and configure applications to meet business process and application requirements.Must have skills : SprinklrMinimum 12 year(s) of experience is requiredEducational Qualification : 15 years full time educationSummary:We are seeking a highly skilled Sprinklr Architect to lead the...

  • Oracle Analytics

    20 hours ago


    Kannur, India TribolaTech Inc Full time

    Oracle Analytics & AI Solutions ArchitectLocation: RemoteExperience : 10+ YearsOur client believes in connecting people and business to Insurance in ways that are Innovative, Hyper-Relevant, Compelling and Personal. They bring together the brightest minds to build the future of Insurance; a world where Insurance makes life and business easier, more...


  • Kannur, India DBiz.ai Full time

    The Enterprise Data Architect/Data Modeler is a senior technical leader responsible for defining, designing and governing the organization’s enterprise data architecture. You will ensure that data is modeled, structured and integrated across the organization to support analytics and AI/ML initiatives at scale. You will work closely with business...