
GenAI Platform Architect
17 hours ago
Transform Cerebry Research designs into production-grade GenAI features—retrieval-grounded, safe, observable, and ready for seamless product rollout. Architect, code, evaluate, and package GenAI services that power Cerebry end-to-end.
- Founder-mindset equity. We emphasize meaningful ownership from day one.
- Upside compounds with impact. Initial grants are designed for real participation in value creation, with refresh opportunities tied to scope and milestones.
- Transparent offers. We share the full comp picture (salary, equity targets, vesting cadence, strike/valuation context) during the process.
- Long-term alignment. Packages are crafted for builders who want to grow the platform and their stake as it scales.
- Retrieval & data grounding: connectors for warehouses/blobs/APIs; schema validation and PII-aware pipelines; chunking/embeddings; hybrid search with rerankers; multi-tenant index management.
- Orchestration & reasoning: function/tool calling with structured outputs; controller logic for agent workflows; context/prompt management with citations and provenance.
- Evaluation & observability: gold sets + LLM-as-judge; regression suites in CI; dataset/version tracking; traces with token/latency/cost attribution.
- Safety & governance: input/output filtering, policy tests, prompt hardening, auditable decisions.
- Performance & efficiency: streaming, caching, prompt compression, batching; adaptive routing across models/providers; fallback and circuit-breaker strategies.
- Product-ready packaging: versioned APIs/SDKs/CLIs, Helm/Terraform, config schemas, feature flags, progressive delivery playbooks.
- Quality: higher factuality, task success, and user trust across domains.
- Speed: rapid time-to-value via templates, IaC, and repeatable rollout paths.
- Unit economics: measurable gains in latency and token efficiency at scale.
- Reliability: clear SLOs, rich telemetry, and smooth, regression-free releases.
- Reusability: template repos, connectors, and platform components adopted across product teams.
- Collaborate asynchronously with Research, Product, and Infra/SRE.
- Share designs via concise docs and PRs; ship behind flags; measure, iterate, and document.
- Enable product teams through well-factored packages, SDKs, and runbooks.
- LLMs & providers: OpenAI, Anthropic, Google, Azure OpenAI, AWS Bedrock; targeted OSS where it fits.
- Orchestration/evals: LangChain/LlamaIndex or lightweight custom layers; test/eval harnesses.
- Retrieval: pgvector/FAISS/Pinecone/Weaviate; hybrid search + rerankers.
- Services & data: Python (primary), TypeScript; FastAPI/Flask/Express; Postgres/BigQuery; Redis; queues.
- Ops: Docker, CI/CD, Terraform/CDK, metrics/logs/traces; deep experience in at least one of AWS/Azure/GCP.
- A track record of shipping and operating GenAI/ML-backed applications in production.
- Strong Python, solid SQL, and systems design skills (concurrency, caching, queues, backpressure).
- Hands-on RAG experience (indexing quality, retrieval/reranking) and function/tool use patterns.
- Experience designing eval pipelines and using telemetry to guide improvements.
- Clear, concise technical writing (design docs, runbooks, PRs).
- Evaluation scores (task success, factuality) trending upward
- Latency and token-cost improvements per feature
- SLO attainment and incident trends
- Adoption of templates/connectors/IaC across product teams
- Clarity and usage of documentation and recorded walkthroughs
- Focused coding exercise (2–3h): ingestion → retrieval → tool-calling endpoint with tests, traces, and evals
- Systems design (60m): multi-tenant GenAI service, reliability, and rollout strategy
- GenAI deep dive (45m): RAG, guardrails, eval design, and cost/latency tradeoffs
- Docs review (30m): discuss a short design doc or runbook you’ve written (or from the exercise)
- Founder conversation (30m)
Share links to code (GitHub/PRs/gists) or architecture docs you authored, plus a brief note on a GenAI system you built—problem, approach, metrics, and improvements over time.
Email: info@cerebry.co