Cerebry - Generative AI Engineer - LLama

3 days ago


Noida, India Cerebry Full time

What you'll build :

Retrieval & data grounding :


- Connectors for warehouses/blobs/APIs; schema validation and PII-aware pipelines; chunking/embeddings; hybrid search with rerankers; multi-tenant index management.

Orchestration & reasoning :


- Function/tool calling with structured outputs; controller logic for agent workflows; context/prompt management with citations and provenance.

Evaluation & observability :


- Gold sets + LLM-as-judge; regression suites in CI; dataset/version tracking; traces with token/latency/cost attribution.

Safety & governance :


- Input/output filtering, policy tests, prompt hardening, auditable decisions.

Performance & efficiency :


- Streaming, caching, prompt compression, batching; adaptive routing across models/providers; fallback and circuit-breaker strategies.

Product-ready packaging :


- Versioned APIs/SDKs/CLIs, Helm/Terraform, config schemas, feature flags, progressive delivery playbooks.

How you'll work :

- Collaborate asynchronously with Research, Product, and Infra/SRE.

- Share designs via concise docs and PRs; ship behind flags; measure, iterate, and document.

- Enable product teams through well-factored packages, SDKs, and runbooks.

Tech you'll use :

LLMs & providers :


- OpenAI, Anthropic, Google, Azure OpenAI, AWS Bedrock; targeted OSS where it fits.

Orchestration/evals :


- LangChain/LlamaIndex or lightweight custom layers; test/eval harnesses.

Services & data :


- Python (primary), TypeScript; FastAPI/Flask/Express; Postgres/BigQuery; Redis; queues.

Ops :


- Docker, CI/CD, Terraform/CDK, metrics/logs/traces; deep experience in at least one of AWS/Azure/GCP.


(ref:hirist.tech)

  • Noida, India Cerebry Full time

    Mission: Transform Cerebry Research designs into production-grade GenAI features that are retrieval-grounded, safe, observable, and ready for seamless product rollout. Architect, code, evaluate, and package GenAI services that power Cerebry end-to-end. Why this is exciting (Ownership-Forward): Founder-mindset equity. We emphasize meaningful ownership from day one....


