Azure Cloud

6 days ago


Bandra, India Sereno Full time

Who you areYou're someone who’s already shipped GenAI stuff—even if it was small: a chatbot, a RAG tool, or an agent prototype. You live in Python, LangChain, LlamaIndex, Hugging Face, and vector DBs like FAISS or Milvus. You know your way around prompts—noisy chains, rerankers, retrievals. You've deployed models or services on Azure/AWS/GCP, wrapped them into FastAPI endpoints, and maybe even wired a bit of terraform/ARM. You’re not building from spreadsheets; you're iterating with real data, debugging hallucinations, and swapping out embeddings in production. You can read blog posts and paper intros, follow new methods like QLoRA, and build on them. You're fine with ambiguity and startup chaos—no strict specs, no roadmap, just a mission. You work in async Slack, ask quick questions, push code that works, and help teammates stay afloat. You're not satisfied with just getting things done—you want GenAI to feel reliable, usable, and maybe even fun.What you’ll actually doYou’ll build real GenAI features: agentic chatbots for document lookup, conversation assistants, or knowledge workflows. You’ll design and implement RAG systems: data ingestion, embeddings, vector indexing, retrievals, and prompt pipelines. You’ll write inference APIs in FastAPI that work with vector stores and cloud LLM endpoints. You’ll containerize services with Docker, push to Azure/AWS/GCP, wire basic CI/CD, monitor latency and faulty responses, and iterate fast. You’ll experiment with LoRA/QLoRA fine-tuning on small LLMs, test prompt variants, and measure output quality. You’ll collaborate with DevOps to ensure deployment reliability, QA to make tests more robust, and frontend folks to shape UX. You’ll share your work in quick “demo & dish” sessions: what's working, what's broken, what you're trying next. You’ll tweak embeddings, watch logs, and improve pipelines one experiment at a time. You’ll help write internal docs or “how-tos” so others can reuse your work.Skills and knowledgeYou have solid experience in Python backend development (FastAPI/Django)Experienced with LLM frameworks: LangChain, LlamaIndex, CrewAI, or similarComfortable with vector databases: FAISS, Pinecone, MilvusAble to fine-tune models using PEFT/LoRA/QLoRAKnowledge of embeddings, retrieval systems, RAG pipelines, and prompt engineeringFamiliar with cloud deployment and infra-as-code (Azure, AWS, GCP with Docker/K8s, Terraform/ARM)Good understanding of monitoring and observability—tracking response latency, hallucinations, and costsAble to read current research, try prototypes, and apply them pragmaticallyWorks well in minimal-structure startups; self-driven, team-minded, proactive communicator


  • Azure Cloud

    4 days ago


    Bandra, Maharashtra, India Sereno Full time

    Who you areYou're someone who's already shipped GenAI stuff—even if it was small: a chatbot, a RAG tool, or an agent prototype. You live in Python, LangChain, LlamaIndex, Hugging Face, and vector DBs like FAISS or Milvus. You know your way around prompts—noisy chains, rerankers, retrievals. You've deployed models or services on Azure/AWS/GCP, wrapped...

  • Azure Cloud

    4 days ago


    Bandra, India Sereno Full time

    Who you areYou're someone who’s already shipped GenAI stuff—even if it was small: a chatbot, a RAG tool, or an agent prototype. You live in Python, LangChain, LlamaIndex, Hugging Face, and vector DBs like FAISS or Milvus. You know your way around prompts—noisy chains, rerankers, retrievals. You've deployed models or services on Azure/AWS/GCP, wrapped...

  • Azure Cloud

    3 days ago


    Bandra, India Sereno Full time

    Who you areYou're someone who’s already shipped GenAI stuff—even if it was small: a chatbot, a RAG tool, or an agent prototype. You live in Python, LangChain, LlamaIndex, Hugging Face, and vector DBs like FAISS or Milvus. You know your way around prompts—noisy chains, rerankers, retrievals. You've deployed models or services on Azure/AWS/GCP, wrapped...

  • Azure Cloud

    5 days ago


    Bandra, India Sereno Full time

    Who you are You're someone who’s already shipped GenAI stuff—even if it was small: a chatbot, a RAG tool, or an agent prototype. You live in Python, LangChain, LlamaIndex, Hugging Face, and vector DBs like FAISS or Milvus. You know your way around prompts—noisy chains, rerankers, retrievals. You've deployed models or services on Azure/AWS/GCP, wrapped...

  • Azure Cloud

    5 days ago


    Bandra, India Sereno Full time

    Who you are You're someone who’s already shipped GenAI stuff—even if it was small: a chatbot, a RAG tool, or an agent prototype. You live in Python, LangChain, LlamaIndex, Hugging Face, and vector DBs like FAISS or Milvus. You know your way around prompts—noisy chains, rerankers, retrievals. You've deployed models or services on Azure/AWS/GCP, wrapped...

  • Azure Cloud

    5 days ago


    Bandra, India Sereno Full time

    Who you areYou're someone who’s already shipped GenAI stuff—even if it was small: a chatbot, a RAG tool, or an agent prototype. You live in Python, LangChain, LlamaIndex, Hugging Face, and vector DBs like FAISS or Milvus. You know your way around prompts—noisy chains, rerankers, retrievals. You've deployed models or services on Azure/AWS/GCP, wrapped...

  • Azure Cloud

    6 days ago


    Bandra, India Sereno Full time

    Who you areYou're someone who’s already shipped GenAI stuff—even if it was small: a chatbot, a RAG tool, or an agent prototype. You live in Python, LangChain, LlamaIndex, Hugging Face, and vector DBs like FAISS or Milvus. You know your way around prompts—noisy chains, rerankers, retrievals. You've deployed models or services on Azure/AWS/GCP, wrapped...

  • Azure Cloud

    5 days ago


    Bandra, Maharashtra, India, Mumbai Sereno Full time

    Who you areYou're someone who’s already shipped GenAI stuff—even if it was small: a chatbot, a RAG tool, or an agent prototype. You live in Python, LangChain, LlamaIndex, Hugging Face, and vector DBs like FAISS or Milvus. You know your way around prompts—noisy chains, rerankers, retrievals. You've deployed models or services on Azure/AWS/GCP, wrapped...


  • Mumbai, bandra kurla complex, India TYD Ideas Full time ₹ 2,50,000 - ₹ 12,00,000 per year

    Job responsibilities:-Work on with full stack development using C# .Net & Angular 8+ (both UI and back-end services). -Enhance existing systems by analyzing business objectives, preparing an action plan and identifying areas for modification and improvement -Prepare and maintain code for various .NET applications and resolve any defects in systems. ...


  • Bandra, Mumbai, Maharashtra, India Azent Overseas Education Full time ₹ 15,00,000 - ₹ 20,00,000 per year

    Azent Overseas Education:Roles & Responsibilities:Work with the Business & Sales Teams, understand the requirements and implement itBuild the code independently according to the technical specifications, detailed design, maintainability, and coding and efficiency standardsCreate a technical design from Functional Design Doc / Requirements Doc and be able to...