▷ [3 Days Left] Python Developer - RAG/LLM Model

2 days ago


Pune, India Xpetize Technology Solutions Private Limited Full time

Job Description

Job Description

- Design, develop, and maintain backend services using Flask and Django for integrating and deploying RAG-based LLM models.
- Develop RESTful APIs and data pipelines to interact with AI models and integrate them into client-facing applications.
- Build and maintain database models, manage large data sources, and optimize API endpoints for performance.
-

Model Integration And Deployment

- Implement and integrate RAG-based LLM models into production environments using Flask and Django frameworks.
- Work closely with AI and data science teams to ensure proper data flow and retrieval between the model and backend systems.
- Optimize model performance for inference efficiency, memory management, and response time.
-

Model Optimization And Fine-Tuning

- Fine-tune and optimize LLMs to meet specific business use cases, such as content generation, summarization, and question answering.
- Collaborate with machine learning engineers to improve accuracy, reduce inference times, and scale model usage.
- Monitor and debug model performance and address any issues related to latency or correctness in the model's output.
-

Data Management & Analysis

- Handle large datasets and develop data pipelines to provide inputs for training and testing models.
- Ensure proper preprocessing of input data to enhance model performance.
- Write and optimize SQL/NoSQL queries for data extraction, transformation, and storage related to the models.
-

Collaboration & Continuous Improvement

- Collaborate with cross-functional teams, including product managers, AI researchers, and front-end developers, to design solutions that meet business goals.
- Write unit and integration tests to ensure the quality of backend components.
- Follow Agile practices for development, including participating in sprint planning, stand-ups, and code reviews.
-

Model Monitoring And Maintenance

- Implement monitoring systems to ensure model performance, and address any degradation or anomalies.
- Continuously discover and implement new techniques to improve the efficiency and reliability of the deployed models.
-

Required Skills And Qualifications

- 3+ years of experience in Python development with expertise in Flask and Django.
- Strong understanding of AI/ML concepts and experience working with Retriever-Augmented Generation (RAG) models and LLMs (e., GPT, BERT, T5).
- Experience with model deployment and integration of AI models into backend applications.
- Proficiency in Python, including working with libraries such as TensorFlow, PyTorch, Hugging Face Transformers, and spaCy.
- Experience with developing RESTful APIs using Flask or Django REST Framework (DRF).
- SQL/NoSQL Database experience for managing and retrieving large datasets.
- Version control using Git for collaboration.
- Familiarity with deploying applications to cloud platforms (AWS, GCP, Azure).
- Strong problem-solving skills and ability to troubleshoot complex production issues.
-

Preferred Skills

- Knowledge of CI/CD pipelines for deploying Python applications.
- Familiarity with Docker and Kubernetes for containerization and orchestration of model deployments.
- Experience with Natural Language Processing (NLP) and data preprocessing techniques.
- Knowledge of distributed computing for scaling AI models.
- Familiarity with tools like FastAPI for faster API development and deployment.
- Knowledge of AI performance monitoring and optimization techniques for large models in production.
- Exposure to marketing analytics, sentiment analysis, or content generation with LLMs.
-

Educational Qualifications

- Bachelor's degree in Computer Science, Software Engineering, Artificial Intelligence, or related field.
- Master's degree in a relevant field is a plus



  • Pune, India AKSHAYA BUSINESS IT SOLUTIONS PRIVATE LIMITED Full time

    Position name : Python AI Backend EngineerExperience : 4+YrsLocation : Pune, BalewadiNotice Period : Immediate to 30 Days ,Serving Mode : Face to Face on Saturday on 2nd Aug 2025Key responsibilities :Description :We are seeking Python AI Backend Engineers to play a pivotal role in building our Agentic Workflow Service and Retrieval-Augmented Generation...


  • Pune, India TEAM GEEK SOLUTIONS PRIVATE LIMITED Full time

    Job Title : Generative AI Developer.Experience Required : 5+ years.Location : Pune.Employment Type : Full-time.Job Summary :We are seeking an experienced Generative AI Developer with a strong background in Python, modern web frameworks, and advanced AI concepts such as LLMs and RAG pipelines.The ideal candidate will be responsible for building, deploying,...

  • Python Developer

    2 weeks ago


    Pune, Maharashtra, India CA-One Tech Cloud Full time

    Job Title : Python Developer GenAI/LLM (C2H Position)Experience : 5+ YearsWork Mode : Remote / Hybrid (as per client the Role :We are seeking an experienced Python Developer with strong expertise in Generative AI (GenAI), Retrieval-Augmented Generation (RAG), and Large Language Models (LLMs) to join our team on a Contract-to-Hire (C2H) basis. The ideal...


  • Pune, India sfHawk Solutions Pvt Ltd Full time

    Role Overview :InnovaPoint is seeking a dynamic Senior AI Engineer to lead the development and deployment of cutting-edge AI solutions. This role is ideal for a self-driven technologist with deep expertise in AI/ML and a passion for building intelligent systems such as AI chatbots, agents, automation tools, and recommendation engines.You will be responsible...


  • Pune, Maharashtra, India Bosch Full time

    Company Description Bosch Global Software Technologies Private Limited is a 100 owned subsidiary of Robert Bosch GmbH one of the world s leading global supplier of technology and services offering end-to-end Engineering IT and Business Solutions With over 28 200 associates it s the largest software development center of Bosch outside Germany ...


  • Pune, India Enterprise Minds, Inc Full time

    Job Title: AI/ML Engineer – Generative AI Location : PuneRole Summary The AI/ML Engineer – Generative AI & RAG will be responsible for designing, developing, and deploying state-of-the-art machine learning models with a focus on Generative AI and Retrieval-Augmented Generation (RAG). The role involves building scalable solutions using Python, large...

  • AI/ML Engineer

    2 days ago


    Pune, India Left Right Mind Full time

    Job Overview: We are seeking a highly skilled and motivated Senior AI Agent Developer to join our dynamic team. The ideal candidate should have experience in developing AI-driven agents, particularly in the context of Generative AI (Gen AI), Retrieval-Augmented Generation (RAG), and Large Language Models (LLMs). You will contribute to the development of...


  • Pune, India TheBriminc Full time

    Key Responsibilities : - Design and Implement Intelligent Agents : Lead the architecture, development, and deployment of sophisticated multi-step intelligent agents using LangGraph for complex workflows.- Integrate and Optimize AI Tools : Leverage MCP tools effectively within agent designs to enhance functionality and performance.- Cloud-Native Deployment :...

  • AI/ML Developer

    5 days ago


    Pune, India NPG Consultants Full time

    Job Description :We are seeking a highly skilled AI/ML Developer with expertise in Python to build scalable AI systems for production. You will develop machine learning and LLM-based AI applications using cutting-edge technologies like Retrieval Augmented Generation (RAG), LangChain, and vector databases.Key Responsibilities :- Design and implement ML models...


  • Pune, India Zorba AI Full time

    Job Description Primary Title: Senior LLM Engineer (4+ years) Hybrid, India About The Opportunity A technology consulting firm operating at the intersection of Enterprise AI, Generative AI and Cloud Engineering seeks an experienced LLM-focused engineer. You will build and productionize LLM-powered products and integrations for enterprise customers across...