Backend Engineer Generative AI

2 days ago


Bengaluru, Karnataka, India Impacto Digifin Technologies Full time

Role & resp

Backend Engineer Generative AI (Infra)

Experience Required: 2 5 Years

Location: Bangalore (In-office)Brookfeild

Employment Type: Full-Time

About the Role

We are looking for a Backend Engineer with experience in AI-based systems, particularly in building and maintaining Generative AI pipelines and scalable backend infrastructures. The ideal candidate will have a solid background in backend engineering, API design, and AI model integration, with a working understanding of LLMs, diffusion models, and multimodal AI applications.

You will be responsible for developing and managing backend systems that power AI services — from model APIs to data retrieval pipelines — ensuring they run efficiently and securely in production. Experience in banking or enterprise-grade applications is highly preferred, as the role requires a strong understanding of security, performance, and compliance standards.

Key Responsibilities

  • Backend Architecture & Development

Design and develop robust backend services to support AI and Generative AI workloads.

Build and optimize RESTful and GraphQL APIs for model inference, prompt processing, and data access.

Implement modular backend components for scalable, high-performance AI operations.

  • Generative AI Pipeline Integration

Collaborate with AI engineers to integrate and deploy Generative AI models (LLMs, diffusion models, text-to-image, speech-to-text, etc.) into production environments.

Implement retrieval-augmented generation (RAG) pipelines using Vector Databases like Pinecone, Weaviate, FAISS, or Milvus.

Handle AI model serving and inference optimization using frameworks like TensorFlow Serving, TorchServe, FastAPI, or custom endpoints.

  • API Management & Optimization

Develop and manage secure APIs for AI-driven services, ensuring version control, authentication (OAuth2, JWT), and scalability.

Integrate third-party AI APIs (OpenAI, Anthropic, Google Gemini, Stability AI, etc.) within internal systems.

Monitor API usage, optimize latency, and handle concurrency for high-throughput AI operations.

  • System & Environment Setup

Configure backend environments using Docker, Kubernetes, and CI/CD pipelines for automated deployments.

Manage scalable cloud environments on AWS, Azure, or GCP, ensuring load balancing and fault tolerance.

Set up development environments and backend utilities for cross-functional AI and data teams.

  • Performance & Security

Optimize server performance for inference-heavy AI pipelines through caching, queuing, and load balancing.

Ensure compliance with data privacy and security regulations, especially in financial and regulated domains.

Implement observability tools for monitoring, tracing, and performance analytics.

  • Collaboration

Work closely with AI/ML engineers, data scientists, frontend developers, and DevOps teams.

Contribute to architecture discussions, documentation, and backend process improvements.

Participate in agile ceremonies, code reviews, and team-level technical design sessions.

Required Technical Skills

  • Languages: Python (primary), and Java
  • Backend Frameworks: FastAPI, Flask, Django, , Spring Boot
  • AI & Generative Tools:

Familiarity with LLMs (GPT, Claude, Gemini, LLaMA, Mistral) and diffusion models (Stable Diffusion, Midjourney APIs)

Experience integrating transformer models via Hugging Face or OpenAI APIs

Understanding of prompt engineering concepts and model-serving best practices

  • Databases: PostgreSQL, MongoDB, Redis, Elasticsearch
  • Vector Databases: Pinecone, Weaviate, Milvus, FAISS (for RAG systems)
  • Infrastructure & DevOps: Docker, Kubernetes, Jenkins, GitLab CI/CD, AWS Lambda, ECS, or GCP Cloud Run
  • Monitoring: Prometheus, Grafana, ELK Stack, New Relic
  • API Technologies: REST, GraphQL, gRPC

Preferred Qualifications

  • Experience with AI product backend systems or enterprise/banking-grade software applications.
  • Exposure to AI pipeline orchestration tools (LangChain, LlamaIndex, or Haystack).
  • Understanding of model lifecycle management, vector embeddings, and inference scaling.
  • Familiarity with data encryption, tokenization, and compliance frameworks (PCI DSS, ISO 27001, etc.).

Soft Skills

  • Strong problem-solving and debugging skills.
  • Excellent communication and collaboration with cross-functional teams.
  • High attention to detail and performance optimization.
  • Curiosity and enthusiasm for emerging AI technologies and architectures.

Education

  • Bachelor's or Master's degree in Computer Science, Information Technology, Artificial Intelligence, or a related discipline.
  • Certifications in AI/ML, Cloud Architecture, or Backend Development are a plus.

Why Join Us

  • Work at the intersection of backend engineering and Generative AI innovation.
  • Build production-ready systems for intelligent, large-scale applications.
  • Collaborate with a team pushing the boundaries of applied AI and infrastructure optimization.

onsibilities

Preferred candidate profile



  • Bengaluru, Karnataka, India RB Consultancy Services Full time ₹ 15,00,000 - ₹ 28,00,000 per year

    Role:Backend Engineer – Generative AI & Agentic Systems (Python | LLMs)Role Description: We're evolving to integrate LLM-based intelligence into our ecosystem and looking for aPython Backend Engineer with expertise in building Agentic AI systems, LLM integrations, and context-aware RAG pipelines. This is a hands-on, full-time opportunity to help shape the...

  • Backend Engineer

    4 days ago


    Bengaluru, Karnataka, India Valenta AI Full time ₹ 15,00,000 - ₹ 25,00,000 per year

    Backend Engineer Freelance (Part-time / Full-time) | RemoteJob Title / Designation: Backend EngineerEngagement: Freelance Part-time or Full-timeStart Date: ImmediateWork Mode / Location: Remote (Base Locations: Bangalore, Vizag, Mohali – Valenta AI offices)About the RoleWe are building an automation platform that integrates sales, accounting, and marketing...


  • Bengaluru, Karnataka, India Deloitte Consulting Full time ₹ 20,00,000 - ₹ 25,00,000 per year

    Job DescriptionWe are seeking a talented Generative AI Engineer with a strong foundation in Python and backend engineering to build scalable AI-driven applications. This role blends hands-on development with cutting-edge AI research, focusing on integrating large language models (LLMs), prompt engineering, RAG systems, and vector databases to deliver...


  • Bengaluru, Karnataka, India Nexoria Techworks Inc. Full time ₹ 6,00,000 - ₹ 18,00,000 per year

    Job Description: Generative AI EngineerLocation: Remote / BangaloreEmployment Type: Full-timeDepartment: AI & ResearchIndustry: IT Services & ConsultingRole Category: AI/ML – Generative AIRole & Responsibilities:As aGenerative AI Engineer, you will be responsible for designing, developing, and deploying generative AI models, large language model (LLM)...

  • AI Backend engineer

    2 weeks ago


    Bengaluru, Karnataka, India SuperKalam Full time ₹ 1,60,00,000 - ₹ 2,20,00,000 per year

    Application form: What is this role?This role is for a fullstack engineer (backend focussed) who has worked on production-grade apps. You will find this role amazing IF - you have worked with python && pro in NodeJS && done solid work in Gen AI (beyond prompts and basic agents)About us?We are the crazy ones — shaping education with a deep commitment to our...

  • AI Backend engineer

    2 weeks ago


    Bengaluru, Karnataka, India SuperKalam (YC W23) Full time ₹ 16,00,000 - ₹ 22,00,000 per year

    Application form:What is this role?This role is for a fullstack engineer (backend focussed) who has worked on production-grade apps. You will find this role amazing IF - you have worked with python && pro in NodeJS && done solid work in Gen AI (beyond prompts and basic agents)About us?We are the crazy ones — shaping education with a deep commitment to our...

  • Backend Engineer

    1 week ago


    Bengaluru, Karnataka, India Yupp AI Full time ₹ 8,00,000 - ₹ 24,00,000 per year

    About YuppWe are a well-funded, rapidly growing, early-stage AI startup headquartered in Silicon Valley that is building a two-sided product -- one side meant for global consumers and the other side for AI builders and researchers. We work on the cutting edge of AI across the stack. Check out our product that was launched recently, and how it solves the...


  • Bengaluru, Karnataka, India Yupp AI Full time ₹ 20,00,000 - ₹ 25,00,000 per year

    About YuppWe are a well-funded, rapidly growing, early-stage AI startup headquartered in Silicon Valley that is building a two-sided product -- one side meant for global consumers and the other side for AI builders and researchers. We work on the cutting edge of AI across the stack. Check out our product that was launched recently, and how it solves the...


  • Bengaluru, Karnataka, India DecisionX AI Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    Location: Bengaluru / HybridExperience: 6–10 yearsType: Full-time | Founding Team | Equity + Compensation⸻About UsWe're building DecisionX AI – a Self-Learning, Goal-Aware Decision OS that transforms how enterprises move fromGoal → Plan → Act → Learn → Evolve.Our platform unifies siloed enterprise data, builds domain-rich ontologies, and powers...


  • Bengaluru, Karnataka, India AI Employees Inc Full time ₹ 4,20,000 - ₹ 10,80,000 per year

    Company DescriptionAI Employees Inc is a Canada-based startup building the next generation of AI-powered customer support agents for small businesses. Our flagship product is a SaaS platform that provides Voice and Chat AI Agents capable of handling phone calls, appointment bookings, rescheduling, FAQs, and customer interactions in real-time.We are on a...