Data & ai engineer (data pipelines & rag)

20 hours ago


Rajahmundry, India Pro5.ai Full time

Data & AI Engineer (Data Pipelines & RAG) We are seeking a versatile Data & AI Engineer with 4-7 years of experience to build, deploy & maintain end-to-end data pipelines for downstream Gen AI applications. You’ll design data models and transformations, build scalable ETL/ELT workflows, while learning fast and working on the AI agent space. Key Responsibilities Data Modeling & Pipeline development Automate data ingestion from diverse sources (Databases, APIs, files, Sharepoint/ document management tools, URLs). Most files are expected to be unstructured documents with different file formats, tables, charts, process flows, schedules, construction layouts/drawings, etc. Own chunking strategy, embedding, indexing all unstructured & structured data for efficient retrieval by downstream RAG/agent systems Build, test, and maintain robust ETL/ELT workflows using Spark (batch & streaming) Define and implement logical/physical data models and schemas. Develop schema mapping and data dictionary artifacts for cross-system consistency Gen AI Integration Instrument data pipelines to surface real-time context into LLM prompts Implement prompt engineering and RAG for varied workflows within the RE/Construction industry vertical Observability & Governance Implement monitoring, alerting, and logging (data quality, latency, errors) Apply access controls and data privacy safeguards (e.g., Unity Catalog, IAM) CI/CD & Automation Develop automated testing, versioning, and deployment (Azure Dev Ops, Git Hub Actions, Prefect/Airflow) Maintain reproducible environments with infrastructure as code (Terraform, ARM templates) Required Skills & Experience 5 years in Data Engineering or similar role, with at least 12-24 months of exposure to building pipelines for unstructured data extraction including document processing with OCR, cloud-native solutions and chunking, indexing etc. for downstream consumption by RAG/ Gen AI applications. Proficiency in Python, dlt for ETL/ELT pipeline, duck DB or equivalent tools for analytical in-process analysis, dvc for managing large files efficiently. Solid SQL skills and experience designing and scaling relational databases. Familiarity with non-relational column based databases is preferred. Familiarity with Prefect is preferred or others (e.g. Azure Data Factory) Proficiency with the Azure ecosystem. Should have worked on Azure services in production. Familiarity with RAG indexing, chunking and storage across file types for efficient retrieval. Strong Dev Ops/Git workflows and CI/CD (Circle CI / Azure Dev Ops) Experience deploying ML artifacts using MLflow, Docker, or Kubernetes is good to have. Bonus skillsets: Experience with Computer vision based extraction or experience in building ML models for production Knowledge of agentic AI system design - memory, tools, context, orchestration Knowledge of data governance, privacy laws (GDPR) and enterprise security patterns We are an early-stage startup, so you are expected to wear many hats, working with things out of your comfort zone, but with real and direct impact in production. If you think you are a good fit for this fast-paced environment, please apply.


  • AI Intern

    1 week ago


    Rajahmundry, India Job Listings by Babblebots Full time

    Babblebots is running a 𝗔𝗜 𝗝𝗼𝗯 𝗙𝗲𝘀𝘁 to help startups to hire AI Engineers/AI Interns. These roles involve building production-grade software using cutting-edge AI and ML technologies across the full AI lifecycle, with work spanning traditional ML and computer vision as well as modern Generative AI (GenAI) and Large Language Models...

  • AI Engineer

    4 weeks ago


    Rajahmundry, India Uplevyl Full time

    Job Title: AI Engineer Location: Onsite – Noida Type: Full-time About Us At Uplevyl, we're redefining what intelligent communities look like. Through our AI-powered agents, we’re building scalable, agentic community systems that reduce manual work and increase member engagement—especially for women-centric organizations. We’re looking for...

  • Data AI Innovator

    2 weeks ago


    rajahmundry, India beBeeMachineLearning Full time

    Artificial Intelligence EngineerImprove your career through cutting-edge technology and innovative ideas. Our organization is committed to creating a work environment that values diversity, equity, and inclusion.Develop and implement end-to-end GenAI powered RAG & multi-agent systems:Provide guidance on system architecture & components:Build TTD system...


  • rajahmundry, India beBeeDataEngineer Full time

    Senior Data Engineering PositionWe are seeking an experienced Senior Data Engineer to design, develop and maintain large-scale data pipelines using Google Cloud Platform (GCP) services.Responsibilities:Design, develop, test and deploy scalable data processing solutionsCollaborate with cross-functional teams to meet business objectivesRequirements:5+ years of...


  • rajahmundry, India beBeeDataEngineer Full time

    Job TitleWe are seeking a highly skilled Data Engineer with strong experience in Python, PySpark, Snowflake, and AWS Glue to join our data team.Job Description:The ideal candidate will be responsible for designing, developing, and maintaining scalable and reliable data pipelines that drive business insights and operational efficiency.Design, develop, and...


  • rajahmundry, India beBeeDataScientist Full time

    Senior Data ScientistThis role involves developing and deploying AI/ML models to solve complex business problems. The ideal candidate should have hands-on expertise in Generative AI, Retrieval-Augmented Generation (RAG), and Deep Learning.Key Responsibilities:Develop and deploy Generative AI models including LLMs, diffusion models, and transformers.Design...

  • AI Platform Architect

    2 weeks ago


    rajahmundry, India beBeeMachineLearning Full time

    Job OverviewWe are seeking a highly skilled AI Platform Engineer to design, build, and operate our next-generation AI application platform. In this role, you will work on advanced AI systems including Retrieval-Augmented Generation pipelines, multi-model gateways, Model Context Protocol tools, agentic workflow automations, and secure chat interfaces.You will...


  • rajahmundry, India beBeeMachine Full time

    AI Innovator RoleAs an AI Innovator, you will drive the development of cutting-edge AI solutions to tackle real-world challenges. Leveraging state-of-the-art technologies, including NLP and Generative AI, you will create pioneering Retrieval-Augmented Generation (RAG) pipelines and agentic workflows.Key ResponsibilitiesDesign and implement intelligent AI...


  • rajahmundry, India beBeeDataEngineer Full time

    We're looking for a talented Data Engineer to join our team.Job DescriptionAs a Data Engineer, you'll play a key role in designing and developing ETL processes that drive business value.Create efficient data pipelines using advanced technologies like ETL toolsValidate data extraction, transformation, and loading processes for accuracy and completenessAnalyze...

  • AI Systems Architect

    2 weeks ago


    rajahmundry, India beBeeArtificial Full time

    Job Title: AI Systems ArchitectWe are seeking a skilled Ai Systems Architect to design, develop and implement cutting-edge artificial intelligence systems.The ideal candidate will have strong expertise in both backend engineering and machine learning. They will work closely with our team to create intelligent solutions that drive business growth.Key...