AI Data Platform Reliability

2 weeks ago


New Delhi, India Oracle Full time

Responsibilities- Design, develop, and execute end-to-end (E2E) scenario validations that simulate real-world usage of complex AI data platform workflows (data ingestion, transformation, ML pipeline orchestration, etc.). - Collaborate closely with product, engineering, and field teams to identify gaps in coverage and propose test automation strategies. - Develop and maintain automated test frameworks supporting E2E, integration, performance, and regression testing for distributed data/AI services - Monitor system health across the stack (infrastructure, data pipelines, AI/ML workloads), proactively detect failures or SLA breaches. - Champion SRE best practices including observability, incident management, blameless postmortems, and runbook automation. - Analyze logs, traces, and metrics to identify reliability, latency, and scalability issues; drive root cause analysis and corrective actions. - Partner with engineering to drive high-availability, fault tolerance, and continuous delivery (CI/CD) improvements. - Participate in on-call rotation to support critical services, ensuring rapid resolution and minimizing customer impact.Desired Qualifications:- Bachelor’s or master’s degree in computer science, Engineering, or related field (or demonstrated equivalent experience) - 3+ years’ experience in software QA/validation, SRE, or DevOps roles, ideally in data platforms, cloud, or AI/ML environments. - Proficient with DevOps automation and tools for continuous integration, deployment, and monitoring (e.g., Terraform, Jenkins, GitLab CI/CD, Prometheus). - Working knowledge of distributed systems, data engineering pipelines, and cloud-native architectures (OCI, AWS, Azure, GCP, etc.). - Strong proficiency in Java, Python and related technologies - Hands-on experience with test automation frameworks (e.g., Selenium, pytest, JUnit) and scripting (Python, Bash, etc.). - Familiarity with SRE practices: service-level objectives (SLO/SLA), incident response, observability (Prometheus, Grafana, ELK, etc.). - Strong troubleshooting and analytical skills with a passion for reliability engineering and process automation. - Excellent communication and cross-team collaboration abilities.



  • New Delhi, India Oracle Full time

    ResponsibilitiesKey Responsibilities:- Design, develop, and execute end-to-end (E2E) scenario validations that simulate real-world usage of complex AI data platform workflows (data ingestion, transformation, ML pipeline orchestration, etc.). - Collaborate closely with product, engineering, and field teams to identify gaps in coverage and propose test...


  • New Delhi, India Oracle Full time

    ResponsibilitiesKey Responsibilities:- Design, develop, and execute end-to-end (E2E) scenario validations that simulate real-world usage of complex AI data platform workflows (data ingestion, transformation, ML pipeline orchestration, etc.). - Collaborate closely with product, engineering, and field teams to identify gaps in coverage and propose test...


  • New Delhi, India Oracle Full time

    ResponsibilitiesKey Responsibilities:Design, develop, and execute end-to-end (E2E) scenario validations that simulate real-world usage of complex AI data platform workflows (data ingestion, transformation, ML pipeline orchestration, etc.). Collaborate closely with product, engineering, and field teams to identify gaps in coverage and propose test automation...


  • New Delhi, India Saaki Argus and Averil Consulting Full time

    Role Summary: ThePrincipal Architect, Generative AI & Data Platformsis a strategic role responsible for architecting and operationalizing the end-to-end data ecosystem that powersGenerative AI, Machine Learning (ML), and advanced analytics . This role focuses on building a robust, scalable, and compliant platform—leveraging modern cloud and Gen AI...


  • New Delhi, India BharatGen Full time

    Job Summary:BharatGen is on a mission to create AI that truly represents the diversity, culture, and unique context of India. At the heart of this mission lies the need for robust, scalable infrastructure to build multilingual and multimodal datasets that power foundational AI models. We're seeking a skilled Data Platform Engineer to build scalable tools,...


  • New Delhi, India Wayfair Full time

    We are looking for an experienced architect to join our AI Platforms team that is building Wayfair's cutting edge solutions to the complex challenges teams encounter when developing AI applications at scale. The work includes but is not limited to building platforms, frameworks, and services that solve the difficult problems of observability, tool...

  • AI Platform Engineer

    2 weeks ago


    New Delhi, India BayOne Solutions Full time

    Job DescriptionWe are seeking a highly skilled AI Platform Engineer to design, build, and operate our next-generation AI application platform. In this role, you will work on advanced AI systems including Retrieval-Augmented Generation (RAG) pipelines, multi-model gateways, Model Context Protocol (MCP) tools, agentic workflow automations (e.g., n8n), and...


  • New Delhi, India Antriksh Cloud Private Limited Full time

    Company Description Antriksh Cloud Private Limited is a leader in sustainable AI infrastructure, specializing in eco-efficient GPU data centers powered by hydroelectric energy. We enable global AI innovation with scalable and energy-efficient GPU-as-a-service solutions that are tailored to the diverse needs of our clients. With a focus on environmental...


  • New Delhi, India BayOne Solutions Full time

    Job Description We are seeking a highly skilledAI Platform Engineerto design, build, and operate our next-generationAI application platform . In this role, you will work on advanced AI systems includingRetrieval-Augmented Generation (RAG)pipelines,multi-model gateways ,Model Context Protocol (MCP) tools ,agentic workflow automations(e.g., n8n), and secure...

  • Data engineer

    3 days ago


    New Delhi, India Bloom AI Full time

    Company SummaryBloom AI is a modern intelligence layer that accelerates decision-making through AI-driven synthesized insights. We empower enterprises to unlock the value of data with human-like synthesis and decision intelligence at scale. Our proprietary tools and solutions are trusted by investment managers, insurance, private equity, and Fortune 1000...