Senior AI Data Platform Reliability

2 days ago


india Oracle Full time

DescriptionOracle's AI Data Platform is accelerating enterprise AI and redefining how AI applications are built. The AI Data Platform team is seeking an experience engineer to help drive AI platform reliability. This role is vital to ensuring our enterprise-scale, AI-powered data platform is robust, performant, and reliable. You will develop and execute end-to-end scenario tests across distributed systems, You will design and execute end-to-end scenario tests across distributed systems, and partner with engineering and architecture teams to develop tooling that improves and maintains the platform. You will also embed operational excellence by applying modern SRE practices.Responsibilities Key Responsibilities:Design, develop, and execute end-to-end (E2E) scenario validations that simulate real-world usage of complex AI data platform workflows (data ingestion, transformation, ML pipeline orchestration, etc.).Collaborate closely with product, engineering, and field teams to identify gaps in coverage and propose test automation strategies.Develop and maintain automated test frameworks supporting E2E, integration, performance, and regression testing for distributed data/AI servicesMonitor system health across the stack (infrastructure, data pipelines, AI/ML workloads), proactively detect failures or SLA breaches.Champion SRE best practices including observability, incident management, blameless postmortems, and runbook automation.Analyze logs, traces, and metrics to identify reliability, latency, and scalability issues; drive root cause analysis and corrective actions.Partner with engineering to drive high-availability, fault tolerance, and continuous delivery (CI/CD) improvements.Participate in on-call rotation to support critical services, ensuring rapid resolution and minimizing customer impact. Desired Qualifications:Bachelor's or master's degree in computer science, Engineering, or related field (or demonstrated equivalent experience)3+ years' experience in software QA/validation, SRE, or DevOps roles, ideally in data platforms, cloud, or AI/ML environments.Proficient with DevOps automation and tools for continuous integration, deployment, and monitoring (e.g., Terraform, Jenkins, GitLab CI/CD, Prometheus). Working knowledge of distributed systems, data engineering pipelines, and cloud-native architectures (OCI, AWS, Azure, GCP, etc.).Strong proficiency in Java, Python and related technologiesHands-on experience with test automation frameworks (e.g., Selenium, pytest, JUnit) and scripting (Python, Bash, etc.).Familiarity with SRE practices: service-level objectives (SLO/SLA), incident response, observability (Prometheus, Grafana, ELK, etc.).Strong troubleshooting and analytical skills with a passion for reliability engineering and process automation.Excellent communication and cross-team collaboration / infrastructure QualificationsCareer Level - IC3



  • India Oracle Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    DescriptionOracle's AI Data Platform is accelerating enterprise AI and redefining how AI applications are built. The AI Data Platform team is seeking an experience engineer to help drive AI platform reliability. This role is vital to ensuring our enterprise-scale, AI-powered data platform is robust, performant, and reliable. You will develop and execute...


  • India Oracle Full time

    Job Description Job Summary: Oracle's AI Data Platform is accelerating enterprise AI and redefining how AI applications are built. The AI Data Platform team is seeking an experience engineer to help drive AI platform reliability. This role is vital to ensuring our enterprise-scale, AI-powered data platform is robust, performant, and reliable. You will...


  • India Oracle Full time

    Job Description Oracle's AI Data Platform is accelerating enterprise AI and redefining how AI applications are built. The AI Data Platform team is seeking an experience engineer to help drive AI platform reliability. This role is vital to ensuring our enterprise-scale, AI-powered data platform is robust, performant, and reliable. You will develop and execute...


  • India Weekday AI Full time

    This role is for one of the Weekday s clients Salary range Rs 6000000 - Rs 9000000 ie INR 60-90 LPA Min Experience 7 years Location Remote India JobType full-time As a Senior Applied AI Engineer you will be responsible for designing building and productionizing advanced AI systems powered by Large Language Models LLMs and intelligent agents You ll work on...


  • India DataOrbit AI Full time

    Company Description DataOrbit AI is a leading AI consulting firm dedicated to empowering small and medium investment managers to achieve AI readiness. By uniting strategy, data preparation, technology, and execution, we enable organizations to adopt AI solutions that unlock growth opportunities and operational efficiencies. With expertise in AI strategy,...


  • India DataOrbit AI Full time

    Company Description DataOrbit AI is a leading AI consulting firm dedicated to empowering small and medium investment managers to achieve AI readiness. By uniting strategy, data preparation, technology, and execution, we enable organizations to adopt AI solutions that unlock growth opportunities and operational efficiencies. With expertise in AI strategy,...


  • India DataOrbit AI Full time

    Company Description DataOrbit AI is a leading AI consulting firm dedicated to empowering small and medium investment managers to achieve AI readiness. By uniting strategy, data preparation, technology, and execution, we enable organizations to adopt AI solutions that unlock growth opportunities and operational efficiencies. With expertise in AI strategy,...


  • Chennai, India Platform Science Full time

    Job Description Who We Are At Platform Science, we're working to connect everything that moves. Founded in 2015, we are an open IoT platform that partners with innovative fleets, application developers, vehicle manufacturers, and equipment providers in the transportation industry to deliver revolutionary solutions to supply chain professionals across the...


  • India DataOrbit AI Full time

    DataOrbit AI is a leading AI consulting firm dedicated to empowering small and medium investment managers to achieve AI readiness. By uniting strategy, data preparation, technology, and execution, we enable organizations to adopt AI solutions that unlock growth opportunities and operational efficiencies. With expertise in AI strategy, data readiness, and...


  • Bengaluru, India TymeX Full time

    Job Description About Us At TymeX, we're empowering Tyme Group's digital banking ecosystem across Asia and Africa. Our mission: build a secure, scalable, intelligent data & AI platform that delivers real-time, personalized experiences to millions of users. We are seeking a Senior Solutions Architect - Data & AI Platforms someone who blends deep technical...