Senior AI Data Platform Reliability

4 days ago


Bangalore Division, India Oracle Full time

Key Responsibilities:

  • Design, develop, and execute end-to-end (E2E) scenario validations that simulate real-world usage of complex AI data platform workflows (data ingestion, transformation, ML pipeline orchestration, etc.).
  • Collaborate closely with product, engineering, and field teams to identify gaps in coverage and propose test automation strategies.
  • Develop and maintain automated test frameworks supporting E2E, integration, performance, and regression testing for distributed data/AI services.
  • Monitor system health across the stack (infrastructure, data pipelines, AI/ML workloads) and proactively detect failures or SLA breaches.
  • Champion SRE best practices, including observability, incident management, blameless postmortems, and runbook automation.
  • Analyze logs, traces, and metrics to identify reliability, latency, and scalability issues; drive root cause analysis and corrective actions.
  • Partner with engineering to drive high-availability, fault tolerance, and continuous delivery (CI/CD) improvements.
  • Participate in on-call rotation to support critical services, ensuring rapid resolution and minimizing customer impact.

Desired Qualifications:

  • Bachelor’s or master’s degree in Computer Science, Engineering, or a related field (or demonstrated equivalent experience).
  • 5+ years’ experience in software QA/validation, SRE, or DevOps roles, ideally in data platforms, cloud, or AI/ML environments.
  • Proficient with DevOps automation and tools for continuous integration, deployment, and monitoring (e.g., Terraform, Jenkins, GitLab CI/CD, Prometheus).
  • Working knowledge of distributed systems, data engineering pipelines, and cloud-native architectures (OCI, AWS, Azure, GCP, etc.).
  • Strong proficiency in Java, Python, and related technologies.
  • Hands-on experience with test automation frameworks (e.g., Selenium, pytest, JUnit) and scripting (Python, Bash, etc.).
  • Familiarity with SRE practices: service-level objectives (SLO/SLA), incident response, observability (Prometheus, Grafana, ELK, etc.).
  • Strong troubleshooting and analytical skills with a passion for reliability engineering and process automation.
  • Excellent communication and cross-team collaboration abilities.



  • Senior AI Engineer

    2 days ago


    bangalore, India NextDimension AI Full time

    Compensation: INR 12-30 LPA Base + Bonus + Equity. Location: Gurgaon. NextDimension is a US-based technology startup building AI Agents in Healthcare, established by a team of distinguished AI/ML Scientists and Engineers from Google, Amazon, and Snowflake. We're empowering Enterprises by building sophisticated, high-impact AI agents that automate sales,...

  • Staff Engineer

    2 hours ago


    Bangalore Division, India Wayfair Full time

    Staff Engineer - Data Platforms - Software Engineering. Who we are: Wayfair is seeking a passionate and driven Staff Engineer to join our “Data Platforms” team. In this pivotal role, you will be instrumental in shaping and executing Wayfair’s New-Age Data Platform strategy across large-scale Data Ingestion, Transformation, Real-time Messaging Platforms...

  • Platform Engineer- AI

    2 hours ago


    Bangalore Division, India Smarsh Full time

    Who are we? Smarsh empowers its customers to manage risk and unleash intelligence in their digital communications. Our growing community of over 6500 organizations in regulated industries counts on Smarsh every day to help them spot compliance, legal or reputational risks in 80+ communication channels before those risks become regulatory fines or headlines....