[21/09/2025] Senior Site Reliability Developer

16 hours ago


India, Oracle, Full time

Job Description

OCI is Oracle's next-generation cloud platform, built for the most demanding enterprise workloads. We deliver high-performance computing, storage, networking, and platform services at global scale.

The AI Platform, Services & Solutions organization within OCI is building the foundation for enterprise AI, spanning GPU infrastructure, training pipelines, orchestration systems, and model deployment services. As part of this mission, we are looking for a Senior Site Reliability Engineer (SRE) to join our team and take ownership of managing and evolving our OKE (Oracle Kubernetes Engine) infrastructure.

This is a hands-on, high-impact role where you will be responsible for ensuring the reliability, scalability, and security of cloud-scale services that power AI workloads across Oracle Cloud.

Qualifications

- 4-10 years of experience in site reliability, DevOps, or systems engineering.
- Strong background in operating large-scale, distributed, and highly available systems.
- Proficient with Linux, Python, and shell scripting.
- Hands-on experience with Kubernetes (OKE, EKS, GKE, or similar) and Docker.
- Experience with Infrastructure as Code (Terraform, Ansible, etc.) on a major cloud provider.
- Knowledge of cloud networking, security, and routing (VPC, CIDR, security groups).
- Familiarity with observability tools (Prometheus, Elasticsearch, Fluentd, Grafana).
- Experience with CI/CD pipelines, git workflows, and agile development.
- Understanding of disaster recovery, redundancy, and operational uptime planning.
- Strong troubleshooting, problem-solving, and communication skills.
- BS/MS in Computer Science or equivalent experience.

Desired Attributes

- Resourceful and pragmatic in solving operational challenges.
- Strong focus on automating repetitive tasks and reducing toil.
- Committed to shared responsibility and improving the on-call experience.
- Detail-oriented with strong critical-thinking skills.
- Eager to learn and to mentor others in a collaborative environment.

Responsibilities

- Design, automate, and operate infrastructure resources in OCI (compute, storage, networking, load balancing).
- Manage large-scale OKE clusters and containerized workloads.
- Build automation for service provisioning, monitoring, and lifecycle management.
- Develop dashboards, alerts, runbooks, and tooling to improve observability and reliability.
- Troubleshoot and resolve complex production issues with a focus on resilience and uptime.
- Contribute to service authentication, authorization, and security best practices.
- Collaborate with software and ML engineers to deliver highly available AI infrastructure.
- Participate in on-call rotations and improve incident response processes.

Career Level - IC3

