Oracle | Senior Site Reliability Engineer
1 week ago
Role : Senior Site Reliability Engineer
Team: OCI Reliability
Shift : 6am - 2pm
Skills required : Production Incidence, Automation, Python.
Location : Remote
Job description
As a Senior Site Reliability Engineer, you will focus on detecting, triaging, and mitigating OCI service-impacting events quickly and efficiently. You will be responsible for minimising downtime by delivering exceptional major incident management and ensuring the reliability, scalability, performance, and security of the systems that prevent incidents from occurring. Your work will directly contribute to reducing event duration by leveraging your operational expertise, best practices, and the ability to develop tools that automate and improve incident management processes.
Oracle Cloud is cutting-edge and continuously evolving. When issues arise, your team will respond within minutes to mitigate customer impact and ensure service continuity. This role will give you deep insight into the inner workings of OCI’s systems and operations. You’ll collaborate with and influence leaders across Oracle, driving organisational initiatives aimed at continually improving OCI-wide service availability. As part of an agile, high-impact team, you will play a crucial role in shaping the future of Oracle Cloud. If you're excited to be part of a fast-moving team that’s pushing the boundaries of innovation, we’d love to connect with you
We are looking for candidates who are flexible to work APAC shift hours (6 AM to 2 PM IST).
Career Level - IC3
Responsibilities :
- Lead major incident recovery by orchestrating cross-functional collaboration, driving rapid escalation, clear communication, and seamless stakeholder alignment to ensure swift and effective resolution.
- Identify opportunities to automate and streamline critical incident workflows, taking full ownership of developing and implementing innovative solutions to enhance efficiency and drive faster resolutions.
- Leverage deep expertise in cloud computing design patterns and dependencies to proactively mitigate complex major incidents and optimize cloud-based solutions and Leverage your expertise to quickly diagnose root causes, mitigate impact, and implement long-term fixes.
- Proficient in troubleshooting cloud infrastructure issues using observability platforms to monitor, analyse, and resolve performance and reliability challenges.
- Continuously improve operational processes, tools, and workflows to enhance the reliability and efficiency of the cloud infrastructure.
Minimum Qualifications
- Bachelor's degree or higher in Computer Science or a related field, or equivalent work experience.
- 4+ years of experience in Site Reliability Engineering (SRE), DevOps, or Systems Engineering.
- Extensive hands-on experience with public cloud operations (e.g., AWS, Azure, GCP, OCI).
- Proven track record in Major Incident Management within cloud-based environments, with the ability to drive effective incident resolution.
- Strong understanding of automation and orchestration principles, with a focus on improving system reliability and efficiency.
- Proficiency in at least one modern object-oriented programming language (e.g., Python, Java, Go, etc.).
- Solid experience in software engineering best practices, including Agile methodologies, coding standards, code reviews, version control, build processes, testing, and operations.
- Familiarity with infrastructure automation tools such as Chef, Ansible, Jenkins, and Terraform.
- Expertise in several key technologies, including Infrastructure-as-a-Service (IaaS), CI/CD systems, Docker, RESTful APIs, log analysis, and debugging tools.
- Experience with observability platforms such as Grafana, Prometheus, and other monitoring, logging, and tracing tools to optimize system visibility, performance, and issue resolution.
-
Oracle | Senior Site Reliability Engineer
2 weeks ago
india Oracle Full timeRole : Senior Site Reliability Engineer Team: OCI Reliability Shift : 6am - 2pmSkills required : Production Incidence, Automation, Python.Location : RemoteJob descriptionAs a Senior Site Reliability Engineer, you will focus on detecting, triaging, and mitigating OCI service-impacting events quickly and efficiently. You will be responsible for minimising...
-
Senior Site Reliability Engineer
1 week ago
India Oracle Full timeRole : Senior Site Reliability Engineer Team: OCI Reliability Shift : 6am - 2pm Skills required : Production Incidence, Automation, Python. Location : Remote Job description As a Senior Site Reliability Engineer, you will focus on detecting, triaging, and mitigating OCI service-impacting events quickly and efficiently. You will...
-
Senior Site Reliability Engineer
2 weeks ago
India Oracle Full timeRole : Senior Site Reliability Engineer Team: OCI Reliability Shift : 6am - 2pmSkills required : Production Incidence, Automation, Python.Location : RemoteJob descriptionAs a Senior Site Reliability Engineer, you will focus on detecting, triaging, and mitigating OCI service-impacting events quickly and efficiently. You will be responsible for minimising...
-
Senior Site Reliability Engineer
2 weeks ago
India Oracle Full timeRole : Senior Site Reliability Engineer Team: OCI Reliability Shift : 6am - 2pm Skills required : Production Incidence, Automation, Python. Location : Remote Job description As a Senior Site Reliability Engineer, you will focus on detecting, triaging, and mitigating OCI service-impacting events quickly and efficiently. You will be...
-
Senior Site Reliability Engineer
2 weeks ago
India Oracle Full timeRole : Senior Site Reliability Engineer Team: OCI Reliability Shift : 6am - 2pm Skills required : Production Incidence, Automation, Python. Location : Remote Job description As a Senior Site Reliability Engineer, you will focus on detecting, triaging, and mitigating OCI service-impacting events quickly and efficiently. You will...
-
Cloud Reliability Engineering Lead
10 hours ago
India Oracle Full timeJob DescriptionWe are seeking a seasoned Senior Member of Technical Staff to join our Oracle Cloud Infrastructure (OCI) Problem Engineering team as a Cloud Reliability Engineering Lead.
-
Site Reliability Developer
2 months ago
india Oracle Full timeAs a member of the SRE division, you will take an active role in the definition and evolution of standard practices and procedures. You will be responsible for defining and developing software for tasks associated with the developing, designing and debugging of software applications or operating systems. Manage reliability and performance aspects for hotel...
-
Site Reliability Developer
2 months ago
india Oracle Full timeAs a member of the SRE division, you will take an active role in the definition and evolution of standard practices and procedures. You will be responsible for defining and developing software for tasks associated with the developing, designing and debugging of software applications or operating systems. Manage reliability and performance aspects for hotel...
-
Site Reliability Developer
2 months ago
india Oracle Full timeAs a member of the SRE division, you will take an active role in the definition and evolution of standard practices and procedures. You will be responsible for defining and developing software for tasks associated with the developing, designing and debugging of software applications or operating systems.Manage reliability and performance aspects for hotel...
-
india Oracle Full timeWHO ARE WE? We're the Technical Architecture group, and we're defining Oracle's next generation application architecture based on cloud-native principles. We're building new shared microservices for data access, messaging, security, job scheduling, and more. Our shared services, built on Oracle Cloud Infrastructure (OCI), provide the platform on which Fusion...
-
Senior Site Reliability Engineer
4 weeks ago
india HCLTech Full timeUrgent Opening for Cloud Senior Site Reliability Engineer role for Pan India location with HCL TechInterested candidates kindly share your updated resume to sagardo@hcltech.com with the subject line "Cloud Senior Site Reliability Engineer Role_ your name & preferred location"Job Description: Ability to learn SRE practices across Red Hat Open Shift, Google...
-
Site Reliability Engineering Lead
1 week ago
India Tata Consultancy Services Full timeTCS has been a great pioneer in feeding the fire of young techies like you. We are a global leader in the technology arena and there’s nothing that can stop us from growing together. What we are looking for Role: Site Reliability Engineering Lead Experience Range: 8 – 12 Years Location: Pune & Chennai, Bangalore , Delhi Must-Have: ...
-
Oracle Exadata Dbma
6 months ago
India Oracle Full timeAs a member of the Support organization, your focus is to deliver post-sales support and solutions to the Oracle customer base while serving as an advocate for customer needs. This involves resolving post-sales non-technical customer inquiries via phone and electronic means, as well as, technical questions regarding the use of and troubleshooting for our...
-
Senior site reliability engineer
4 weeks ago
India Vertex Agility Full timeSenior Site Reliability Engineer - Remote Vertex Agility is a dynamic, cross-geographic remote consultancy specializing in software engineering, Dev Ops, and cloud, partnered with some of the most well-known brands globally. We specialise in transforming businesses by providing tailored cloud solutions to our client's needs. Operating from 15+ countries...
-
india HCLTech Full timeUrgent Opening for Cloud Senior Site Reliability Engineer role for Pan India location with HCL Tech Interested candidates kindly share your updated resume to with the subject line "Cloud Senior Site Reliability Engineer Role_ your name & preferred location" Job Description: Ability to learn SRE practices across Red Hat Open Shift, Google Cloud or...
-
Senior Software Engineering Manager
1 month ago
india Oracle Full timeWe’re looking for a Senior Software Development Manager with expertise and passion in building teams, coaching individuals, and solving difficult problems in distributed systems, and highly available services.As a Senior Software Development Manager, you and your team will solve exciting technical challenges by analyzing, troubleshooting, and designing...
-
Senior Software Engineering Manager
1 month ago
india Oracle Full timeWe’re looking for a Senior Software Development Manager with expertise and passion in building teams, coaching individuals, and solving difficult problems in distributed systems, and highly available services. As a Senior Software Development Manager, you and your team will solve exciting technical challenges by analyzing, troubleshooting, and designing...
-
india Oracle Full timeOracle Cloud Infrastructure (OCI) is a strategic focal point for Oracle, offering comprehensive cloud services across IaaS, PaaS, and SaaS. OCI is built from the ground up to meet the needs of mission-critical applications, enterprise-grade security, and unparalleled availability and scalability. Thousands of customers across industries and regions already...
-
india Oracle Full timeOracle Cloud Infrastructure (OCI) is a strategic focal point for Oracle, offering comprehensive cloud services across IaaS, PaaS, and SaaS. OCI is built from the ground up to meet the needs of mission-critical applications, enterprise-grade security, and unparalleled availability and scalability. Thousands of customers across industries and regions already...
-
Site Reliability Engineer
4 weeks ago
India Tata Consultancy Services Full timeDear Candidate, Greetings from TCS !!! TCS is hiring for SRE, please find the below JD….. Experience range – 5+ years Location- Bangalore, Pune, Hyderabad, Chennai Skills Required - Site Reliability Engineer Role& Responsibilities – Collaborates with cloud platform engineers and teams to design, develop, test, and implement...