(Apply Now) Site Reliability Developer 2

4 weeks ago


Thiruvananthapuram Trivandrum India Oracle Full time

Job Description

Job Description

Solve complex problems related to infrastructure cloud services and build automation to prevent problem recurrence. Design, write, and deploy software to improve the availability, scalability, and efficiency of Oracle products and services. Design and develop designs, architectures, standards, and methods for large-scale distributed systems. Facilitate service capacity planning and demand forecasting, software performance analysis, and system tuning.

This role will be responsible for deploying, administering, securing, and the reliability of production systems in OCI and traditional data centers.

Responsibilities

- Design, implement, and maintain Infrastructure as Code (Terraform) to provision and modify production environments via Git-based change control (PRs, reviews, CI/CD) aligned with change management policies.
- Administer and optimize Oracle Cloud Infrastructure (OCI): compute, networking, storage, IAM/policies, compartments, tagging, observability, and cost controls.
- Install, upgrade, configure, and patch enterprise database platforms across dev/test/prod; validate backup/restore and maintain configuration baselines.
- Implement and maintain advanced database security: least-privilege IAM, encryption in transit/at rest, auditing, key/secret management, data masking, and compliance controls.
- Build and enhance automation with Python, Bash, and Terraform to reduce toil, standardize workflows, and create reusable modules and pipelines.
- Establish and improve observability: SLIs/SLOs, actionable alerts, dashboards, logging/metrics/tracing, and runbooks to reduce noise and MTTR.
- Conduct proactive and reactive database monitoring and maintenance: capacity and health checks, statistics management, patching, space/index management.
- Design, configure, monitor, and maintain database replication and HA/DR solutions; regularly test failover and validate RTO/RPO objectives.
- Troubleshoot complex infrastructure and database alerts/incidents; perform root cause analysis; implement corrective and preventive actions; automate remediation where feasible.
- Optimize availability, capacity, and performance through query tuning, execution plan management, resource governance, and system-level tuning.
- Uphold security, privacy, and compliance standards; enforce least-privilege access, vulnerability remediation, patch governance, and backup/DR readiness.
- Document standards, runbooks, and architectural diagrams; contribute to postmortems; drive continuous improvement across reliability, performance, and cost.
- Participate in on-call rotations and support incident response, problem management, and change reviews.

Qualifications

Career Level - IC2

About Us

As a world leader in cloud solutions, Oracle uses tomorrow's technology to tackle today's challenges. We've partnered with industry-leaders in almost every sectorand continue to thrive after 40+ years of change by operating with integrity.

We know that true innovation starts when everyone is empowered to contribute. That's why we're committed to growing an inclusive workforce that promotes opportunities for all.

Oracle careers open the door to global opportunities where work-life balance flourishes. We offer competitive benefits based on parity and consistency and support our people with flexible medical, life insurance, and retirement options. We also encourage employees to give back to their communities through our volunteer programs.

We're committed to including people with disabilities at all stages of the employment process. If you require accessibility assistance or accommodation for a disability at any point, let us know by emailing [Confidential Information] or by calling +1 888 404 2494 in the United States.

Oracle is an Equal Employment Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability and protected veterans status, or any other characteristic protected by law. Oracle will consider for employment qualified applicants with arrest and conviction records pursuant to applicable law.



  • Trivandrum, India Zafin Full time

    Senior Site Reliability Engineer (SRE II) Own availability, latency, performance, and efficiency for Zafin’s Saa S on Azure. You’ll define and enforce reliability standards, lead high-impact projects, mentor engineers, and eliminate toil at scale. Error budgeting (policy & tooling): ~ Run the error-budget policy with multi-window, multi-burn-rate alerts;...


  • Bengaluru, India Oracle Full time

    Job Description Solve complex problems related to infrastructure cloud services and build automation to prevent problem recurrence. Design, write, and deploy software to improve the availability, scalability, and efficiency of Oracle products and services. Design and develop designs, architectures, standards, and methods for large-scale distributed systems....


  • Thiruvananthapuram / Trivandrum, India Reflections Info Systems Full time

    Job Description As a Site Reliability Engineer (SRE) you will be responsible for improving the overall reliability of applications by ensuring its availability, performance, and scalability. Should be able to gather the technical requirements from the DevOps team and the operational requirements from the Application Support team. With the Site Reliability...


  • Thiruvananthapuram / Trivandrum, India Reflections Info Systems Full time

    Job Description As a Site Reliability Engineer (SRE) you will be responsible for improving the overall reliability of applications by ensuring its availability, performance, and scalability. Should be able to gather the technical requirements from the DevOps team and the operational requirements from the Application Support team. With the Site Reliability...


  • Noida, India Oracle Full time

    Job Description Job Description Solve complex problems related to infrastructure cloud services and build automation to prevent problem recurrence. Design, write, and deploy software to improve the availability, scalability, and efficiency of Oracle products and services. Design and develop designs, architectures, standards, and methods for large-scale...


  • Thiruvananthapuram / Trivandrum, India Equifax Full time

    Job Description Site Reliability Engineering (SRE) at Equifax is a discipline that combines software and systems engineering for building and running large-scale, distributed, fault-tolerant systems. SRE ensures that internal and external services meet or exceed reliability and performance expectations while adhering to Equifax engineering principles. SRE is...


  • Trivandrum, India Zafin Full time

    Senior Site Reliability Engineer (SRE II) Own availability, latency, performance, and efficiency for Zafin’s Saa S on Azure. You’ll define and enforce reliability standards, lead high-impact projects, mentor engineers, and eliminate toil at scale. Reports to the Director of SRE. What you’ll do SLIs/SLOs & contracts: Define customer-centric...


  • , India, IN Sonata Software Full time

    We're Hiring: Senior Site Reliability Engineer Location: Onsite (Office: Hyderabad – Mandatory from Day 1) Employment Type: Full-time Notice Period: Immediate to 15 Days Only Experience: 8+ Years About the RoleWe’re looking for a Senior Site Reliability Engineer (SRE) to lead reliability initiatives across our production systems. This is a high-impact...


  • Bengaluru, India Deutsche Bank Full time

    Job Description Position Overview Job Title: Site Reliability Engineer Location: Bangalore, India Corporate Title: Associate Role Description - You will work closely with application teams to ensure stable, well monitored applications that are resilient to faults. You will agree and review Service Level Objectives (SLOs) to achieve high availability for...


  • Thiruvananthapuram / Trivandrum, India Geosys IT Solutions Full time

    Job Description Key Deliverables: 1. Develop and support Windows-based applications using .NET technologies. 2. Write and debug web services to ensure smooth backend functionality. 3. Perform comprehensive testing, troubleshooting, and bug fixing. 4. Collaborate with cross-functional teams to deliver robust solutions. Role Responsibilities: 1. Analyze,...