Site Reliability Developer 2

2 weeks ago


India Oracle Full time

Job Description

Solve complex problems related to infrastructure cloud services and build automation to prevent problem recurrence. Design, write, and deploy software to improve the availability, scalability, and efficiency of Oracle products and services. Design and develop designs, architectures, standards, and methods for large-scale distributed systems. Facilitate service capacity planning and demand forecasting, software performance analysis, and system tuning.

This role will be responsible for deploying, administering, securing, and the reliability of production systems in OCI and traditional data centers.

- Design, implement, and maintain Infrastructure as Code (Terraform) to provision and modify production environments via Git-based change control (PRs, reviews, CI/CD) aligned with change management policies.
- Administer and optimize Oracle Cloud Infrastructure (OCI): compute, networking, storage, IAM/policies, compartments, tagging, observability, and cost controls.
- Install, upgrade, configure, and patch enterprise database platforms across dev/test/prod validate backup/restore and maintain configuration baselines.
- Implement and maintain advanced database security: least-privilege IAM, encryption in transit/at rest, auditing, key/secret management, data masking, and compliance controls.
- Build and enhance automation with Python, Bash, and Terraform to reduce toil, standardize workflows, and create reusable modules and pipelines.
- Establish and improve observability: SLIs/SLOs, actionable alerts, dashboards, logging/metrics/tracing, and runbooks to reduce noise and MTTR.
- Conduct proactive and reactive database monitoring and maintenance: capacity and health checks, statistics management, patching, space/index management.
- Design, configure, monitor, and maintain database replication and HA/DR solutions regularly test failover and validate RTO/RPO objectives.
- Troubleshoot complex infrastructure and database alerts/incidents perform root cause analysis implement corrective and preventive actions automate remediation where feasible.
- Optimize availability, capacity, and performance through query tuning, execution plan management, resource governance, and system-level tuning.
- Uphold security, privacy, and compliance standards enforce least-privilege access, vulnerability remediation, patch governance, and backup/DR readiness.
- Document standards, runbooks, and architectural diagrams contribute to postmortems drive continuous improvement across reliability, performance, and cost.
- Participate in on-call rotations and support incident response, problem management, and change reviews.

Career Level - IC2



  • India Employ Full time

    Role - Site Reliability Engineer (SRE)/ Platform Engineering/ or Dev Ops Engineering roles Location – Fully Remote Type - 6 months Contract Work Ex - 5+ Yrs We’re working with a AI product company that’s building the next generation of Gen AI powered developer platforms . We’re looking for an experienced Site Reliability Engineer to join...


  • India Employ Full time

    Role - Site Reliability Engineer (SRE)/ Platform Engineering/ or DevOps Engineering rolesLocation – Bangalore/ RemoteType - ContractWork Ex - 4-6 yrsWe're working with a AI product company that's building the next generation of GenAI powered developer platforms.We're looking for an experienced Site Reliability Engineer to join their Platform Engineering...


  • India Employ Full time

    Role - Site Reliability Engineer (SRE)/ Platform Engineering/ or DevOps Engineering roles Location – Bangalore/ Remote Type - Contract Work Ex - 4-6 yrs We're working with a AI product company that's building the next generation of GenAI powered developer platforms . We're looking for an experienced Site Reliability Engineer to join their Platform...


  • India Oracle Full time

    Job Description You will be responsible to work with Site Reliability Engineering (SRE) team on the shared full stack ownership of a collection of services and/or technology areas. Understand the end-to-end configuration, technical dependencies, and overall behavioral characteristics of production services. Responsible for the design and delivery of the...


  • India Employ Full time

    Role - Site Reliability Engineer (SRE)/ Platform Engineering/ or DevOps Engineering roles Location – Fully Remote Type - 6 months Contract Work Ex - 5+ Yrs We’re working with a AI product company that’s building the next generation of GenAI powered developer platforms . We’re looking for an experienced Site Reliability...


  • India Employ Full time

    Role - Site Reliability Engineer (SRE)/ Platform Engineering/ or DevOps Engineering roles Location – Fully Remote Type - 6 months Contract Work Ex - 5+ Yrs We’re working with a AI product company that’s building the next generation of GenAI powered developer platforms . We’re looking for an experienced Site Reliability Engineer to join their...


  • India Employ Full time

    Role - Site Reliability Engineer (SRE)/ Platform Engineering/ or DevOps Engineering rolesLocation – Fully RemoteType - 6 months ContractWork Ex - 5+ YrsWe’re working with a AI product company that’s building the next generation of GenAI powered developer platforms.We’re looking for an experienced Site Reliability Engineer to join their Platform...


  • Bengaluru, India VidPro Consultancy Services Full time

    Job Description Experience: 2.55 Years Location: Bangalore (On-site) Work Mode: 5 Days WFO Mandatory Skills: Site Reliability engineer or SRE ,Linux, System architecture, TCP/IP. HTTP,DNS ,Grafana, Prometheus and Loki Troubleshooting ,Root cause, complex systems ,Ci/CD, Docker, Kubernetes Experience : 2-4 years of relevant experience Key Skills...


  • India Elgebra Full time

    Hiring: Site Reliability Engineer – 7+ Years Location: Bangalore / Chennai Payroll: Elgebra


  • India Akamai Full time

    Do you want to grow your career in Linux and Site Reliability Engineering? Would you like to contribute to the foundation of a new public cloud platform? Join our IaaS Site Reliability Engineering (SRE) team. We design, develop, and operate infrastructure and services that power the backbone of our cloud platform. This is a rare opportunity to help build a...