
Site Reliability Developer 2
2 weeks ago
Job Description
Solve complex problems related to infrastructure cloud services and build automation to prevent problem recurrence. Design, write, and deploy software to improve the availability, scalability, and efficiency of Oracle products and services. Design and develop designs, architectures, standards, and methods for large-scale distributed systems. Facilitate service capacity planning and demand forecasting, software performance analysis, and system tuning.
This role will be responsible for deploying, administering, securing, and the reliability of production systems in OCI and traditional data centers.
- Design, implement, and maintain Infrastructure as Code (Terraform) to provision and modify production environments via Git-based change control (PRs, reviews, CI/CD) aligned with change management policies.
- Administer and optimize Oracle Cloud Infrastructure (OCI): compute, networking, storage, IAM/policies, compartments, tagging, observability, and cost controls.
- Install, upgrade, configure, and patch enterprise database platforms across dev/test/prod validate backup/restore and maintain configuration baselines.
- Implement and maintain advanced database security: least-privilege IAM, encryption in transit/at rest, auditing, key/secret management, data masking, and compliance controls.
- Build and enhance automation with Python, Bash, and Terraform to reduce toil, standardize workflows, and create reusable modules and pipelines.
- Establish and improve observability: SLIs/SLOs, actionable alerts, dashboards, logging/metrics/tracing, and runbooks to reduce noise and MTTR.
- Conduct proactive and reactive database monitoring and maintenance: capacity and health checks, statistics management, patching, space/index management.
- Design, configure, monitor, and maintain database replication and HA/DR solutions regularly test failover and validate RTO/RPO objectives.
- Troubleshoot complex infrastructure and database alerts/incidents perform root cause analysis implement corrective and preventive actions automate remediation where feasible.
- Optimize availability, capacity, and performance through query tuning, execution plan management, resource governance, and system-level tuning.
- Uphold security, privacy, and compliance standards enforce least-privilege access, vulnerability remediation, patch governance, and backup/DR readiness.
- Document standards, runbooks, and architectural diagrams contribute to postmortems drive continuous improvement across reliability, performance, and cost.
- Participate in on-call rotations and support incident response, problem management, and change reviews.
Career Level - IC2
-
Site reliability engineer
1 week ago
India Employ Full timeRole - Site Reliability Engineer (SRE)/ Platform Engineering/ or Dev Ops Engineering roles Location – Fully Remote Type - 6 months Contract Work Ex - 5+ Yrs We’re working with a AI product company that’s building the next generation of Gen AI powered developer platforms . We’re looking for an experienced Site Reliability Engineer to join...
-
Site Reliability Engineer
4 weeks ago
India Employ Full timeRole - Site Reliability Engineer (SRE)/ Platform Engineering/ or DevOps Engineering rolesLocation – Bangalore/ RemoteType - ContractWork Ex - 4-6 yrsWe're working with a AI product company that's building the next generation of GenAI powered developer platforms.We're looking for an experienced Site Reliability Engineer to join their Platform Engineering...
-
Site Reliability Engineer
4 weeks ago
India Employ Full timeRole - Site Reliability Engineer (SRE)/ Platform Engineering/ or DevOps Engineering roles Location – Bangalore/ Remote Type - Contract Work Ex - 4-6 yrs We're working with a AI product company that's building the next generation of GenAI powered developer platforms . We're looking for an experienced Site Reliability Engineer to join their Platform...
-
Site Reliability Developer 4
3 days ago
India Oracle Full timeJob Description You will be responsible to work with Site Reliability Engineering (SRE) team on the shared full stack ownership of a collection of services and/or technology areas. Understand the end-to-end configuration, technical dependencies, and overall behavioral characteristics of production services. Responsible for the design and delivery of the...
-
Site Reliability Engineer
2 weeks ago
India Employ Full timeRole - Site Reliability Engineer (SRE)/ Platform Engineering/ or DevOps Engineering roles Location – Fully Remote Type - 6 months Contract Work Ex - 5+ Yrs We’re working with a AI product company that’s building the next generation of GenAI powered developer platforms . We’re looking for an experienced Site Reliability...
-
Site Reliability Engineer
2 weeks ago
India Employ Full timeRole - Site Reliability Engineer (SRE)/ Platform Engineering/ or DevOps Engineering roles Location – Fully Remote Type - 6 months Contract Work Ex - 5+ Yrs We’re working with a AI product company that’s building the next generation of GenAI powered developer platforms . We’re looking for an experienced Site Reliability Engineer to join their...
-
Site Reliability Engineer
2 weeks ago
India Employ Full timeRole - Site Reliability Engineer (SRE)/ Platform Engineering/ or DevOps Engineering rolesLocation – Fully RemoteType - 6 months ContractWork Ex - 5+ YrsWe’re working with a AI product company that’s building the next generation of GenAI powered developer platforms.We’re looking for an experienced Site Reliability Engineer to join their Platform...
-
Site Reliability Engineer
2 weeks ago
Bengaluru, India VidPro Consultancy Services Full timeJob Description Experience: 2.55 Years Location: Bangalore (On-site) Work Mode: 5 Days WFO Mandatory Skills: Site Reliability engineer or SRE ,Linux, System architecture, TCP/IP. HTTP,DNS ,Grafana, Prometheus and Loki Troubleshooting ,Root cause, complex systems ,Ci/CD, Docker, Kubernetes Experience : 2-4 years of relevant experience Key Skills...
-
Site Reliability Engineer
7 days ago
India Elgebra Full timeHiring: Site Reliability Engineer – 7+ Years Location: Bangalore / Chennai Payroll: Elgebra
-
Site Reliability Engineer
1 week ago
India Akamai Full timeDo you want to grow your career in Linux and Site Reliability Engineering? Would you like to contribute to the foundation of a new public cloud platform? Join our IaaS Site Reliability Engineering (SRE) team. We design, develop, and operate infrastructure and services that power the backbone of our cloud platform. This is a rare opportunity to help build a...