
Senior Site Reliability Engineer
3 weeks ago
Objectives of this role:
• Build software and systems to manage platform infrastructure and applications.
• Run the production environment by monitoring availability and taking a holistic view of system health.
• Measure and optimize system performance, with an eye toward pushing our capabilities forward, getting ahead of customer needs, and innovating for continual improvement.
• Improve reliability, quality, and time-to-market of our suite of software solutions.
• Provide primary operational support and engineering for multiple large-scale distributed software applications.
• This role requires participation in an on-call schedule, including availability during nights and weekends as needed, to ensure timely incident response and resolution.
Job Requirements
Bachelor's degree (or equivalent) in computer science or related discipline.
8-12 years of software industry experience with at least 4 years of experience in SRE role.
Ability to design cloud infrastructure on AWS cloud for software products for high availability, scalability, resilience, reliability, and performance.
Ability to automate infrastructure deployment via Infrastructure as Code using Terraform.
Ability to program (structured and OOP) using one or more high-level languages such as Python, Java, Ruby, and JavaScript.
Experience with container management technology (Docker, Kubernetes, Yarn, ECS, EKS).
Experience monitoring infrastructure and visualization software like Prometheus, Grafana etc.
Participate in a shared on-call rotation to respond to production incidents and resolve them with minimal downtime.
This position requires participation in an on-call schedule, including nights, weekends, and holidays, on a rotating basis.
We strive to foster a supportive on-call culture, focused on automation, documentation, and continuous improvement to minimize disruptions.
AWS certification on 1 or more of the following:
o SysOps Administrator
o DevOps Engineer
o Solutions Architect
Proactive approach to identifying problems, performance bottlenecks.
Excellent communication skills and leadership skills with pro-active mindset.
Coaching and Mentoring skills are necessary to connect with the product teams and influence them to adhere to Cloud best practices
-
Site Reliability Engineer
1 week ago
india Synechron Full timeWe have immediate opportunity forSRE (Senior Site Reliability Engineer) 5 to 9 years. Synechron –BangaloreJob Role: -SRE (Senior Site Reliability Engineer) Job Location: -Bangalore Notice Period:Within 30daysAbout Synechron We began life in 2001 as a small, self-funded team of technology specialists. Since then, we’ve grown our organization to 14,500+...
-
Senior II Site Reliability Engineer
1 day ago
India Akamai Technologies Full timeJob Description Job Description Do you have the passion to architect and lead the next generation of public cloud infrastructure Would you like to lead modernization initiatives while building a public cloud platform from scratch Join our IaaS Site Reliability Engineering (SRE) team. We design, develop, and operate infrastructure and services that power...
-
Senior II Site Reliability Engineer
5 days ago
India Akamai Full timeDo you have the passion to architect and lead the next generation of public cloud infrastructure? Would you like to lead modernization initiatives while building a public cloud platform from scratch? Join our IaaS Site Reliability Engineering (SRE) team. We design, develop, and operate infrastructure and services that power the backbone of our cloud...
-
Site Reliability Engineer
5 days ago
India Akamai Full timeDo you want to grow your career in Linux and Site Reliability Engineering? Would you like to contribute to the foundation of a new public cloud platform? Join our IaaS Site Reliability Engineering (SRE) team. We design, develop, and operate infrastructure and services that power the backbone of our cloud platform. This is a rare opportunity to help build a...
-
Site Reliability Engineer
1 day ago
India Elgebra Full timeHiring: Site Reliability Engineer – 7+ Years Location: Bangalore / Chennai Payroll: Elgebra
-
Site reliability engineer
3 days ago
India Employ Full timeRole - Site Reliability Engineer (SRE)/ Platform Engineering/ or Dev Ops Engineering roles Location – Fully Remote Type - 6 months Contract Work Ex - 5+ Yrs We’re working with a AI product company that’s building the next generation of Gen AI powered developer platforms . We’re looking for an experienced Site Reliability Engineer to join...
-
Senior site reliability engineer- elk expert
3 days ago
India IVedha Inc. Full timeSenior Site Reliability Engineer (SRE) – ELK Expert | Platform Engineering Practice Location: India (Remote) - Must be available to work in the EST (US/Canada) Time Zone. Role Summary: Are you a Senior Site Reliability Engineer (SRE) with deep ELK expertise, ready to take ownership of large-scale observability infrastructure? We're looking for an...
-
Senior Site Reliability Engineer- ELK Expert
1 week ago
India iVedha Inc. Full timeSenior Site Reliability Engineer (SRE) – ELK Expert | Platform Engineering Practice Location: India (Remote) - Must be available to work in the EST (US/Canada) Time Zone. Role Summary: Are you a Senior Site Reliability Engineer (SRE) with deep ELK expertise, ready to take ownership of large-scale observability infrastructure? We're looking for an SRE...
-
Senior Site Reliability Engineer
2 weeks ago
India beBeeReliability Full time ₹ 20,00,000 - ₹ 25,00,000We are seeking a seasoned Site Reliability Engineer to join our team. This role is focused on leading the operational health of our platforms, ensuring they deliver highly reliable financial applications and data services. This critical position will play a pivotal role in ensuring the stability, scalability, and operational excellence of Accounting and...
-
Site Reliability Engineer
1 week ago
India Concord Full timeSRE Sr. Engineers (Individual Contributors) Key Attributes: - Strong SRE (Site Reliability Engineering) experience - DevOps skills – CI/CD, monitoring, automation, infrastructure as code, etc. - Excellent troubleshooting and debugging skills (infrastructure + application level) - Perseverance – must push through complex/challenging issues without...