
Senior Site Reliability Expert
4 days ago
Job Description:
We are seeking a skilled and experienced Senior Site Reliability Engineer to join our organization. The ideal candidate will have a strong background in engineering and experience working with cross-functional teams.
Key Responsibilities:
- Bridge the gap between development and operations teams by developing scripts and implementing tools and automation frameworks that reduce manual intervention
- Work closely with business teams to define Service Level Objectives (SLOs) and Service Level Agreements (SLAs)
- Deploy and manage monitoring tools to gain insights into system health and performance
- Analyze performance, identify bottlenecks, and implement solutions to improve scalability and reduce latency
- Design and execute chaos experiments to test system resilience to failure
- Own, define, and implement Disaster Recovery (DR) processes for systems
- Maintain documentation of processes, playbooks, and systems
Required Skills and Qualifications:
- Bachelor's degree in CS or related field, or equivalent experience
- 12+ years of overall IT experience
- 7+ years of proven work experience as a Senior Site Reliability Engineer or similar position
- AWS Cloud experience, with an AWS certification such as Certified DevOps Engineer or SysOps
- Experience with CDN and/or Cache systems like Fastly, Akamai, CloudFront, etc.
- Strong understanding of cloud deployments (AWS/Docker/Kubernetes)
- Knowledge of provisioning and IaC tools such as Terraform, Chef, and Ansible, and of scripting with Shell, Groovy, Python, etc.
- Experience with monitoring systems such as CloudWatch, New Relic, Datadog/Splunk, and the ELK stack
- Platform or application engineering and operational knowledge of CI/CD tooling such as GitHub Actions, Jenkins, etc.
Good To Have:
- Experience with GitHub Actions
- Experience with CloudFront, Fastly
-
Delhi, Delhi, India beBeeELKexpert Full time US$ 1,50,000 - US$ 2,10,000
Senior Site Reliability Engineer ELK Expert
We are seeking an exceptional Senior Site Reliability Engineer with in-depth expertise in the ELK stack to join our team. This role requires a highly skilled professional who can design, manage, and scale large-scale observability infrastructure, enhancing reliability across distributed systems. The ideal candidate...
-
Site Reliability Engineer
1 week ago
Delhi, Delhi, India Employ Full time
Role - Site Reliability Engineer (SRE) / Platform Engineering / DevOps Engineering roles
Location - Bangalore / Remote
Type - Contract
Work Ex yrs
We're working with an AI product company that's building the next generation of GenAI-powered developer platforms. We're looking for an experienced Site Reliability Engineer to join their Platform...
-
Senior Site Reliability Engineer
4 days ago
Delhi, Delhi, India MindBrain Full time
Position: Site Reliability Engineer.
Budget: 1.7 LPM.
Exp: 10 yrs.
Duration: 6 months.
Technical Skills:
  - Programming: Proficiency in languages like Python.
  - Operating Systems: Deep understanding of Linux/Windows operating systems and networking concepts.
  - Cloud Technologies: Experience with Azure including services, architecture, and best practices.
  -...
-
Reliable Systems Expert
4 days ago
Delhi, Delhi, India beBeeCloudEngineer Full time ₹ 15,00,000 - ₹ 25,00,000
Job Opportunity
We are seeking a skilled Site Reliability Professional to fill a key position in our organization.
Primary Responsibilities
Ensure the reliability and scalability of distributed systems by implementing best practices for monitoring, logging, and troubleshooting. Collaborate with development and operations teams to identify and resolve system...
-
Senior Cloud Reliability Engineer
17 hours ago
Delhi, Delhi, India beBeeSenior Full time US$ 1,50,000 - US$ 2,10,000
We're looking for an experienced Senior Cloud Reliability Engineer to join our team. As a key member of the engineering team, you will be responsible for designing, managing, and scaling large-scale observability infrastructure using the ELK stack (Elasticsearch, Logstash, Kibana).
Key Responsibilities:
Cloud Infrastructure Design: Architect scalable,...
-
Cloud Reliability Expert
4 days ago
Delhi, Delhi, India beBeeObservability Full time ₹ 1,50,00,000 - ₹ 2,00,00,000
Job Title
We are looking for an experienced Observability and Site Reliability Engineer to join our team. This is a key role in improving system reliability, monitoring capabilities, and resilience across our cloud infrastructure.
Key Responsibilities:
Design, implement, and manage observability solutions using Splunk integrated with Azure Monitor, Log...
-
Senior Cloud Operations Engineer
1 week ago
Delhi, Delhi, India beBeeSiteReliability Full time ₹ 9,00,000 - ₹ 12,00,000
Job Overview
We are seeking a skilled Senior Site Reliability Engineer to join our team. This is an exciting opportunity for someone who thrives in ambiguity and drives results across organizational boundaries.
About the Role
This position is part of our specialized team of Senior Site Reliability Engineers who act as embedded technical experts across our IT...
-
Senior Site Administrator
1 week ago
Delhi, Delhi, India beBeeLeadership Full time ₹ 18,00,000 - ₹ 24,00,000
Site Leadership Role
This is a leadership position that requires an individual with strong organizational and interpersonal skills. The site manager will oversee all operations, including property management, team development, financial oversight, and maintenance.
Key Responsibilities:
Team Management
Develop and implement strategies to enhance the performance...
-
Manager, Site Reliability Engineering
3 weeks ago
Delhi, Delhi, India Palo Alto Networks Full time
Our Mission
At Palo Alto Networks everything starts and ends with our mission:
Being the cybersecurity partner of choice, protecting our digital way of life.
Our vision is a world where each day is safer and more secure than the one before. We are a company built on the foundation of challenging and disrupting the way things are done, and we're looking for...
-
Site Reliability Engineer
1 week ago
Delhi, Delhi, India CES Full time
We are seeking a hands-on SRE with expertise in infrastructure automation, cloud scalability, and performance optimization. You'll design, manage, and monitor large-scale AWS environments, ensuring high availability, security, and reliability for our SaaS platforms.
Key Responsibilities
Develop and execute UI automation using Cypress with TypeScript. Conduct...