
System Reliability Specialist
4 days ago
We are seeking a skilled System Reliability Specialist to join our team. As a System Reliability Specialist, you will play a critical role in ensuring the performance and reliability of our systems.
- Design and implement Service Level Agreements (SLAs), Service Level Indicators (SLIs), and error budgets to improve system reliability.
- Monitor and optimize system performance and infrastructure metrics proactively.
- Configure and maintain observability tools to enhance system monitoring, alerting, and logging.
- Analyze system architecture, identify risks, and develop mitigation strategies.
- Collaborate with engineering teams for system design reviews, capacity planning, and performance tuning.
- Conduct blameless postmortems for critical incidents and use learnings to prevent recurrence.
- Provide primary operational support for critical systems and manage incident resolution.
- Develop automated solutions to reduce manual efforts, implement self-healing mechanisms, and enforce resiliency patterns.
- Apply analytics to historic incident and usage data to predict and prevent future failures.
Required Skills & Qualifications :
- 23 years of experience in System Reliability Engineering or related roles.
- Hands-on experience in building dashboards and alerts using Splunk and AppDynamics.
- Solid understanding of microservices architecture and distributed systems.
- Minimum of 2 years of experience developing web-based applications (preferably in Java, Spring Boot).
- Strong understanding of monitoring, observability, and system reliability principles.
- Basic hands-on experience in SQL and database interaction.
- Experience in incident management, root cause analysis, and capacity planning.
Preferred Qualifications :
- Bachelor's or Master's degree in Computer Science, Engineering, or a related field (B.Tech / M.Tech).
- Familiarity with DevOps tools, CI/CD pipelines, and cloud infrastructure (AWS, Azure, or GCP) is a plus
],-
System Reliability Specialist
5 days ago
Hyderabad, Telangana, India beBeeOperations Full time ₹ 1,80,00,000 - ₹ 2,00,00,000Site Reliability EngineerWe are looking for a skilled Systems Operations Specialist with extensive experience, responsible for ensuring the reliability, availability, and performance of critical systems.Key Responsibilities:Implement scalable, secure services in cloud environments (AWS) adhering to SRE principles.Develop and manage Continuous...
-
System Reliability Expert
2 weeks ago
Hyderabad, Telangana, India beBeeSoftwareEngineer Full time ₹ 1,80,00,000 - ₹ 2,40,00,000Reliable System SpecialistOur organization seeks a highly skilled specialist to enhance system reliability and performance. This key role will be responsible for designing, implementing, and maintaining scalable infrastructure to support applications and services.The ideal candidate will have expertise in software engineering concepts and applied experience...
-
System Reliability Engineer
4 days ago
Hyderabad, Telangana, India beBeeAzure Full time ₹ 1,50,00,000 - ₹ 2,50,00,000System Reliability Engineer (SRE) - Azure SpecialistThis role is for a skilled System Reliability Engineer with expertise in Core Azure Services, IoT, Event Hub, Databricks, and experience with Kubernetes, Docker, and Python/Powershell scripting.The ideal candidate will have strong knowledge of monitoring tools, including ELK, alerting, and logging systems....
-
System Reliability Specialist
6 days ago
Hyderabad, Telangana, India beBeeReliability Full time ₹ 15,00,000 - ₹ 20,00,000Workday US Payroll EngineerSupport the technology systems performance and reliability to meet service level targets. Create and deploy continuous performance and capacity models using various performance and availability monitoring tools, processes, and techniques.Key Responsibilities:Perform independently and become a subject matter expert.Participate...
-
System Reliability Specialist
6 days ago
Hyderabad, Telangana, India beBeeReliabilityEngineer Full time ₹ 2,00,00,000 - ₹ 2,50,00,000Job Description">As a reliability engineer, you will be responsible for ensuring system and application availability, scalability, and reliability while maintaining optimal uptime.">The primary objective of this role is to:">">Build, monitor, and maintain highly scalable deployments.">Install and deploy new releases and environments for...
-
Principal Reliability Specialist
1 week ago
Hyderabad, Telangana, India beBeeReliability Full time US$ 1,25,000 - US$ 1,75,000Job DescriptionHiring an Experienced SRE Specialist to ensure our services are robust, scalable, secure and maintainable.We're seeking a highly skilled Site Reliability Engineer (SRE) with 12+ years of experience in managing large-scale solutions or platforms. The ideal candidate will blend software engineering and systems operations to automate processes,...
-
Infrastructure Reliability Specialist
5 days ago
Hyderabad, Telangana, India beBeereliability Full time ₹ 1,50,00,000 - ₹ 2,00,00,000Job Role:We are seeking a highly skilled and experienced Infrastructure Reliability Specialist to join our team.The ideal candidate will be responsible for ensuring the reliability, scalability, and performance of our critical infrastructure, guaranteeing high availability for our services.Key Responsibilities:Ensure IT services and infrastructure uptime at...
-
Reliable Systems Specialist
7 days ago
Hyderabad, Telangana, India beBeeSre Full time ₹ 45,00,000 - ₹ 52,50,000**Job Opportunity:**We are seeking a highly experienced Senior Site Reliability Engineer to join our organization. As a key member of our SRE team, you will act as an embedded technical expert across the IT organization.**About the Role:**This is not a traditional SRE role. You will be a technical leader, coach, and hands-on problem solver who thrives in...
-
System Reliability Engineer
6 days ago
Hyderabad, Telangana, India beBeeReliability Full time ₹ 1,20,00,000 - ₹ 1,50,00,000**System Reliability Engineer Opportunity**This is an exciting opportunity to join a team as a System Reliability Engineer. We are seeking a highly motivated and experienced individual to ensure the overall stability of our production application.The successful candidate will be responsible for ensuring the reliability, availability, scalability, and...
-
Reliable Systems Specialist
7 days ago
Hyderabad, Telangana, India beBeeProblemSolver Full time ₹ 1,00,00,000 - ₹ 2,00,00,000Job TitleA Site Reliability Engineer III will be responsible for solving complex business problems with simple and straightforward solutions.Main Responsibilities:Independently decompose and iteratively improve on existing solutions.Configure, maintain, monitor, and optimize applications and their associated infrastructure.Key Skills:Building appropriate...