System Reliability Specialist

4 days ago


Hyderabad, Telangana, India beBeeReliability Full time ₹ 30,00,000 - ₹ 40,00,000

We are seeking a skilled System Reliability Specialist to join our team. As a System Reliability Specialist, you will play a critical role in ensuring the performance and reliability of our systems.

- Design and implement Service Level Agreements (SLAs), Service Level Indicators (SLIs), and error budgets to improve system reliability.

- Monitor and optimize system performance and infrastructure metrics proactively.

- Configure and maintain observability tools to enhance system monitoring, alerting, and logging.

- Analyze system architecture, identify risks, and develop mitigation strategies.

- Collaborate with engineering teams for system design reviews, capacity planning, and performance tuning.

- Conduct blameless postmortems for critical incidents and use learnings to prevent recurrence.

- Provide primary operational support for critical systems and manage incident resolution.

- Develop automated solutions to reduce manual efforts, implement self-healing mechanisms, and enforce resiliency patterns.

- Apply analytics to historic incident and usage data to predict and prevent future failures.

Required Skills & Qualifications :

- 23 years of experience in System Reliability Engineering or related roles.

- Hands-on experience in building dashboards and alerts using Splunk and AppDynamics.

- Solid understanding of microservices architecture and distributed systems.

- Minimum of 2 years of experience developing web-based applications (preferably in Java, Spring Boot).

- Strong understanding of monitoring, observability, and system reliability principles.

- Basic hands-on experience in SQL and database interaction.

- Experience in incident management, root cause analysis, and capacity planning.

Preferred Qualifications :

- Bachelor's or Master's degree in Computer Science, Engineering, or a related field (B.Tech / M.Tech).

- Familiarity with DevOps tools, CI/CD pipelines, and cloud infrastructure (AWS, Azure, or GCP) is a plus

],

  • Hyderabad, Telangana, India beBeeOperations Full time ₹ 1,80,00,000 - ₹ 2,00,00,000

    Site Reliability EngineerWe are looking for a skilled Systems Operations Specialist with extensive experience, responsible for ensuring the reliability, availability, and performance of critical systems.Key Responsibilities:Implement scalable, secure services in cloud environments (AWS) adhering to SRE principles.Develop and manage Continuous...


  • Hyderabad, Telangana, India beBeeSoftwareEngineer Full time ₹ 1,80,00,000 - ₹ 2,40,00,000

    Reliable System SpecialistOur organization seeks a highly skilled specialist to enhance system reliability and performance. This key role will be responsible for designing, implementing, and maintaining scalable infrastructure to support applications and services.The ideal candidate will have expertise in software engineering concepts and applied experience...


  • Hyderabad, Telangana, India beBeeAzure Full time ₹ 1,50,00,000 - ₹ 2,50,00,000

    System Reliability Engineer (SRE) - Azure SpecialistThis role is for a skilled System Reliability Engineer with expertise in Core Azure Services, IoT, Event Hub, Databricks, and experience with Kubernetes, Docker, and Python/Powershell scripting.The ideal candidate will have strong knowledge of monitoring tools, including ELK, alerting, and logging systems....


  • Hyderabad, Telangana, India beBeeReliability Full time ₹ 15,00,000 - ₹ 20,00,000

    Workday US Payroll EngineerSupport the technology systems performance and reliability to meet service level targets. Create and deploy continuous performance and capacity models using various performance and availability monitoring tools, processes, and techniques.Key Responsibilities:Perform independently and become a subject matter expert.Participate...


  • Hyderabad, Telangana, India beBeeReliabilityEngineer Full time ₹ 2,00,00,000 - ₹ 2,50,00,000

    Job Description">As a reliability engineer, you will be responsible for ensuring system and application availability, scalability, and reliability while maintaining optimal uptime.">The primary objective of this role is to:">">Build, monitor, and maintain highly scalable deployments.">Install and deploy new releases and environments for...


  • Hyderabad, Telangana, India beBeeReliability Full time US$ 1,25,000 - US$ 1,75,000

    Job DescriptionHiring an Experienced SRE Specialist to ensure our services are robust, scalable, secure and maintainable.We're seeking a highly skilled Site Reliability Engineer (SRE) with 12+ years of experience in managing large-scale solutions or platforms. The ideal candidate will blend software engineering and systems operations to automate processes,...


  • Hyderabad, Telangana, India beBeereliability Full time ₹ 1,50,00,000 - ₹ 2,00,00,000

    Job Role:We are seeking a highly skilled and experienced Infrastructure Reliability Specialist to join our team.The ideal candidate will be responsible for ensuring the reliability, scalability, and performance of our critical infrastructure, guaranteeing high availability for our services.Key Responsibilities:Ensure IT services and infrastructure uptime at...


  • Hyderabad, Telangana, India beBeeSre Full time ₹ 45,00,000 - ₹ 52,50,000

    **Job Opportunity:**We are seeking a highly experienced Senior Site Reliability Engineer to join our organization. As a key member of our SRE team, you will act as an embedded technical expert across the IT organization.**About the Role:**This is not a traditional SRE role. You will be a technical leader, coach, and hands-on problem solver who thrives in...


  • Hyderabad, Telangana, India beBeeReliability Full time ₹ 1,20,00,000 - ₹ 1,50,00,000

    **System Reliability Engineer Opportunity**This is an exciting opportunity to join a team as a System Reliability Engineer. We are seeking a highly motivated and experienced individual to ensure the overall stability of our production application.The successful candidate will be responsible for ensuring the reliability, availability, scalability, and...


  • Hyderabad, Telangana, India beBeeProblemSolver Full time ₹ 1,00,00,000 - ₹ 2,00,00,000

    Job TitleA Site Reliability Engineer III will be responsible for solving complex business problems with simple and straightforward solutions.Main Responsibilities:Independently decompose and iteratively improve on existing solutions.Configure, maintain, monitor, and optimize applications and their associated infrastructure.Key Skills:Building appropriate...