
System Reliability Leader
3 hours ago
We are seeking an experienced System Reliability Engineer to oversee the reliability, scalability, and performance of our critical systems.
As a Reliability Engineering Leader, you will play a pivotal role in establishing and implementing system reliability practices, leading a team of engineers, and driving automation, monitoring, and incident response strategies. This position combines software engineering and systems engineering expertise to build and maintain high-performing, reliable systems.
Key Responsibilities:
- Lead efforts to maintain high availability and reliability of critical services.
- Define and monitor service level indicators (SLIs), objectives (SLOs), and agreements (SLAs) to ensure business requirements are met.
- Proactively identify and resolve performance bottlenecks and system inefficiencies.
Incident Management & Response:
- Establish and improve incident management processes and on-call rotations.
- Lead incident response and root cause analysis for high-priority outages.
- Drive post-incident reviews and ensure actionable insights are implemented.
Automation & Tooling:
- Develop and implement automated solutions to reduce manual operational tasks.
- Enhance system observability through metrics, logging, and distributed tracing tools.
- Optimize CI/CD pipelines for seamless deployments.
Collaboration:
- Partner with software engineering teams to improve the reliability of applications and infrastructure.
- Work closely with product/engineering teams to design scalable and robust systems.
- Ensure seamless integration of monitoring and alerting systems across teams.
Leadership & Team Building:
- Manage, mentor, and grow a team of SREs.
- Promote system reliability best practices and foster a culture of reliability and performance across the organization.
- Drive performance reviews, skills development, and career progression for team members.
Capacity Planning & Cost Optimization:
- Perform capacity planning and implement autoscaling solutions to handle traffic spikes.
- Optimize infrastructure and cloud costs while maintaining reliability and performance.
Required Skills & Qualifications:
- Technical Expertise:
  - Experience with cloud platforms (AWS/Azure/GCP) and Kubernetes.
  - Hands-on knowledge of infrastructure-as-code tools like Terraform/Helm/Ansible.
  - Proficiency in Java.
  - Expertise in distributed systems, databases, and load balancing.
- Monitoring & Observability:
  - Proficient with tools like Prometheus, Grafana, Elastic APM.
  - Understanding of metrics-driven approaches for system monitoring and alerting.
- Automation & CI/CD:
  - Hands-on experience with CI/CD pipelines.
  - Skilled in automation frameworks and tools for infrastructure and application deployments.
- Incident Management:
  - Proven track record in handling incidents, post-mortems, and implementing solutions to prevent recurrence.
-
Site Reliability Engineer Leader
17 hours ago
Nagpur, Maharashtra, India beBeeLeader Full time ₹ 23,00,000 - ₹ 25,50,000
Site Reliability Engineer Leader
We are seeking a seasoned Site Reliability Engineer to lead our team in ensuring the high availability and performance of our systems.
-
Reliable Systems Specialist
2 days ago
Nagpur, Maharashtra, India beBeeInfrastructure Full time ₹ 18,00,000 - ₹ 25,00,000
System Reliability Engineer
As a System Reliability Engineer, you will be responsible for designing and deploying automated systems to ensure high availability and performance. Key Responsibilities: Design and deploy infrastructure as code using IaC tools like Terraform, Ansible, or CloudFormation. Implement automation scripts using Python, Bash, or other...
-
Reliable Systems Engineer
6 minutes ago
Nagpur, Maharashtra, India beBeeTechnical Full time ₹ 20,00,000 - ₹ 25,00,000
Job Description: We are seeking an experienced Site Reliability Engineer to ensure the reliability and scalability of our business and web applications. About the Role: As a Site Reliability Engineer, you will be responsible for ensuring the reliability and scalability of our business and web applications. Key Responsibilities: Provide production support,...
-
Cloud Expert
3 hours ago
Nagpur, Maharashtra, India beBeeExpert Full time ₹ 1,50,00,000 - ₹ 2,00,00,000
We are seeking a talented System Reliability Engineer to join our Cloud & DevOps practice. The ideal candidate will have strong expertise in building and maintaining resilient AWS architectures with automation and multi-region failover. Key Responsibilities: Design and implement SRE principles, including SLIs, SLOs, SLAs, and error budgets, to drive reliability...
-
Service Reliability Engineer
4 weeks ago
Nagpur, Maharashtra, India Tech USA Full time
About the Company: We are building a new Service Reliability Engineering (SRE) function focused on data platforms that power business processes across the enterprise. This team is part of a 24/7 distributed model that emphasizes proactive automation, observability, and techno-functional expertise. This is a high-impact role supporting systems such as...
-
Nagpur, Maharashtra, India beBeeEngineering Full time ₹ 2,00,00,000 - ₹ 2,50,00,000
Job Title: Cloud Engineering and Resilience Specialist
We are seeking an experienced engineer to design, build, and validate robust, scalable, and automated cloud-native environments. The ideal candidate will possess a deep understanding of AWS Cloud and strong Python development skills for automation and tooling. Key Responsibilities: Cloud Engineering...
-
Electrical Systems Designer
18 hours ago
Nagpur, Maharashtra, India beBeeDesign Full time ₹ 1,50,00,000 - ₹ 2,00,00,000
Electrical Design Expert Required
We are seeking an experienced Electrical Design Engineer to lead the design and implementation of primary and secondary electrical systems for substations. The successful candidate will be responsible for designing and coordinating electrical and control systems across substation infrastructure to ensure operational...
-
System Performance Specialist
17 hours ago
Nagpur, Maharashtra, India beBeePerformance Full time ₹ 18,00,000 - ₹ 20,00,000
Job Title: System Performance Specialist
This role plays a crucial part in ensuring the reliability and stability of globally deployed web applications.
-
High-End HVAC Systems Design Lead Position
20 hours ago
Nagpur, Maharashtra, India beBeeHvacdesign Full time ₹ 25,00,000 - ₹ 30,00,000
Job Title: HVAC Design Manager
We are seeking an experienced leader to spearhead the development of our HVAC systems.
-
Senior Systems Engineer
2 days ago
Nagpur, Maharashtra, India beBeeBackendDeveloper Full time ₹ 15,78,516 - ₹ 25,12,062
We're seeking a skilled Backend Developer to build scalable systems for our gaming platform. Key Responsibilities: Design, develop, and maintain robust backend services using Node.js and TypeScript. Develop real-time communication systems leveraging Redis, WebSockets (Socket.io), and database architectures. Caching strategies, performance tuning, and scaling...