
Distinguished System Reliability Engineer
2 days ago
Job Title: System Reliability Leader
">We are seeking a seasoned system reliability engineer to lead our team in designing and implementing robust, scalable, and fault-tolerant systems. This role involves mentoring junior engineers, providing technical guidance, and fostering a culture of collaboration and continuous learning.
The ideal candidate will have a strong background in system administration, cloud computing, networking, distributed systems, and containerization technologies. Experience with scripting languages (Python, Bash) and automation tools is also required. Additionally, expertise in monitoring systems (Prometheus, Grafana), alerting configurations, and log analysis is essential.
The successful candidate will be responsible for:
- Leading the implementation of reliable, scalable, and fault-tolerant systems, including infrastructure, monitoring, and alerting.
- Managing incident response processes, including root cause analysis, post-mortem reviews, and proactive mitigation strategies to minimize system downtime and impact.
- Developing and maintaining comprehensive monitoring systems to identify potential issues early, set appropriate alerting thresholds, and optimize system performance.
- Driving automation initiatives to streamline operational tasks, including deployments, scaling, and configuration management, utilizing relevant tools and technologies.
- Proactively assessing system capacity needs, planning for future growth, and implementing scaling strategies to ensure optimal performance under load.
- Analyzing system metrics and identifying bottlenecks, implementing performance improvements, and optimizing resource utilization.
- Working closely with development teams, product managers, and other stakeholders to ensure alignment on reliability goals and smooth integration of new features.
- Developing and implementing the SRE roadmap, including technology adoption, standards, and best practices to maintain a high level of system reliability.
Requirements :
- Strong proficiency in system administration, cloud computing, networking, distributed systems, and containerization technologies.
- Expertise in scripting languages (Python, Bash) and ability to develop automation tools.
- Good understanding of Java.
- Deep understanding of monitoring systems (Prometheus, Grafana), alerting configurations, and log analysis.
- Proven experience in managing critical incidents, performing root cause analysis, and coordinating response efforts.
- Excellent communication skills to convey technical concepts to both technical and non-technical audiences.
- Strong analytical and troubleshooting skills to identify and resolve complex technical issues.
-
System Reliability Engineer
4 days ago
Mumbai, Maharashtra, India beBeeExpertise Full time ₹ 80,00,000 - ₹ 1,24,00,000Cloud Infrastructure SpecialistWe are seeking a skilled Cloud Computing expert to join our team. The successful candidate will be responsible for designing, installing, and maintaining a high level of system reliability.Key Responsibilities:Prioritize and resolve trouble tickets efficiently to maintain system reliability.Troubleshoot technical issues on...
-
High-Level System Reliability Engineer
5 days ago
Mumbai, Maharashtra, India beBeeReliability Full time ₹ 22,00,000 - ₹ 25,00,000Job Title:Reliability Engineering LeadOverview of the Role:We are seeking a highly experienced Reliability Engineering Lead to join our team. As a key member of our organization, you will be responsible for designing and implementing large-scale systems that prioritize reliability.The ideal candidate will have extensive experience in SRE or DevOps roles,...
-
Reliability Engineering Manager
7 days ago
Mumbai, Maharashtra, India Bloom Energy Full time US$ 90,000 - US$ 1,20,000 per yearRole and ResponsibilitiesManage the reliability team including reliability data analysis, continuous improvement and new product efforts based out of Mumbai, India. Support creation of System level DFMEAs and mitigation plans to address high risk items.Development of Reliability Block Diagrams (RBDs) and resulting reliability models to roll up system...
-
Site Reliability Engineer
5 days ago
Mumbai, Maharashtra, India beBeeSRE Full time ₹ 2,00,00,000 - ₹ 2,50,00,000Job Description", "As a Site Reliability Engineering Lead, you will play a pivotal role in establishing and implementing SRE practices, leading a team of engineers, and driving automation, monitoring, and incident response strategies. This position combines software engineering and systems engineering expertise to build and maintain high-performing, reliable...
-
Highly Reliable Systems Specialist
7 days ago
Mumbai, Maharashtra, India beBeeReliability Full time ₹ 1,20,00,000 - ₹ 1,35,00,000Reliable Systems ExpertWe are seeking an experienced Reliability Engineer to join our team. As a key member of our Infrastructure Team, you will play a critical role in ensuring the reliability and scalability of our systems.Key Responsibilities:Implement monitoring tools to track system health and performance metrics.Develop automation scripts to streamline...
-
Site Reliability Engineer
3 weeks ago
Mumbai, Maharashtra, India SID Global Solutions Full timeJob Description: Site Reliability Engineer (SRE) Experience: 2 to 6 years The Site Reliability Engineer (SRE) Level 1 will be responsible for maintaining and improving the reliability, availability, and performance of the systems. This entry-level role is ideal for someone who passionate about learning and developing their skills in system reliability,...
-
Reliability Engineer
3 days ago
Mumbai, Maharashtra, India beBeePlatform Full time ₹ 1,80,00,000 - ₹ 2,50,00,000Job Description">We are seeking a technically skilled professional to join our global team as an Associate Platform Reliability Engineer. This role is critical to ensuring the stability, reliability, and resilience of our front-to-back technology infrastructure, with a focus on post-trade processing, operations, and regulatory support.">Key...
-
Sr Reliability Engineer
22 hours ago
Mumbai, Maharashtra, India JLL Full time US$ 90,000 - US$ 1,20,000 per yearJLL supports the Whole You, personally and professionally.Our people at JLL are shaping the future of real estate for a better world by combining world class services, advisory and technology to our clients. We are committed to hiring the best, most talented people in our industry; and we support them through professional growth, flexibility, and...
-
Sr Reliability Engineer
2 days ago
Mumbai, Maharashtra, India JLL Full time US$ 90,000 - US$ 1,20,000 per yearJLL empowers you to shape a brighter way. Our people at JLL and JLL Technologies are shaping the future of real estate for a better world by combining world class services, advisory and technology for our clients. We are committed to hiring the best, most talented people and empowering them to thrive, grow meaningful careers and to find a place where...
-
Reliability Engineering Position
4 days ago
Mumbai, Maharashtra, India beBeeRelevance Full time ₹ 20,00,000 - ₹ 25,00,000Job Title: Site Reliability EngineerAt our company, we are seeking a highly skilled Site Reliability Engineer.This role is responsible for ensuring the availability and scalability of our systems.Key Responsibilities:• Ensure the reliability and performance of our infrastructure.• Collaborate with cross-functional teams to identify and prioritize areas...