
Optimizing System Reliability
1 day ago
This is a technical leadership role that involves collaborating with development teams to enhance system reliability.
In this position, you will be responsible for designing and building highly available and scalable production services. You will also define and implement Service Level Objectives (SLOs) and Service Level Indicators (SLIs) to measure system reliability and performance.
You will lead incident response efforts to mitigate and resolve production issues quickly. Additionally, you will conduct postmortems and root cause analyses to prevent recurrence.
Your responsibilities will include automating operational tasks using Infrastructure as Code (IaC) tools such as Terraform. You will also implement self-healing and auto-scaling mechanisms for infrastructure components.
The ideal candidate has experience operating Kubernetes in a production environment, proficiency in IaC and CI/CD automation tools, hands-on experience with observability tools, and familiarity with cloud platforms and cloud-native architectures.
Some of the key qualifications for this role include a strong understanding of microservices architecture and its operational challenges. The ideal candidate should also have experience fostering SRE best practices within an organization.
To excel in this position, you should have excellent problem-solving skills and the ability to take ownership of reliability-related challenges. You should also have proven project management experience: identifying issues, planning solutions, driving execution, and coordinating stakeholders.
-
Optimizing System Performance Specialist
1 day ago
Chennai, Tamil Nadu, India beBeeReliability Full time ₹ 1,00,00,000 - ₹ 1,20,00,000
Job Overview: Our organization seeks a highly skilled Site Reliability Engineer to optimize the performance and reliability of our mission-critical systems. The successful candidate will be responsible for monitoring system health, implementing automation for repetitive tasks, responding to incidents, and driving efforts to prevent recurrences through detailed...
-
System Reliability Engineer
16 hours ago
Chennai, Tamil Nadu, India beBeeTechnical Full time US$ 1,20,000 - US$ 1,40,000
We are seeking a skilled technical expert to enhance our automated risk detection systems. This role involves supporting and maintaining rule-based logic, data analysis processes, and scalable tools. In this position, you will play a vital part in system reliability, operational support, and technical troubleshooting, contributing to the protection of...
-
Reliable Systems Engineer
2 days ago
Chennai, Tamil Nadu, India beBeeReliability Full time
System Reliability Expert
We are seeking a talented and proactive system reliability expert to join our infrastructure team. The ideal candidate will combine software engineering expertise with systems engineering skills to build scalable, reliable, and efficient systems.
Key Responsibilities: Design, implement, and manage scalable, resilient, and secure...
-
System Reliability Specialist
3 days ago
Chennai, Tamil Nadu, India beBeeReliability Full time ₹ 8,00,000 - ₹ 12,00,000
Job Title: System Reliability Specialist
Experience: 7 to 12 Years
Qualification: Diploma/BE (Mech./Instru.)
Locations: Vadodara, Chennai
Responsibility: We are seeking an experienced System Reliability Specialist to join our team. The successful candidate will be responsible for maintaining and troubleshooting complex systems, including instruments, valves,...
-
Reliability Engineer
3 days ago
Chennai, Tamil Nadu, India Alp Consulting Ltd. Full time
Job Title: Reliability Engineer
Experience: 7 to 12 Years
Qualification: Diploma/BE (Mech./Instru.)
Locations: Vadodara/Chennai
Responsibility: Experience maintaining instruments, valves, transmitters, sensors, control systems (DCS/PLC, SCADA), analyzers, and F&G systems, etc. Experience with GE-APM Reliability Analytics, SIL study and Exida will be an added...
-
Solar Tracker Reliability Engineer
1 week ago
Chennai, Tamil Nadu, India FTC Solar, Inc Full time
Position Overview: We are seeking a highly skilled and motivated Solar Tracker Reliability Engineer to join our team. The Solar Tracker Reliability Engineer will play a key role in ensuring the reliability and performance of our solar tracker systems. This position offers an exciting opportunity to work at the forefront of renewable energy technology and...
-
Solar Tracker Reliability Engineer
5 hours ago
Chennai, Tamil Nadu, India FTC Solar, Inc Full time
Position Overview: We are seeking a highly skilled and motivated Solar Tracker Reliability Engineer to join our team. The Solar Tracker Reliability Engineer will play a key role in ensuring the reliability and performance of our solar tracker systems. This position offers an exciting opportunity to work at the forefront of renewable energy technology and...
-
Site Reliability Engineer
20 hours ago
Chennai, Tamil Nadu, India Trimble Inc. Full time
Job Description
Job Summary: We are seeking a motivated Site Reliability Engineer (SRE) Level 1 to enhance the infrastructure and operational reliability of our ERP product, specifically within Azure and Windows environments. The ideal candidate will utilize SRE principles to ensure high system availability, stability, and performance while collaborating...
-
Senior Site Reliability Engineer
4 weeks ago
Chennai, Tamil Nadu, India Keuro Life Full time
Site Reliability Engineer / DevOps
We are seeking an experienced Site Reliability Engineer / DevOps professional with a minimum of 6 years in the industry. The ideal candidate will be adept at managing large-scale, high-traffic production environments and ensuring their reliability.
Key Responsibilities:
- Manage and optimize production environments to...
-
Reliability Strategist
3 days ago
Chennai, Tamil Nadu, India beBeeReliability Full time US$ 1,20,000 - US$ 1,50,000
Job Summary: We are seeking a highly skilled Reliability Engineer to join our team. In this role, you will play a critical part in developing and implementing equipment reliability strategies to ensure the safe and efficient operation of rotating equipment.
Key Responsibilities: Provide technical support for multiple plants, utilizing your expertise in...