Site Reliability Engineer
1 month ago
The Incident Manager is responsible for owning the outcomes of the incident management process and leading a team of 24/7 site reliability engineers within the technology department. This role involves strategic oversight, resource management, and effective coordination of response efforts to minimize disruptions.
Key Responsibilities:- Provide strategic direction for the team, and meticulous oversight of the incident management process, ensuring smooth navigation through the incident life cycle.
- Allocate resources effectively, including personnel and tools, to address incidents promptly and provide the necessary 24/7 coverage.
- Oversee the development of automation scripts and tools to reduce manual intervention and improve system efficiency using our APM tools.
- Coordinate with cross-functional teams, manage communication with stakeholders, and provide regular status updates.
- Guide teams in making informed decisions and implementing solutions during incident responses. Leverage existing runbooks to minimize customer impact.
- Lead investigations to determine root causes and implement corrective actions to prevent recurrence.
- Conduct post-incident reviews, analyze trends, and apply insights to enhance incident management processes.
- Ensure comprehensive documentation of incidents and responses for future analysis and improvement.
The ideal candidate will have a strong background in cloud technologies and a proactive approach to identifying and resolving issues before they impact the business. Proficiency in using monitoring and alerting tools, ability to analyze and interpret alerts and logs, and strong problem-solving skills are essential. Experience with incident management processes and tools, as well as cloud-specific issues, is also required.
-
Site Reliability Engineer Lead
4 weeks ago
Chennai, Tamil Nadu, India Bounteous Full timeJob Title: Site Reliability Engineer LeadWe are seeking a highly skilled Site Reliability Engineer Lead to join our team at Bounteous x Accolite. As a Site Reliability Engineer Lead, you will be responsible for owning the outcomes of the incident management process and leading a team of 24/7 site reliability engineers within the technology department.Key...
-
Senior Site Reliability Engineer
3 weeks ago
Chennai, Tamil Nadu, India Athenahealth Full timeJob SummaryWe are seeking a Senior Site Reliability Engineer to join our Service Operations, Site Reliability Engineering team within the Cloud Infrastructure Engineering division.The Team is responsible for managing the fleet of systems owned by its sister teams in the Service Operations zone.We are looking for Site Reliability & Infrastructure Engineering...
-
Site Reliability Engineer
1 month ago
Chennai, Tamil Nadu, India Altimetrik Full timeJob Description:We are seeking a highly skilled Site Reliability Engineer to join our team at Altimetrik. The ideal candidate will have a strong background in development and a proven track record of troubleshooting and triaging complex issues.Key Responsibilities:Mandatory Skills:Development experience in Java, .Net, or Python, with a strong focus on code...
-
Site Reliability Engineer
4 weeks ago
Chennai, Tamil Nadu, India Virtusa Full timeJob Title: DevOps ArchitectJob Summary:We are seeking a highly skilled DevOps Architect to join our team at Virtusa. As a DevOps Architect, you will be responsible for designing and implementing scalable and efficient cloud infrastructure solutions.Key Responsibilities:Design and implement cloud infrastructure solutions using Hadoop, HBase, Hive, Oozie, and...
-
Site Reliability Engineer
2 months ago
Chennai, Tamil Nadu, India NexionPro Services Full timeJob Title : Site Reliability Engineer (SRE)Location : Chennai (Guindy)Experience : 5-8 yearsNotice Period : Immediate or serving notice (August joiners preferred)Work Mode : 5 days in-officeReferences are highly appreciated.Job Summary : We are seeking a highly skilled Site Reliability Engineer (SRE) to join our team. The ideal candidate will have a solid...
-
Senior Site Reliability Engineer
3 weeks ago
Chennai, Tamil Nadu, India SES Full timeAbout the RoleThe Senior Engineer, Site Reliability is a key member of the SES IT team, responsible for the development, monitoring, operation, and support of the global SES Cloud and on-premise install base, with a strong focus on systems.As the main backup of the Senior Manager IT Systems, the Senior Engineer, Site Reliability will work closely with the...
-
Site Reliability Engineer Lead
3 weeks ago
Chennai, Tamil Nadu, India Bounteous Full timeAbout the RoleWe are seeking a skilled Site Reliability Engineer Lead to join our team at Bounteous x Accolite. As a key member of our technology department, you will be responsible for leading a team of 24/7 site reliability engineers and overseeing the incident management process.Key ResponsibilitiesLeadership & Oversight: Provide strategic direction for...
-
Site Reliability Engineering Leader
5 days ago
Chennai, Tamil Nadu, India Athenahealth Full timeAs a Senior Site Reliability Engineer at Athenahealth, you will be part of the Service Operations Site Reliability Engineering team within the Cloud Infrastructure Engineering division. This team is responsible for managing the fleet of systems owned by its sister teams in the Service Operations zone.We are looking for experienced professionals to help us...
-
Site Reliability Engineer
5 days ago
Chennai, Tamil Nadu, India Centific Global Technologies Full timeJob Title:Site Reliability Engineer - AI/ML OperationsAbout the Role:Centific Global Technologies is seeking an experienced Site Reliability Engineer to join our team and lead the development of our AI/ML operations infrastructure. This individual will be responsible for designing, building, and maintaining scalable and reliable systems for our data and AI...
-
Site Reliability Engineer
4 weeks ago
Chennai, Tamil Nadu, India Bounteous Full timeJob Title: Incident ManagerJob Summary: We are seeking an experienced Incident Manager to lead our team of 24/7 site reliability engineers within the technology department. The ideal candidate will have a strong background in cloud technologies and a proactive approach to identifying and resolving issues before they impact the business.Key...
-
Site Reliability Engineer Leader
2 weeks ago
Chennai, Tamil Nadu, India Bounteous Full timeSeeking an experienced Site Reliability Engineer Leader to lead our 24/7 engineering team and drive incident management processes. The ideal candidate will have a strong background in cloud technologies and a proactive approach to identifying and resolving issues.ResponsibilitiesProvide strategic direction for the team and meticulous oversight of the...
-
Senior Site Reliability Engineer
1 month ago
Chennai, Tamil Nadu, India Athenahealth Full timeAbout the RoleWe are seeking a highly skilled Senior Site Reliability Engineer to join our Cloud Infrastructure Engineering division. As a key member of our Service Operations Site Reliability Engineering team, you will play a critical role in managing the fleet of systems owned by our sister teams in the Service Operations zone.Key ResponsibilitiesProvision...
-
Senior Site Reliability Engineer
4 weeks ago
Chennai, Tamil Nadu, India Athenahealth Full timeAbout the Role:We are seeking a highly skilled Senior Site Reliability Engineer to join our Service Operations, Site Reliability Engineering team within the Cloud Infrastructure Engineering division. As a key member of this team, you will be responsible for managing the fleet of systems owned by sister teams in the Service Operations zone.Key...
-
Chennai, Tamil Nadu, India MX Full timeJob Title:Senior Manager of Platform Automation and Site Reliability EngineeringAbout MX:MX is a dynamic and rapidly growing financial company committed to helping empower the world to be financially strong. We are driven by our moral imperative to advance mankind - and it all starts with our people, product, and purpose.Job Summary:We are seeking a dynamic...
-
Site Reliability Engineer
4 weeks ago
Chennai, Tamil Nadu, India Centific Global Technologies Full timeJob Title: Site Reliability Engineer - AI/ML OperationsJob Summary:Centific Global Technologies is seeking a highly skilled Site Reliability Engineer to lead the AI/ML operations team. The ideal candidate will have a strong background in software release management, SRE, and DevOps, with experience in AI/ML operations, data pipeline management, and...
-
Senior Site Reliability Engineer
4 weeks ago
Chennai, Tamil Nadu, India Athenahealth Full timeAbout the Role:We are seeking a highly skilled Senior Site Reliability Engineer to join our Cloud Infrastructure Engineering team. As a key member of our team, you will be responsible for designing, implementing, and maintaining high-availability systems and infrastructure.Key Responsibilities:Provisioning and managing physical and virtual Linux machines...
-
Chennai, Tamil Nadu, India Centific Global Technologies Full timeJob Description :Centific is a Seattle-based tech company pioneering the future of AI one breakthrough at a time. Learn how we're transforming the world through safe and scalable AI and empowering businesses to unlock the full potential of their data.Key Responsibilities : Strategic Leadership & Vision :- Lead and manage the Software Release Management...
-
Senior Site Reliability Engineer
3 weeks ago
Chennai, Tamil Nadu, India Athenahealth Full timeJob Summary We are seeking an experienced Senior Site Reliability Engineer to join our Cloud Infrastructure Engineering team. The ideal candidate will have a strong background in cloud-native technologies and a proven track record of delivering scalable and highly available SaaS infrastructure solutions. Key ResponsibilitiesDesign and...
-
Kubernetes Site Reliability Engineer
1 month ago
Chennai, Tamil Nadu, India Axiom Technologies Full timeAxiom Technologies is a leading provider of IT services, supporting medium to large-scale enterprises with innovative solutions.Key Responsibilities:Design and implement continuous integration and continuous deployment (CICD) pipelines for multiple products across various environments.Develop and maintain tools for deployment, monitoring, and operations to...
-
Reliability Engineering Specialist
4 weeks ago
Chennai, Tamil Nadu, India Tata Consultancy Services Full timeTCS is a global technology leader that thrives on innovation and collaboration. Our journey has been marked by a relentless pursuit of excellence, and we continue to push the boundaries of what is possible in the tech arena.At the heart of our success lies our commitment to empowering young techies like you. We offer a dynamic work environment that fosters...