Site Reliability Engineer Lead
4 weeks ago
We are seeking a highly skilled Site Reliability Engineer Lead to join our team at Bounteous x Accolite. As a Site Reliability Engineer Lead, you will be responsible for owning the outcomes of the incident management process and leading a team of 24/7 site reliability engineers within the technology department.
Key Responsibilities:
* Provide strategic direction for the team and meticulous oversight of the incident management process, ensuring smooth navigation through the incident life cycle.
* Allocate resources effectively, including personnel and tools, to address incidents promptly and provide the necessary 24/7 coverage.
* Oversee the development of automation scripts and tools to reduce manual intervention and improve system efficiency using our APM tools.
* Coordinate with cross-functional teams, manage communication with stakeholders, and provide regular status updates.
* Guide teams in making informed decisions and implementing solutions during incident responses.
* Lead investigations to determine root causes and implement corrective actions to prevent recurrence.
Requirements:
* Strong background in cloud technologies and a proactive approach to identifying and resolving issues before they impact the business.
* Proficiency in using monitoring and alerting tools, such as New Relic and Datadog.
* Ability to analyze and interpret alerts and logs to pinpoint the source of the issue.
* Ability to quickly identify and prioritize critical issues.
* Experience with incident management processes and tools, such as PagerDuty.
* Strong problem-solving skills to diagnose and resolve system and application issues.
* Proficiency in using diagnostic tools and techniques, such as log analysis, tracing, and profiling.
* Strong working knowledge of operating systems, such as Linux and Windows, and system administration tasks.
* Familiarity with key system components, such as CPU, memory, disk, and network.
* Basic knowledge of database management and troubleshooting, such as MySQL, PostgreSQL, and MS-SQL.
* Experience with managing cloud resources and troubleshooting cloud-specific issues.
* Clear and concise communication skills to convey the status and impact of an outage to stakeholders.
* Ability to coordinate effectively with different teams, such as development, operations, and support.
* Ability to remain calm and focused under pressure.
* Effective time management to handle multiple tasks and prioritize urgent issues.
About Bounteous x Accolite:
Bounteous x Accolite is a global digital agency that helps ambitious brands succeed in a rapidly changing world. We are guided by Co-Innovation, our proven methodology of collaborative partnership. Our team of 5,000+ employees spans North America, APAC, and EMEA, and we partner with leading technology providers. We are committed to promoting an inclusive environment and are proud to be an equal opportunity employer.
How to Apply:
If you are a motivated and experienced Site Reliability Engineer Lead looking for a new challenge, please submit your application. We look forward to hearing from you.
-
Site Reliability Engineer Lead
3 weeks ago
Chennai, Tamil Nadu, India Bounteous Full time
About the Role: We are seeking a skilled Site Reliability Engineer Lead to join our team at Bounteous x Accolite. As a key member of our technology department, you will be responsible for leading a team of 24/7 site reliability engineers and overseeing the incident management process. Key Responsibilities: Leadership & Oversight: Provide strategic direction for...
-
Site Reliability Engineer
5 days ago
Chennai, Tamil Nadu, India Centific Global Technologies Full time
Job Title: Site Reliability Engineer - AI/ML Operations. About the Role: Centific Global Technologies is seeking an experienced Site Reliability Engineer to join our team and lead the development of our AI/ML operations infrastructure. This individual will be responsible for designing, building, and maintaining scalable and reliable systems for our data and AI...
-
Site Reliability Engineer
1 month ago
Chennai, Tamil Nadu, India Bounteous Full time
Incident Manager: The Incident Manager is responsible for owning the outcomes of the incident management process and leading a team of 24/7 site reliability engineers within the technology department. This role involves strategic oversight, resource management, and effective coordination of response efforts to minimize disruptions. Key Responsibilities: Provide...
-
Site Reliability Engineer
4 weeks ago
Chennai, Tamil Nadu, India Bounteous Full time
Job Title: Incident Manager. Job Summary: We are seeking an experienced Incident Manager to lead our team of 24/7 site reliability engineers within the technology department. The ideal candidate will have a strong background in cloud technologies and a proactive approach to identifying and resolving issues before they impact the business. Key...
-
Senior Site Reliability Engineer
3 weeks ago
Chennai, Tamil Nadu, India Athenahealth Full time
Job Summary: We are seeking a Senior Site Reliability Engineer to join our Service Operations, Site Reliability Engineering team within the Cloud Infrastructure Engineering division. The team is responsible for managing the fleet of systems owned by its sister teams in the Service Operations zone. We are looking for Site Reliability & Infrastructure Engineering...
-
Site Reliability Engineer Leader
2 weeks ago
Chennai, Tamil Nadu, India Bounteous Full time
Seeking an experienced Site Reliability Engineer Leader to lead our 24/7 engineering team and drive incident management processes. The ideal candidate will have a strong background in cloud technologies and a proactive approach to identifying and resolving issues. Responsibilities: Provide strategic direction for the team and meticulous oversight of the...
-
Site Reliability Engineer
1 month ago
Chennai, Tamil Nadu, India Altimetrik Full time
Job Description: We are seeking a highly skilled Site Reliability Engineer to join our team at Altimetrik. The ideal candidate will have a strong background in development and a proven track record of troubleshooting and triaging complex issues. Key Responsibilities: Mandatory Skills: Development experience in Java, .Net, or Python, with a strong focus on code...
-
Site Reliability Engineer
4 weeks ago
Chennai, Tamil Nadu, India Virtusa Full time
Job Title: DevOps Architect. Job Summary: We are seeking a highly skilled DevOps Architect to join our team at Virtusa. As a DevOps Architect, you will be responsible for designing and implementing scalable and efficient cloud infrastructure solutions. Key Responsibilities: Design and implement cloud infrastructure solutions using Hadoop, HBase, Hive, Oozie, and...
-
Site Reliability Engineer
2 months ago
Chennai, Tamil Nadu, India NexionPro Services Full time
Job Title: Site Reliability Engineer (SRE). Location: Chennai (Guindy). Experience: 5-8 years. Notice Period: Immediate or serving notice (August joiners preferred). Work Mode: 5 days in-office. References are highly appreciated. Job Summary: We are seeking a highly skilled Site Reliability Engineer (SRE) to join our team. The ideal candidate will have a solid...
-
Senior Site Reliability Engineer
3 weeks ago
Chennai, Tamil Nadu, India SES Full time
About the Role: The Senior Engineer, Site Reliability is a key member of the SES IT team, responsible for the development, monitoring, operation, and support of the global SES Cloud and on-premise install base, with a strong focus on systems. As the main backup of the Senior Manager IT Systems, the Senior Engineer, Site Reliability will work closely with the...
-
Senior Site Reliability Engineer
4 weeks ago
Chennai, Tamil Nadu, India Athenahealth Full time
About the Role: We are seeking a highly skilled Senior Site Reliability Engineer to join our Service Operations, Site Reliability Engineering team within the Cloud Infrastructure Engineering division. As a key member of this team, you will be responsible for managing the fleet of systems owned by sister teams in the Service Operations zone. Key...
-
Site Reliability Engineering Leader
5 days ago
Chennai, Tamil Nadu, India Athenahealth Full time
As a Senior Site Reliability Engineer at Athenahealth, you will be part of the Service Operations Site Reliability Engineering team within the Cloud Infrastructure Engineering division. This team is responsible for managing the fleet of systems owned by its sister teams in the Service Operations zone. We are looking for experienced professionals to help us...
-
Chennai, Tamil Nadu, India MX Full time
Job Title: Senior Manager of Platform Automation and Site Reliability Engineering. About MX: MX is a dynamic and rapidly growing financial company committed to helping empower the world to be financially strong. We are driven by our moral imperative to advance mankind - and it all starts with our people, product, and purpose. Job Summary: We are seeking a dynamic...
-
Site Reliability Engineer
4 weeks ago
Chennai, Tamil Nadu, India Centific Global Technologies Full time
Job Title: Site Reliability Engineer - AI/ML Operations. Job Summary: Centific Global Technologies is seeking a highly skilled Site Reliability Engineer to lead the AI/ML operations team. The ideal candidate will have a strong background in software release management, SRE, and DevOps, with experience in AI/ML operations, data pipeline management, and...
-
Senior Site Reliability Engineer
1 month ago
Chennai, Tamil Nadu, India Athenahealth Full time
About the Role: We are seeking a highly skilled Senior Site Reliability Engineer to join our Cloud Infrastructure Engineering division. As a key member of our Service Operations Site Reliability Engineering team, you will play a critical role in managing the fleet of systems owned by our sister teams in the Service Operations zone. Key Responsibilities: Provision...
-
Site Reliability Engineer Lead
7 days ago
Chennai, Tamil Nadu, India Bounteous Full time
Overview: Bounteous x Accolite is a leading digital transformation company that empowers ambitious brands to accelerate their growth. Our diverse services include strategy, analytics, digital engineering, cloud, data and AI, experience design, and marketing. We prioritize Co-Innovation, a proven methodology that fosters collaborative partnerships. With over...
-
Senior Site Reliability Engineer
4 weeks ago
Chennai, Tamil Nadu, India Athenahealth Full time
About the Role: We are seeking a highly skilled Senior Site Reliability Engineer to join our Cloud Infrastructure Engineering team. As a key member of our team, you will be responsible for designing, implementing, and maintaining high-availability systems and infrastructure. Key Responsibilities: Provisioning and managing physical and virtual Linux machines...
-
Site Reliability Engineer Lead
3 weeks ago
Chennai, Tamil Nadu, India Bounteous Full time
Incident Management Specialist: Bounteous x Accolite is a leading digital engineering and technology services company that serves the world's most ambitious brands. Our team is comprised of over 5,000 professionals across North America, APAC, and EMEA, and we are proud to offer our clients cutting-edge digital engineering, technology solutions, and data-driven...
-
Chennai, Tamil Nadu, India Centific Global Technologies Full time
Job Description: Centific is a Seattle-based tech company pioneering the future of AI one breakthrough at a time. Learn how we're transforming the world through safe and scalable AI and empowering businesses to unlock the full potential of their data. Key Responsibilities: Strategic Leadership & Vision: Lead and manage the Software Release Management...
-
Reliability Engineering Lead for Medical Devices
3 weeks ago
Chennai, Tamil Nadu, India SVS Corporate Services Full time
Job Summary: We are seeking an experienced Reliability Engineering Lead to join our team at SVS Corporate Services. The ideal candidate will have a strong background in reliability engineering, with a focus on medical devices. This role will involve leading reliability analysis, test planning, and execution for medical devices, as well as collaborating with...