Site Reliability Engineer Lead

4 weeks ago


Chennai, Tamil Nadu, India Bounteous Full time
Job Title: Site Reliability Engineer Lead

We are seeking a highly skilled Site Reliability Engineer Lead to join our team at Bounteous x Accolite. As a Site Reliability Engineer Lead, you will be responsible for owning the outcomes of the incident management process and leading a team of 24/7 site reliability engineers within the technology department.

Key Responsibilities:

* Provide strategic direction for the team and meticulous oversight of the incident management process, ensuring smooth navigation through the incident life cycle.
* Allocate resources effectively, including personnel and tools, to address incidents promptly and provide the necessary 24/7 coverage.
* Oversee the development of automation scripts and tools to reduce manual intervention and improve system efficiency using our APM tools.
* Coordinate with cross-functional teams, manage communication with stakeholders, and provide regular status updates.
* Guide teams in making informed decisions and implementing solutions during incident responses.
* Lead investigations to determine root causes and implement corrective actions to prevent recurrence.

Requirements:

* Strong background in cloud technologies and a proactive approach to identifying and resolving issues before they impact the business.
* Proficiency in using monitoring and alerting tools, such as New Relic and Datadog.
* Ability to analyze and interpret alerts and logs to pinpoint the source of the issue.
* Ability to quickly identify and prioritize critical issues.
* Experience with incident management processes and tools, such as PagerDuty.
* Strong problem-solving skills to diagnose and resolve system and application issues.
* Proficiency in using diagnostic tools and techniques, such as logs analysis, tracing, and profiling.
* Strong working knowledge of operating systems, such as Linux and Windows, and system administration tasks.
* Familiarity with key system components, such as CPU, memory, disk, and network.
* Basic knowledge of database management and troubleshooting, such as MySQL, PostgreSQL, and MS-SQL.
* Experience with managing cloud resources and troubleshooting cloud-specific issues.
* Clear and concise communication skills to convey the status and impact of the outage to stakeholders.
* Ability to coordinate effectively with different teams, such as development, operations, and support.
* Ability to remain calm and focused under pressure.
* Effective time management to handle multiple tasks and prioritize urgent issues.

About Bounteous x Accolite:

Bounteous x Accolite is a global digital agency that helps ambitious brands succeed in a rapidly changing world. We are guided by Co-Innovation, our proven methodology of collaborative partnership. Our team of 5000+ employees spans North America, APAC, and EMEA, and we partner with leading technology providers. We are committed to promoting an inclusive environment and are proud to be an equal opportunity employer.

How to Apply:

If you are a motivated and experienced Site Reliability Engineer Lead looking for a new challenge, please submit your application. We look forward to hearing from you.

  • Chennai, Tamil Nadu, India Bounteous Full time

    About the RoleWe are seeking a skilled Site Reliability Engineer Lead to join our team at Bounteous x Accolite. As a key member of our technology department, you will be responsible for leading a team of 24/7 site reliability engineers and overseeing the incident management process.Key ResponsibilitiesLeadership & Oversight: Provide strategic direction for...


  • Chennai, Tamil Nadu, India Centific Global Technologies Full time

    Job Title:Site Reliability Engineer - AI/ML OperationsAbout the Role:Centific Global Technologies is seeking an experienced Site Reliability Engineer to join our team and lead the development of our AI/ML operations infrastructure. This individual will be responsible for designing, building, and maintaining scalable and reliable systems for our data and AI...


  • Chennai, Tamil Nadu, India Bounteous Full time

    Incident ManagerThe Incident Manager is responsible for owning the outcomes of the incident management process and leading a team of 24/7 site reliability engineers within the technology department. This role involves strategic oversight, resource management, and effective coordination of response efforts to minimize disruptions.Key Responsibilities:Provide...


  • Chennai, Tamil Nadu, India Bounteous Full time

    Job Title: Incident ManagerJob Summary: We are seeking an experienced Incident Manager to lead our team of 24/7 site reliability engineers within the technology department. The ideal candidate will have a strong background in cloud technologies and a proactive approach to identifying and resolving issues before they impact the business.Key...


  • Chennai, Tamil Nadu, India Athenahealth Full time

    Job SummaryWe are seeking a Senior Site Reliability Engineer to join our Service Operations, Site Reliability Engineering team within the Cloud Infrastructure Engineering division.The Team is responsible for managing the fleet of systems owned by its sister teams in the Service Operations zone.We are looking for Site Reliability & Infrastructure Engineering...


  • Chennai, Tamil Nadu, India Bounteous Full time

    Seeking an experienced Site Reliability Engineer Leader to lead our 24/7 engineering team and drive incident management processes. The ideal candidate will have a strong background in cloud technologies and a proactive approach to identifying and resolving issues.ResponsibilitiesProvide strategic direction for the team and meticulous oversight of the...


  • Chennai, Tamil Nadu, India Altimetrik Full time

    Job Description:We are seeking a highly skilled Site Reliability Engineer to join our team at Altimetrik. The ideal candidate will have a strong background in development and a proven track record of troubleshooting and triaging complex issues.Key Responsibilities:Mandatory Skills:Development experience in Java, .Net, or Python, with a strong focus on code...


  • Chennai, Tamil Nadu, India Virtusa Full time

    Job Title: DevOps ArchitectJob Summary:We are seeking a highly skilled DevOps Architect to join our team at Virtusa. As a DevOps Architect, you will be responsible for designing and implementing scalable and efficient cloud infrastructure solutions.Key Responsibilities:Design and implement cloud infrastructure solutions using Hadoop, HBase, Hive, Oozie, and...


  • Chennai, Tamil Nadu, India NexionPro Services Full time

    Job Title : Site Reliability Engineer (SRE)Location : Chennai (Guindy)Experience : 5-8 yearsNotice Period : Immediate or serving notice (August joiners preferred)Work Mode : 5 days in-officeReferences are highly appreciated.Job Summary : We are seeking a highly skilled Site Reliability Engineer (SRE) to join our team. The ideal candidate will have a solid...


  • Chennai, Tamil Nadu, India SES Full time

    About the RoleThe Senior Engineer, Site Reliability is a key member of the SES IT team, responsible for the development, monitoring, operation, and support of the global SES Cloud and on-premise install base, with a strong focus on systems.As the main backup of the Senior Manager IT Systems, the Senior Engineer, Site Reliability will work closely with the...


  • Chennai, Tamil Nadu, India Athenahealth Full time

    About the Role:We are seeking a highly skilled Senior Site Reliability Engineer to join our Service Operations, Site Reliability Engineering team within the Cloud Infrastructure Engineering division. As a key member of this team, you will be responsible for managing the fleet of systems owned by sister teams in the Service Operations zone.Key...


  • Chennai, Tamil Nadu, India Athenahealth Full time

    As a Senior Site Reliability Engineer at Athenahealth, you will be part of the Service Operations Site Reliability Engineering team within the Cloud Infrastructure Engineering division. This team is responsible for managing the fleet of systems owned by its sister teams in the Service Operations zone.We are looking for experienced professionals to help us...


  • Chennai, Tamil Nadu, India MX Full time

    Job Title:Senior Manager of Platform Automation and Site Reliability EngineeringAbout MX:MX is a dynamic and rapidly growing financial company committed to helping empower the world to be financially strong. We are driven by our moral imperative to advance mankind - and it all starts with our people, product, and purpose.Job Summary:We are seeking a dynamic...


  • Chennai, Tamil Nadu, India Centific Global Technologies Full time

    Job Title: Site Reliability Engineer - AI/ML OperationsJob Summary:Centific Global Technologies is seeking a highly skilled Site Reliability Engineer to lead the AI/ML operations team. The ideal candidate will have a strong background in software release management, SRE, and DevOps, with experience in AI/ML operations, data pipeline management, and...


  • Chennai, Tamil Nadu, India Athenahealth Full time

    About the RoleWe are seeking a highly skilled Senior Site Reliability Engineer to join our Cloud Infrastructure Engineering division. As a key member of our Service Operations Site Reliability Engineering team, you will play a critical role in managing the fleet of systems owned by our sister teams in the Service Operations zone.Key ResponsibilitiesProvision...


  • Chennai, Tamil Nadu, India Bounteous Full time

    OverviewBounteous x Accolite is a leading digital transformation company that empowers ambitious brands to accelerate their growth. Our diverse services include strategy, analytics, digital engineering, cloud, data and AI, experience design, and marketing. We prioritize Co-Innovation, a proven methodology that fosters collaborative partnerships.With over...


  • Chennai, Tamil Nadu, India Athenahealth Full time

    About the Role:We are seeking a highly skilled Senior Site Reliability Engineer to join our Cloud Infrastructure Engineering team. As a key member of our team, you will be responsible for designing, implementing, and maintaining high-availability systems and infrastructure.Key Responsibilities:Provisioning and managing physical and virtual Linux machines...


  • Chennai, Tamil Nadu, India Bounteous Full time

    Incident Management SpecialistBounteous x Accolite is a leading digital engineering and technology services company that serves the world's most ambitious brands. Our team is comprised of over 5,000 professionals across North America, APAC, and EMEA, and we are proud to offer our clients cutting-edge digital engineering, technology solutions, and data-driven...


  • Chennai, Tamil Nadu, India Centific Global Technologies Full time

    Job Description :Centific is a Seattle-based tech company pioneering the future of AI one breakthrough at a time. Learn how we're transforming the world through safe and scalable AI and empowering businesses to unlock the full potential of their data.Key Responsibilities : Strategic Leadership & Vision :- Lead and manage the Software Release Management...


  • Chennai, Tamil Nadu, India SVS Corporate Services Full time

    Job SummaryWe are seeking an experienced Reliability Engineering Lead to join our team at SVS Corporate Services. The ideal candidate will have a strong background in reliability engineering, with a focus on medical devices.This role will involve leading reliability analysis, test planning, and execution for medical devices, as well as collaborating with...