Site reliability engineer

4 weeks ago


Pune, India Roche Full time

The Position

KEY ROLES & RESPONSIBILITIES (required):

Responsibilities:

Design, implement, and maintain site reliability engineering (SRE) practices that ensure the reliability and performance of our production systems. Design and implement SRE practices that align with the company's overall reliability and performance goals. Develop and maintain automated monitoring and alerting systems to proactively identify and address potential issues. Implement incident response procedures to effectively resolve incidents and minimize downtime. Collaborate with developers and other engineers to define and implement service level agreements (SLAs). Conduct regular reviews of SRE practices to ensure they remain effective and aligned with evolving needs. Monitor and troubleshoot production systems to identify and resolve issues before they impact users. Continuously monitor production systems for performance degradation, potential failures, and security vulnerabilities. Thoroughly investigate and troubleshoot incidents to identify the root cause and implement corrective actions. Proactively identify potential issues by analyzing system logs, metrics, and trends. Collaborate with developers and other engineers to implement workarounds and fixes for identified issues. Document incident investigations and corrective actions to prevent recurrence and improve future troubleshooting efforts. Develop and implement automated monitoring and alerting systems. Design and implement automated monitoring systems to collect and analyze real-time data from production systems. Configure alerting systems to notify appropriate personnel of potential issues or performance deviations. Continuously evaluate and improve the effectiveness of automated monitoring and alerting systems. Automate repetitive tasks to improve efficiency and reduce manual intervention. Collaborate with developers and other engineers to design and implement new features and infrastructure changes. Work closely with developers to understand the impact of new features and code changes on system reliability and performance. Provide guidance and recommendations to developers on SRE best practices and design for reliability. Participate in code reviews to identify potential reliability issues and suggest improvements. Collaborate with infrastructure engineers to ensure that new infrastructure components are designed and deployed with reliability and performance in mind. Stay up-to-date on the latest technologies and trends in SRE and DevOps to contribute to continuous innovation and improvement. Prepare and deliver technical presentations and documentation. Prepare and deliver technical presentations to share SRE best practices, incident investigations, and lessons learned. Document SRE practices, procedures, and guidelines to ensure knowledge transfer and consistency. Contribute to internal documentation and knowledge bases to aid troubleshooting and problem-solving. Present findings and recommendations to management and stakeholders to inform decision-making processes.

KNOWLEDGE/ SKILLS/ATTRIBUTES (required): The minimum education, knowledge, experience, skills and attributes required to perform the essential functions of this job.

Required Experience, Skills and Qualifications

3 – 6 years of relevant experience Proven hands-on Software/Application support with Cloud as main technology area. Troubleshooting and the ability to delve deeply into technical details & acquire/create the necessary Knowledge to effectively troubleshoot and repair of the applications Knowledge Splunk, VictorsOps, Appdynamics, web automation like selenium and ability to learn new tools and technologies. Collaborative team player with excellent influence and interpersonal skills; inspires confidence. Experience with public Cloud providers, including Amazon Web Services architecture, tools, and Cloud methodologies. Proven ability to design, implement, and maintain SRE practices that ensure system reliability and performance. Experience in monitoring and troubleshooting production systems to identify and resolve incidents. Familiarity with automated monitoring and alerting systems, including tool selection, configuration, and maintenance. Experience collaborating with developers and other engineers to design, implement, and operate reliable systems. Excellent written and verbal communication Exposure to handling customers from various geographies Ability to work with minimum supervision Team player who shares ideas and resources Flexibility to work in shifts or weekends as per schedule

Education

Bachelor degree engineering or informatics.

  • Pune, India Ensono Full time

    About Us (Ensono)Ensono is an expert technology adviser and managed service provider. As a relentless ally, we accelerate clients’ digital transformation to achieve business outcomes that stand to last. Our dedicated team helps organizations optimize today’s systems across any hybrid environment with services such as consulting, mainframe and application...


  • Pune, India Ensono Full time

    About Us (Ensono) Ensono is an expert technology adviser and managed service provider. As a relentless ally, we accelerate clients’ digital transformation to achieve business outcomes that stand to last. Our dedicated team helps organizations optimize today’s systems across any hybrid environment with services such as consulting, mainframe and...


  • pune, India Ensono Full time

    About Us (Ensono) Ensono is an expert technology adviser and managed service provider. As a relentless ally, we accelerate clients’ digital transformation to achieve business outcomes that stand to last. Our dedicated team helps organizations optimize today’s systems across any hybrid environment with services such as consulting, mainframe and...


  • Pune, India Creospan Private Limited Full time

    Creospan is a growing tech collective of makers, shakers, and problem solvers, offering solutions today that will propel businesses into a better tomorrow. “Tomorrow’s ideas, built today!” In addition to being able to work alongside equally brilliant and motivated developers, our consultants appreciate the opportunity to learn and apply new skills and...


  • Pune, India Creospan Private Limited Full time

    Creospan is a growing tech collective of makers, shakers, and problem solvers, offering solutions today that will propel businesses into a better tomorrow. “Tomorrow’s ideas, built today!” In addition to being able to work alongside equally brilliant and motivated developers, our consultants appreciate the opportunity to learn and apply new skills and...


  • Pune, India NCS Group Full time

    Are you looking for value-adding and impactful work?Do you want to make a difference with your expertise?With us, you’ll be able to make it happen.NCS is a leading technology services firm, operating across Asia Pacific in over 20 cities , providing services and solutions in consulting, digital services, technology, and more.We believe in utilizing the...


  • Pune, India Roche Full time

    The PositionKEY ROLES & RESPONSIBILITIES (required):Responsibilities:Design, implement, and maintain site reliability engineering (SRE) practices that ensure the reliability and performance of our production systems.Design and implement SRE practices that align with the company's overall reliability and performance goals.Develop and maintain automated...


  • pune, India Ather Full time

    YOU’LL BE OUR: Site Reliability Engineer YOU’LL BE BASED AT: Pune YOU’LL BE ALIGNED WITH: Cloud Architect YOU’LL BE THE MEMBER OF: Web Technologies Team At Ather, we’re passionate about building software that solves problems. We count on our site reliability engineers (SREs) to empower users with a rich feature set, high availability, and...


  • Pune, India Arista Networks Full time

    Site Reliability Engineers at Arista are critical team members that have a breadth of knowledge encompassing all aspects of service delivery. They develop software solutions to enhance, harden and support our service delivery processes. This can include building and managing CI/CD deployment pipelines, automated testing, capacity planning, performance...


  • Pune, India Arista Networks Full time

    Site Reliability Engineers at Arista are critical team members that have a breadth of knowledge encompassing all aspects of service delivery. They develop software solutions to enhance, harden and support our service delivery processes. This can include building and managing CI/CD deployment pipelines, automated testing, capacity planning, performance...


  • pune, India Arista Networks Full time

    Site Reliability Engineers at Arista are critical team members that have a breadth of knowledge encompassing all aspects of service delivery. They develop software solutions to enhance, harden and support our service delivery processes. This can include building and managing CI/CD deployment pipelines, automated testing, capacity planning, performance...


  • Pune, India NCS Group Full time

    Are you looking for value-adding and impactful work?Do you want to make a difference with your expertise?With us, you’ll be able to make it happen.NCS is a leading technology services firm, operating across Asia Pacific in over 20 cities , providing services and solutions in consulting, digital services, technology, and more.We believe in utilizing the...


  • pune, India NCS Group Full time

    Are you looking for value-adding and impactful work? Do you want to make a difference with your expertise? With us, you’ll be able to make it happen. NCS is a leading technology services firm, operating across Asia Pacific in over 20 cities , providing services and solutions in consulting, digital services, technology, and more. We believe in utilizing the...


  • Pune, India NCS Group Full time

    Are you looking for value-adding and impactful work?Do you want to make a difference with your expertise?With us, you’ll be able to make it happen.NCS is a leading technology services firm, operating across Asia Pacific in over 20 cities , providing services and solutions in consulting, digital services, technology, and more.We believe in utilizing the...


  • pune, India Roche Full time

    The Position KEY ROLES & RESPONSIBILITIES (required): Responsibilities: Design, implement, and maintain site reliability engineering (SRE) practices that ensure the reliability and performance of our production systems. Design and implement SRE practices that align with the company's overall reliability and performance goals. Develop...


  • Pune, India Wipro Full time

    Role Purpose Required Skills: � 5+Years of experience in system administration, application development, infrastructure development or related areas � 5+ years of experience with programming in languages like Javascript, Python, PHP, Go, Java or Ruby � 3+ years of in reading, understanding and writing code in the same � 3+years Mastery of...


  • pune, India Wipro Full time

    Role Purpose Required Skills: � 5+Years of experience in system administration, application development, infrastructure development or related areas � 5+ years of experience with programming in languages like Javascript, Python, PHP, Go, Java or Ruby � 3+ years of in reading, understanding and writing code in the same � 3+years Mastery of...


  • pune, India Wipro Full time

    Role Purpose Required Skills: � 5+Years of experience in system administration, application development, infrastructure development or related areas � 5+ years of experience with programming in languages like Javascript, Python, PHP, Go, Java or Ruby � 3+ years of in reading, understanding and writing code in the same � 3+years Mastery of...


  • Pune, India LTIMindtree Full time

    We are Hiring DevOps Site Reliability Engineer !!!Exp - 8 to 12 yearsLocation - Pune Banglore & MumbaiNP - Immediate to 60 daysJD5+ years of experience in DevOps, Site Reliability Engineer, or as a developer in SaaS based/enterprise applications • Previous experience within Agile Development or Systems Engineering / automation role • Development...


  • Pune, India LTIMindtree Full time

    We are Hiring DevOps Site Reliability Engineer !!!Exp - 8 to 12 yearsLocation - Pune Banglore & MumbaiNP - Immediate to 60 daysJD5+ years of experience in DevOps, Site Reliability Engineer, or as a developer in SaaS based/enterprise applications • Previous experience within Agile Development or Systems Engineering / automation role • Development...