Site reliability engineer

2 weeks ago


Pune, India Roche Full time
The Position

KEY ROLES & RESPONSIBILITIES (required):Responsibilities:Design, implement, and maintain site reliability engineering (SRE) practices that ensure the reliability and performance of our production systems.Design and implement SRE practices that align with the company's overall reliability and performance goals.Develop and maintain automated monitoring and alerting systems to proactively identify and address potential issues.Implement incident response procedures to effectively resolve incidents and minimize downtime.Collaborate with developers and other engineers to define and implement service level agreements (SLAs).Conduct regular reviews of SRE practices to ensure they remain effective and aligned with evolving needs.Monitor and troubleshoot production systems to identify and resolve issues before they impact users.Continuously monitor production systems for performance degradation, potential failures, and security vulnerabilities.Thoroughly investigate and troubleshoot incidents to identify the root cause and implement corrective actions.Proactively identify potential issues by analyzing system logs, metrics, and trends.Collaborate with developers and other engineers to implement workarounds and fixes for identified issues.Document incident investigations and corrective actions to prevent recurrence and improve future troubleshooting efforts.Develop and implement automated monitoring and alerting systems.Design and implement automated monitoring systems to collect and analyze real-time data from production systems.Configure alerting systems to notify appropriate personnel of potential issues or performance deviations.Continuously evaluate and improve the effectiveness of automated monitoring and alerting systems.Automate repetitive tasks to improve efficiency and reduce manual intervention.Collaborate with developers and other engineers to design and implement new features and infrastructure changes.Work closely with developers to understand the impact of new features and code changes on system reliability and performance.Provide guidance and recommendations to developers on SRE best practices and design for reliability.Participate in code reviews to identify potential reliability issues and suggest improvements.Collaborate with infrastructure engineers to ensure that new infrastructure components are designed and deployed with reliability and performance in mind.Stay up-to-date on the latest technologies and trends in SRE and DevOps to contribute to continuous innovation and improvement.Prepare and deliver technical presentations and documentation.Prepare and deliver technical presentations to share SRE best practices, incident investigations, and lessons learned.Document SRE practices, procedures, and guidelines to ensure knowledge transfer and consistency.Contribute to internal documentation and knowledge bases to aid troubleshooting and problem-solving.Present findings and recommendations to management and stakeholders to inform decision-making processes.KNOWLEDGE/ SKILLS/ATTRIBUTES (required):

The minimum education, knowledge, experience, skills and attributes required to perform the essential functions of this job.Required Experience, Skills and Qualifications3 – 6 years of relevant experienceProven hands-on Software/Application support with Cloud as main technology area.Troubleshooting and the ability to delve deeply into technical details & acquire/create the necessaryKnowledge to effectively troubleshoot and repair of the applicationsKnowledge Splunk, VictorsOps, Appdynamics, web automation like selenium and ability to learn new tools and technologies.Collaborative team player with excellent influence and interpersonal skills; inspires confidence.Experience with public Cloud providers, including Amazon Web Services architecture, tools, and Cloud methodologies.Proven ability to design, implement, and maintain SRE practices that ensure system reliability and performance.Experience in monitoring and troubleshooting production systems to identify and resolve incidents.Familiarity with automated monitoring and alerting systems, including tool selection, configuration, and maintenance.Experience collaborating with developers and other engineers to design, implement, and operate reliable systems.Excellent written and verbal communicationExposure to handling customers from various geographiesAbility to work with minimum supervisionTeam player who shares ideas and resourcesFlexibility to work in shifts or weekends as per scheduleEducationBachelor degree engineering or informatics.

  • Pune, India LTIMindtree Full time

    We are Hiring DevOps Site Reliability Engineer !!! Exp - 8 to 12 years Location - Pune Banglore & Mumbai NP - Immediate to 60 days JD 5+ years of experience in DevOps, Site Reliability Engineer, or as a developer in SaaS based/enterprise applications • Previous experience within Agile Development or Systems Engineering / automation role • Development...


  • pune, India Ensono Full time

    About Us (Ensono) Ensono is an expert technology adviser and managed service provider. As a relentless ally, we accelerate clients’ digital transformation to achieve business outcomes that stand to last. Our dedicated team helps organizations optimize today’s systems across any hybrid environment with services such as consulting, mainframe and...


  • Pune, India Ensono Full time

    About Us (Ensono) Ensono is an expert technology adviser and managed service provider. As a relentless ally, we accelerate clients’ digital transformation to achieve business outcomes that stand to last. Our dedicated team helps organizations optimize today’s systems across any hybrid environment with services such as consulting, mainframe and...


  • Pune, India Creospan Private Limited Full time

    Creospan is a growing tech collective of makers, shakers, and problem solvers, offering solutions today that will propel businesses into a better tomorrow. “Tomorrow’s ideas, built today!” In addition to being able to work alongside equally brilliant and motivated developers, our consultants appreciate the opportunity to learn and apply new skills and...


  • Pune, India Creospan Private Limited Full time

    Creospan is a growing tech collective of makers, shakers, and problem solvers, offering solutions today that will propel businesses into a better tomorrow. “Tomorrow’s ideas, built today!” In addition to being able to work alongside equally brilliant and motivated developers, our consultants appreciate the opportunity to learn and apply new skills and...


  • Pune, India Creospan Private Limited Full time

    Creospan is a growing tech collective of makers, shakers, and problem solvers, offering solutions today that will propel businesses into a better tomorrow. “Tomorrow’s ideas, built today!” In addition to being able to work alongside equally brilliant and motivated developers, our consultants appreciate the opportunity to learn and apply new skills and...


  • pune, India Ather Full time

    YOU’LL BE OUR: Site Reliability Engineer YOU’LL BE BASED AT: Pune YOU’LL BE ALIGNED WITH: Cloud Architect YOU’LL BE THE MEMBER OF: Web Technologies Team At Ather, we’re passionate about building software that solves problems. We count on our site reliability engineers (SREs) to empower users with a rich feature set, high availability, and...


  • Pune, India Arista Networks Full time

    Site Reliability Engineers at Arista are critical team members that have a breadth of knowledge encompassing all aspects of service delivery. They develop software solutions to enhance, harden and support our service delivery processes. This can include building and managing CI/CD deployment pipelines, automated testing, capacity planning, performance...


  • Pune, India Arista Networks Full time

    Site Reliability Engineers at Arista are critical team members that have a breadth of knowledge encompassing all aspects of service delivery. They develop software solutions to enhance, harden and support our service delivery processes. This can include building and managing CI/CD deployment pipelines, automated testing, capacity planning, performance...


  • pune, India HCLSoftware Full time

    The Role:HCL BigFix is looking for a Site Reliability Engineer to work on infrastructure for a newproduct that will help keep our customers’ end points secure. You will be a part of a teamthat leverages modern technological solutions to drive growth and efficiency. Your dailyresponsibilities will be centered on HCL BigFix’s cloud infrastructure, with...


  • Pune, India Arista Networks Full time

    Site Reliability Engineers at Arista are critical team members that have a breadth of knowledge encompassing all aspects of service delivery. They develop software solutions to enhance, harden and support our service delivery processes. This can include building and managing CI/CD deployment pipelines, automated testing, capacity planning, performance...


  • pune, India Arista Networks Full time

    Site Reliability Engineers at Arista are critical team members that have a breadth of knowledge encompassing all aspects of service delivery. They develop software solutions to enhance, harden and support our service delivery processes. This can include building and managing CI/CD deployment pipelines, automated testing, capacity planning, performance...


  • Pune, India Arista Networks Full time

    Site Reliability Engineers at Arista are critical team members that have a breadth of knowledge encompassing all aspects of service delivery. They develop software solutions to enhance, harden and support our service delivery processes. This can include building and managing CI/CD deployment pipelines, automated testing, capacity planning, performance...


  • Pune, India Creospan Private Limited Full time

    Creospan is a growing tech collective of makers, shakers, and problem solvers, offering solutions today that will propel businesses into a better tomorrow. “Tomorrow’s ideas, built today!” In addition to being able to work alongside equally brilliant and motivated developers, our consultants appreciate the opportunity to learn and apply new skills and...


  • pune, India Roche Full time

    The Position KEY ROLES & RESPONSIBILITIES (required): Responsibilities: Design, implement, and maintain site reliability engineering (SRE) practices that ensure the reliability and performance of our production systems. Design and implement SRE practices that align with the company's overall reliability and performance goals. Develop...


  • Pune, India Roche Full time

    The Position KEY ROLES & RESPONSIBILITIES (required): Responsibilities: Design, implement, and maintain site reliability engineering (SRE) practices that ensure the reliability and performance of our production systems. Design and implement SRE practices that align with the company's overall reliability and performance goals. Develop and...


  • Pune, India Global Payments Full time

    Every day, Global Payments makes it possible for millions of people to move money between buyers and sellers using our payments solutions for credit, debit, prepaid and merchant services. Our worldwide team helps over 3 million companies, more than 1,300 financial institutions and over 600 million cardholders grow with confidence and achieve amazing...


  • pune, India Creospan Private Limited Full time

    Creospan is a growing tech collective of makers, shakers, and problem solvers, offering solutions today that will propel businesses into a better tomorrow. “Tomorrow’s ideas, built today!” In addition to being able to work alongside equally brilliant and motivated developers, our consultants appreciate the opportunity to learn and apply new skills and...


  • Pune, India HCLSoftware Full time

    The Role: HCL BigFix is looking for a Site Reliability Engineer to work on infrastructure for a newproduct that will help keep our customers’ end points secure. You will be a part of a teamthat leverages modern technological solutions to drive growth and efficiency. Your dailyresponsibilities will be centered on HCL BigFix’s cloud infrastructure, with...


  • pune, India HCLSoftware Full time

    The Role: HCL BigFix is looking for a Site Reliability Engineer to work on infrastructure for a new product that will help keep our customers’ end points secure. You will be a part of a team that leverages modern technological solutions to drive growth and efficiency. Your daily responsibilities will be centered on HCL BigFix’s cloud infrastructure,...