Site reliability engineer

1 week ago


Pune, Maharashtra, India | Roche | Full time

The Position

KEY ROLES & RESPONSIBILITIES (required):

Design, implement, and maintain site reliability engineering (SRE) practices that ensure the reliability and performance of our production systems.

  • Design and implement SRE practices that align with the company's overall reliability and performance goals.
  • Develop and maintain automated monitoring and alerting systems to proactively identify and address potential issues.
  • Implement incident response procedures to effectively resolve incidents and minimize downtime.
  • Collaborate with developers and other engineers to define and implement service level agreements (SLAs).
  • Conduct regular reviews of SRE practices to ensure they remain effective and aligned with evolving needs.

Monitor and troubleshoot production systems to identify and resolve issues before they impact users.

  • Continuously monitor production systems for performance degradation, potential failures, and security vulnerabilities.
  • Thoroughly investigate and troubleshoot incidents to identify the root cause and implement corrective actions.
  • Proactively identify potential issues by analyzing system logs, metrics, and trends.
  • Collaborate with developers and other engineers to implement workarounds and fixes for identified issues.
  • Document incident investigations and corrective actions to prevent recurrence and improve future troubleshooting efforts.

Develop and implement automated monitoring and alerting systems.

  • Design and implement automated monitoring systems to collect and analyze real-time data from production systems.
  • Configure alerting systems to notify appropriate personnel of potential issues or performance deviations.
  • Continuously evaluate and improve the effectiveness of automated monitoring and alerting systems.
  • Automate repetitive tasks to improve efficiency and reduce manual intervention.

Collaborate with developers and other engineers to design and implement new features and infrastructure changes.

  • Work closely with developers to understand the impact of new features and code changes on system reliability and performance.
  • Provide guidance and recommendations to developers on SRE best practices and design for reliability.
  • Participate in code reviews to identify potential reliability issues and suggest improvements.
  • Collaborate with infrastructure engineers to ensure that new infrastructure components are designed and deployed with reliability and performance in mind.
  • Stay up-to-date on the latest technologies and trends in SRE and DevOps to contribute to continuous innovation and improvement.

Prepare and deliver technical presentations and documentation.

  • Prepare and deliver technical presentations to share SRE best practices, incident investigations, and lessons learned.
  • Document SRE practices, procedures, and guidelines to ensure knowledge transfer and consistency.
  • Contribute to internal documentation and knowledge bases to aid troubleshooting and problem-solving.
  • Present findings and recommendations to management and stakeholders to inform decision-making processes.

KNOWLEDGE/SKILLS/ATTRIBUTES (required): The minimum education, knowledge, experience, skills, and attributes required to perform the essential functions of this job.

Required Experience, Skills and Qualifications

  • 3 to 6 years of relevant experience.
  • Proven hands-on software/application support experience, with cloud as the main technology area.
  • Strong troubleshooting skills, with the ability to delve deeply into technical details and to acquire or create the knowledge needed to troubleshoot and repair applications effectively.
  • Knowledge of Splunk, VictorOps, AppDynamics, and web automation tools such as Selenium, plus the ability to learn new tools and technologies.
  • Collaborative team player with excellent influence and interpersonal skills; inspires confidence.
  • Experience with public cloud providers, including Amazon Web Services architecture, tools, and cloud methodologies.
  • Proven ability to design, implement, and maintain SRE practices that ensure system reliability and performance.
  • Experience in monitoring and troubleshooting production systems to identify and resolve incidents.
  • Familiarity with automated monitoring and alerting systems, including tool selection, configuration, and maintenance.
  • Experience collaborating with developers and other engineers to design, implement, and operate reliable systems.
  • Excellent written and verbal communication skills.
  • Exposure to handling customers from various geographies.
  • Ability to work with minimum supervision.
  • Team player who shares ideas and resources.
  • Flexibility to work in shifts or on weekends as per schedule.

Education

Bachelor's degree in engineering or informatics.
