Site Reliability Engineer

4 days ago


Bangalore, India NetApp Full time

Title: Site Reliability Engineer (SRE)

Location: Bangalore, Karnataka, IN, 560071

Requisition ID: 127074

Job Summary

As a Site Reliability Engineer (SRE) with a specialization in storage, you'll manage and optimize a portfolio of customer-facing cloud services (SaaS/IaaS) on Google Cloud Platform (GCP), ensuring their overall availability, performance, and security. You will collaborate closely with global teams from NetApp and GCP, with a primary focus on supporting Google Cloud NetApp Volumes. This position includes rotational on-call work as part of a global team due to the critical nature of the services we support.

You will be working in a dynamic and fast-paced environment as an engineer on the Site Reliability Engineering (SRE) team. This team is responsible for assisting customers of Google Cloud NetApp Volumes in resolving complex technical issues in production environments. We are seeking an SRE with a deep understanding of storage systems, complex distributed systems, and cloud technologies, and the ability to articulate these concepts clearly to customers and fellow engineers.

You will work with your teammates and our customers to support innovative, cutting-edge technologies that address real-world challenges. You will provide valuable feedback and guidance to our Product and Engineering teams while representing the voice of our customers. You will have the opportunity to make a significant impact and take real ownership of your work.

Job Requirements

o Collaborate with external customers and partners to ensure their success with Google Cloud NetApp Volumes.
o Respond to, troubleshoot, and drive root cause analysis (RCA) of complex live production incidents, including cross-platform issues involving OS, networking, and databases in cloud-based SaaS/IaaS environments, following and applying SRE best practices.
o Continuously monitor, analyze, and measure system health, availability, and latency using tools like Prometheus, Google Cloud Monitoring, Elasticsearch, Grafana, and SolarWinds. Develop and implement steps to improve system and application performance, availability, and reliability.
o Document system knowledge, create runbooks, and ensure critical system information is readily available.
o Stay up-to-date with security trends and proactively identify, diagnose, and resolve complex security issues.
o Maintain and monitor the deployment and orchestration of servers, Docker containers, databases, and general backend infrastructure.
o Automate manual tasks and system components that would benefit from automation.
o Utilize Atlassian Jira to track issues to resolution based on their priority.
o Engage in incident management processes and resolve issues within agreed SLAs/SLOs.

o Extensive experience in storage technologies and incident management processes.
o Advanced knowledge of Linux operating systems (e.g., Ubuntu, CentOS).
o Proficiency in container-based architecture (e.g., Kubernetes).
o Intermediate to advanced knowledge of automation tools and scripting languages such as Ansible, Python, Bash, Go, and PowerShell.
o Solid understanding of algorithms, data structures, and databases (SQL/NoSQL).
o Intermediate knowledge of networking concepts.
o Hands-on experience with cloud environments, particularly GCP.
o Exceptional debugging skills across various platforms and technologies.
o Familiarity with site reliability engineering principles and best practices.

Education

Bachelor of Engineering (BE) in Computer Science or a related field, or 6+ years of professional experience in a relevant role.


Job Segment: Cloud, Software Engineer, Computer Science, Linux, Database, Technology, Engineering


