Site Reliability Engineer
2 weeks ago
Title: Site Reliability Engineer (SRE)
Location: Bangalore, Karnataka, IN, 560071
Requisition ID: 127074
Job Summary
As a Site Reliability Engineer (SRE) with a specialization in storage, you'll manage and optimize a portfolio of customer-facing cloud services (SaaS/IaaS) on Google Cloud Platform (GCP), ensuring their overall availability, performance, and security. You will collaborate closely with global teams from NetApp and GCP, with a primary focus on supporting Google Cloud NetApp Volumes. This position includes rotational on-call work as part of a global team due to the critical nature of the services we support.
You will be working in a dynamic and fast-paced environment as an engineer on the Site Reliability Engineering (SRE) team. This team is responsible for assisting customers of Google Cloud NetApp Volumes in resolving complex technical issues in production environments. We are seeking an SRE with a deep understanding of storage systems, complex distributed systems, and cloud technologies, and the ability to articulate these concepts clearly to customers and fellow engineers.
You will work with your teammates and our customers to support innovative, cutting-edge technologies that address real-world challenges. You will provide valuable feedback and guidance to our Product and Engineering teams while representing the voice of our customers. You have the opportunity to make a significant impact and take real ownership of your work.
Job Requirements
o Collaborate with external customers and partners to ensure their success with Google Cloud NetApp Volumes.
o Respond to, troubleshoot, and drive root cause analysis (RCA) of complex live production incidents, including cross-platform issues involving OS, networking, and databases in cloud-based SaaS/IaaS environments, following and implementing SRE best practices.
o Continuously monitor, analyze, and measure system health, availability, and latency using tools like Prometheus, Google Cloud Monitoring, Elasticsearch, Grafana, and SolarWinds. Develop and implement steps to improve system and application performance, availability, and reliability.
o Document system knowledge, create runbooks, and ensure critical system information is readily available.
o Stay up to date with security trends and proactively identify, diagnose, and resolve complex security issues.
o Maintain and monitor the deployment and orchestration of servers, Docker containers, databases, and general backend infrastructure.
o Automate manual tasks and any system components that would benefit from automation.
o Utilize Atlassian Jira to track issues to resolution based on their priority.
o Engage in incident management processes and resolve issues within agreed SLAs/SLOs.
o Extensive experience in storage technologies and incident management processes.
o Advanced knowledge of Linux operating systems (e.g., Ubuntu, CentOS).
o Proficiency in container-based architecture (e.g., Kubernetes).
o Intermediate to advanced knowledge of automation tools and scripting languages such as Ansible, Python, Bash, Go, and PowerShell.
o Solid understanding of algorithms, data structures, and databases (SQL/NoSQL).
o Intermediate knowledge of networking concepts.
o Hands-on experience with cloud environments, particularly GCP.
o Exceptional debugging skills across various platforms and technologies.
o Familiarity with site reliability engineering principles and best practices.
Education
BE in Computer Science or a related field, or 6+ years of professional experience in a relevant role.
Job Segment: Cloud, Software Engineer, Computer Science, SQL, Linux, Technology, Engineering
-
Site Reliability Engineer
4 months ago
Bengaluru, India Cricbuzz.com Full time Site Reliability Engineer We are looking for a highly skilled and motivated Web Server Site Reliability Engineer to join our team. As a Web Server Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our web server infrastructure and CDN services. Experience - 3 - 5 years Responsibilities: ●...
-
Site Reliability Engineer
3 weeks ago
Bengaluru, India Cricbuzz.com Full time Site Reliability Engineer We are looking for a highly skilled and motivated Web Server Site Reliability Engineer to join our team. As a Web Server Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our web server infrastructure and CDN services. Experience - 4 - 5 years Responsibilities: ● Design,...
-
Site Reliability Engineer
1 month ago
Bengaluru, India Qure.ai Full time About the job Job Title: Site Reliability Engineer Department: Engineering Location: Bangalore Years of experience: 2-5 years Type: Full Time Employment About Qure.ai: Qure.ai is one of the fastest-growing startups in India, which develops Artificial Intelligence enabled products and platforms for healthcare diagnostics. We create cutting-edge solutions that positively...
-
Site Reliability Engineer
1 week ago
Bengaluru, India Tata Consultancy Services Full time Dear Candidate Greetings from TCS !!! Role: Site Reliability Engineer Location: Bangalore/Chennai/Pune/Delhi Experience Range: 8 to 12 Years Job Description: Exceptional skills in Docker/Kubernetes deployment and configuration, scaling and management of containerized applications. Excellent skills in managing, performance optimisation of complex Prometheus,...
-
Site Reliability Engineering
3 months ago
Bengaluru, India Microsoft Full time Overview Looking to join an exciting industry and organization at the forefront of the next Tech industry transformation? Are you ready to join a team of the world’s best technical experts to enable the success of Microsoft solutions for our commercial & enterprise customers? We are seeking to build out the team of next generation Site Reliability...
-
Site Reliability Engineer
3 months ago
Bengaluru, India tsworks Full time Who We Are tsworks Technologies India Private Limited (subsidiary of The Software Works, Inc, USA) is a technology product and services company. Our mission is to provide domain expertise, innovative solutions and thought leadership to empower businesses to thrive in a digital world. We value our employees, take pride in providing best value in customer...
-
Site Reliability Engineer
1 month ago
Bengaluru, India Qure.ai Full time About the job Job Title: Site Reliability Engineer Department: Engineering Location: Bangalore Years of experience: 2-5 years Type: Full Time Employment About Qure.ai: Qure.ai is one of the fastest-growing startups in India, which develops Artificial Intelligence enabled products and platforms for healthcare diagnostics. We create cutting-edge...
-
Site Reliability Engineer
4 weeks ago
Bengaluru, India NetApp Full time Title: Site Reliability Engineer Location: Bangalore, Karnataka, IN, 560071 Requisition ID: 127625 Job Summary As a Keystone Site Reliability Engineer, you will be operating at the intersection of development and operations. Your role will involve engaging in and enhancing the lifecycle of Keystone services - from monitoring, end of month reporting,...
-
Site Reliability Engineer
4 months ago
Bengaluru, India tsworks Full time Who We Are tsworks Technologies India Private Limited (subsidiary of The Software Works, Inc, USA) is a technology product and services company. Our mission is to provide domain expertise, innovative solutions and thought leadership to empower businesses to thrive in a digital world. We value our employees, take pride in providing best value in customer...
-
Site Reliability Engineer
4 months ago
Bengaluru, India Encora Inc. Full time Position: Site Reliability Engineer Location: Bangalore Experience: 4+ Years Job Mode: Full-time Work Mode: Remote Responsibilities and Duties Collaborate with cross-functional teams to design, implement, and maintain reliable and scalable infrastructure solutions on the Azure cloud platform. Implement and maintain monitoring and logging...
-
Staff Site Reliability Engineer
3 weeks ago
Bengaluru, India nference Full time Staff Site Reliability Engineer: Job Location: Bangalore Work Mode: Hybrid (3 days in the office, 2 days remote) As a Staff Site Reliability Engineer (SRE) at Nference, you will ensure the reliability, scalability, and performance of our nSights platform. Collaborate closely with engineering teams to design, build, and maintain systems supporting our global...
-
Senior Site Reliability Engineer
3 weeks ago
Bengaluru, India nference Full time Senior Site Reliability Engineer (SRE) Job Location: Bangalore Work Mode: Hybrid (3 days in the office, 2 days remote) As a Senior Site Reliability Engineer (SRE) at Nference, you will ensure the reliability, scalability, and performance of our nSights platform. Collaborate closely with engineering teams to design, build, and maintain systems supporting our...
-
Site Reliability Engineer
4 months ago
Bengaluru, India Integra Connect Full time About IntegraConnect Integra Connect delivers a comprehensive, integrated suite of cloud-based technologies and services that enable specialty groups to optimize clinical and financial performance as reimbursement shifts to value-based models. Connected by the IntegraCloud platform, the company’s core applications span population health including care...
-
Site Reliability Engineer
3 weeks ago
Bengaluru, India Zensar Technologies Full time About the Role: Site Reliability Engineer Experience: 5-8 Yrs Location: Bangalore Required Skills: Must have skills: - High level of experience using cloud log management and monitoring data platforms (Dynatrace, Azure Monitor) Hands on experience in Azure Bicep Experience working with Infrastructure as Code and Containerization tools (Terraform, Docker,...
-
Site Reliability Engineer
3 weeks ago
Bengaluru, India Zensar Technologies Full time About the Role: Site Reliability Engineer Experience: 5-8 Yrs Location: Bangalore Required Skills: Must have skills: - High level of experience using cloud log management and monitoring data platforms (Dynatrace, Azure Monitor) Hands on experience in Azure Bicep Experience working with Infrastructure as Code and Containerization tools (Terraform,...
-
Site Reliability Engineer
1 week ago
Bengaluru, India Tata Consultancy Services Full time Dear Candidate Greetings from TCS !!! Role: Site Reliability Engineer Location: Bangalore/Chennai/Pune/Delhi Experience Range: 8 to 12 Years Job Description: Exceptional skills in Docker/Kubernetes deployment and configuration, scaling and management of containerized applications. Excellent skills in managing, performance optimisation of complex Prometheus,...