Site Reliability Engineer

2 months ago


Bengaluru, India NetApp Full time

Title: Site Reliability Engineer (SRE)

Location: Bangalore, Karnataka, IN, 560071

Requisition ID: 127074

Job Summary

As a Site Reliability Engineer (SRE) with a specialization in storage, you'll manage and optimize a portfolio of customer-facing cloud services (SaaS/IaaS) on Google Cloud Platform (GCP), ensuring their overall availability, performance, and security. You will collaborate closely with global teams from NetApp and GCP, with a primary focus on supporting Google Cloud NetApp Volumes. This position includes rotational on-call work as part of a global team due to the critical nature of the services we support.

You will work in a dynamic, fast-paced environment as an engineer on the Site Reliability Engineering (SRE) team. This team is responsible for assisting customers of Google Cloud NetApp Volumes in resolving complex technical issues in production environments. We are seeking an SRE with a deep understanding of storage systems, complex distributed systems, and cloud technologies, and the ability to articulate these concepts clearly to customers and fellow engineers.

You will work with your teammates and our customers to support innovative, cutting-edge technologies that address real-world challenges. You will provide valuable feedback and guidance to our Product and Engineering teams while representing the voice of our customers. You will have the opportunity to make a significant impact and take real ownership of your work.

Job Requirements

o Collaborate with external customers and partners to ensure their success with Google Cloud NetApp Volumes.
o Respond to, troubleshoot, and drive root cause analysis (RCA) of complex live production incidents, including cross-platform issues involving OS, networking, and databases in cloud-based SaaS/IaaS environments, following SRE best practices.
o Continuously monitor, analyze, and measure system health, availability, and latency using tools like Prometheus, Google Cloud Monitoring, Elasticsearch, Grafana, and SolarWinds. Develop and implement steps to improve system and application performance, availability, and reliability.
o Document system knowledge, create runbooks, and ensure critical system information is readily available.
o Stay up to date with security trends and proactively identify, diagnose, and resolve complex security issues.
o Maintain and monitor the deployment and orchestration of servers, Docker containers, databases, and general backend infrastructure.
o Automate manual tasks and system components that would benefit from automation.
o Utilize Atlassian Jira to track issues to resolution based on their priority.
o Engage in incident management processes and resolve issues within agreed SLAs/SLOs.

o Extensive experience in storage technologies and incident management processes.
o Advanced knowledge of Linux operating systems (e.g., Ubuntu, CentOS).
o Proficiency in container-based architecture (e.g., Kubernetes).
o Intermediate to advanced knowledge of automation tools and scripting languages such as Ansible, Python, Bash, Go, and PowerShell.
o Solid understanding of algorithms, data structures, and databases (SQL/NoSQL).
o Intermediate knowledge of networking concepts.
o Hands-on experience with cloud environments, particularly GCP.
o Exceptional debugging skills across various platforms and technologies.
o Familiarity with site reliability engineering principles and best practices.

Education

BE in Computer Science or a related field, or 6+ years of professional experience in a relevant role. 


Job Segment: Cloud, Software Engineer, SQL, Linux, Database, Technology, Engineering


