Staff SRE, Application SRE

3 days ago

Bengaluru, Karnataka, India Netskope Full time ₹ 20,00,000 - ₹ 25,00,000 per year

About Netskope

Today, there's more data and users outside the enterprise than inside, causing the network perimeter as we know it to dissolve. We realized a new perimeter was needed, one that is built in the cloud and follows and protects data wherever it goes, so we started Netskope to redefine Cloud, Network and Data Security.

Since 2012, we have built the market-leading cloud security company and an award-winning culture powered by hundreds of employees spread across offices in Santa Clara, St. Louis, Bangalore, London, Paris, Melbourne, Taipei, and Tokyo. Our core values are openness, honesty, and transparency, and we purposely developed our open desk layouts and large meeting spaces to support and promote partnerships, collaboration, and teamwork. From catered lunches and office celebrations to employee recognition events and social professional groups such as the Awesome Women of Netskope (AWON), we strive to keep work fun, supportive and interactive. Visit us at Netskope Careers. Please follow us on LinkedIn and .

About the role

Please note, this team is hiring across all levels and candidates are individually assessed and appropriately leveled based upon their skills and experience.

The Application SRE Team supports several critical components of our foundational technologies for real-time protection, as well as Data services. We are a team of software engineers focused on improving availability, latency, performance, efficiency, change management, monitoring, emergency response, and capacity planning of the engineering stacks. If you are passionate about solving complex problems and developing cloud services at scale, we would like to speak with you.

As a SRE MLOps, you will be critical to deploying and managing cutting-edge infrastructure crucial for AI/ML operations, and you will collaborate with AI/ML engineers and researchers to develop a robust CI/CD pipeline that supports safe and reproducible experiments. Your expertise will also extend to setting up and maintaining monitoring, logging, and alerting systems to oversee extensive training runs and client-facing APIs. You will ensure that training environments are optimally available and efficiently managed across multiple clusters, enhancing our containerization and orchestration systems with advanced tools like Docker and Kubernetes.

Work closely with AI/ML engineers and researchers to participate in the designing and architecture of AI ML Applications for scale and reliability. Design and deploy a CI/CD pipeline that ensures safe and reproducible experiments.
Involve in production troubleshooting of AI ML Application code as well as infrastructure configurations.
Set up and manage monitoring, logging, and alerting systems for extensive training runs and client-facing APIs.
Ensure training environments are consistently available and prepared across multiple clusters.
Develop and manage containerization and orchestration systems utilizing tools such as Docker and Kubernetes.
Operate and oversee large Kubernetes clusters with GPU workloads.
Improve reliability, quality, and time-to-market of our suite of software solutions
Measure and optimize system performance, with an eye toward pushing our capabilities forward, getting ahead of customer needs, and innovating for continual improvement
Provide primary operational support and engineering for multiple large-scale distributed software applications

Is this you?

You have professional experience with:
- Model training
Huggingface Transformers
Pytorch
LLM
TensorRT
Infrastructure as code tools like Terraform
Scripting languages such as Python or Bash
Cloud platforms such as Google Cloud, AWS or Azure
Git and GitHub workflows
Tracing and Monitoring
Familiar with high-performance, large-scale ML systems
You have a knack for troubleshooting complex systems and enjoy solving challenging problems
Proactive in identifying problems, performance bottlenecks, and areas for improvement
Take pride in building and operating scalable, reliable, secure systems
Familiar with monitoring tools such as Prometheus, Grafana, or similar
Are comfortable with ambiguity and rapid change

Preferred skills and experience:

Familiar with monitoring tools such as Prometheus, Grafana, or similar
8+ years building core infrastructure
Experience running inference clusters at scale
Experience operating orchestration systems such as Kubernetes at scale

LI-DB1

Netskope is committed to implementing equal employment opportunities for all employees and applicants for employment. Netskope does not discriminate in employment opportunities or practices based on religion, race, color, sex, marital or veteran statues, age, national origin, ancestry, physical or mental disability, medical condition, sexual orientation, gender identity/expression, genetic information, pregnancy (including childbirth, lactation and related medical conditions), or any other characteristic protected by the laws or regulations of any jurisdiction in which we operate.

Netskope respects your privacy and is committed to protecting the personal information you share with us, please refer to Netskope's Privacy Policy for more details.

Staff SRE, Application SRE

2 weeks ago

Bengaluru, Karnataka, India Netskope Full time ₹ 12,00,000 - ₹ 36,00,000 per year

About NetskopeToday, there's more data and users outside the enterprise than inside, causing the network perimeter as we know it to dissolve. We realized a new perimeter was needed, one that is built in the cloud and follows and protects data wherever it goes, so we started Netskope to redefine Cloud, Network and Data Security. Since 2012, we have built...
java/.net SRE application support

5 days ago

Bengaluru, Karnataka, India Natobotics Full time ₹ 6,00,000 - ₹ 12,00,000 per year

TECH MAHINDRA hiring for SRE application supportsqllinux/unixgrafana/splunk/kibanadynatrace/apicaExperience-8yrsLocation-Bangalore/MumbaiTECH MAHINDRA hiring for SRE application supportsqllinux/unixgrafana/splunk/kibanadynatrace/apicaExperience-8yrsLocation-Bangalore/MumbaiTECH MAHINDRA hiring for SRE application
Chief SRE

2 weeks ago

Bengaluru, Karnataka, India Credence HR Services Full time ₹ 20,00,000 - ₹ 25,00,000 per year

Job Title:Chief SRE(IC Role)Location:BengaluruYour responsibilities:As a matured Big Thinker, you'll work closely with senior leaders on the strategic development of the SRE practiceCreating, developing, installing and implementing tools required to support the operational management (including security) of software applications and systemsTesting,...
SRE Consultant

1 day ago

Bengaluru, Karnataka, India RTown Technologies Full time ₹ 12,00,000 - ₹ 36,00,000 per year

Role : SRE Consultant Exp : 8-10 years Notice period : 0-15 days Mode of work : WFO Location : Bangalore Mandatory skills : AWS, Microsoft Azure, Iac, Sre, Site Reliability Engineering, Cloud Operations, software development, Golang, Ruby, Ruby Rails, automation, Cloud Infrastructure. SRE Consultant Job Description Overview The Site Reliability Engineer...
SRE Engineer

1 day ago

Bengaluru, Karnataka, India RingCentral Full time ₹ 20,00,000 - ₹ 25,00,000 per year

Job DescriptionSRE Engineer contributes to the strategic objectives of the System Operations Division by providing RingCentral application services and support. Employees in this position perform day-to-day tasks for running a hybrid cloud-based environment which consists of Linux/Windows based web-application services, Authentication/DNS/NTP infrastructure,...
SRE Engineer

1 day ago

Bengaluru, Karnataka, India RingCentral Full time ₹ 20,00,000 - ₹ 25,00,000 per year

Job DescriptionSRE Engineer contributes to the strategic objectives of the System Operations Division by providing RingCentral application services and support. Employees in this position perform day-to-day tasks for running a hybrid cloud-based environment which consists of Linux/Windows based web-application services, Authentication/DNS/NTP infrastructure,...
SRE L3

3 days ago

Bengaluru, Karnataka, India Wipro Full time ₹ 15,00,000 - ₹ 25,00,000 per year

Mandatory Skills:- SRE Ops with Devops & Observability/Automatio LocationBnagalore Preferred (OK with ) LevelL3- About The Role :Must to haveSRE Ops, AWS Cloud Infra, DevOps, Linux, Observability/Automation, CI/CD,Kubernetes/Docker- Good to haveTools extensive knowledge (likeAppDynamics, Nagios, Splunk, Dynatrace, New Relic, Prometheus, Grafana, ELK, etc.),...
DevOps / SRE with Python

1 day ago

Bengaluru, Karnataka, India Bahwan Cybertek Group Full time ₹ 20,00,000 - ₹ 25,00,000 per year

We are looking for a talented DevOps / SRE Engineer with strong Python skills to join our team at Bahwan Cybertek Group. As a DevOps / SRE Engineer, you will be responsible for maintaining and improving our software development and deployment processes, as well as ensuring the reliability and scalability of our infrastructure.Responsibilities:- Develop and...
SRE Engineer

2 weeks ago

Bengaluru, Karnataka, India Technology Next Full time ₹ 4,20,00,000 - ₹ 10,80,00,000 per year

Site Reliability Engineer (SRE) – 6+ Years | Immediate JoinersLocation: BangaloreContract: 6 months (extendable)About the Role:We are hiring an experienced Site Reliability Engineer (SRE) with 6+ years of experience to ensure system reliability, scalability, and performance across large-scale cloud environments.Key Responsibilities:Monitor, troubleshoot,...
SRE Engineer

2 weeks ago

Bengaluru, Karnataka, India AMERICAN EXPRESS Full time ₹ 12,00,000 - ₹ 36,00,000 per year

Description - ExternalYou Lead the Way. We've Got Your Back.With the right backing, people and businesses have the power to progress in incredible ways. When you join Team Amex, you become part of a global and diverse community of colleagues with an unwavering commitment to back our customers, communities and each other. Here, youll learn and grow as we help...

Americas

Europe

Asia / Oceania

Africa

Staff SRE, Application SRE