Senior Site Reliability Engineer

4 months ago


Bengaluru, India Okta, Inc. Full time

Get to know Okta

Okta is The World’s Identity Company. We free everyone to safely use any technology—anywhere, on any device or app. Our Workforce and Customer Identity Clouds enable secure yet flexible access, authentication, and automation that transforms how people move through the digital world, putting Identity at the heart of business security and growth. 

At Okta, we celebrate a variety of perspectives and experiences. We are not looking for someone who checks every single box - we’re looking for lifelong learners and people who can make us better with their unique experiences. 

Join our team We’re building a world where Identity belongs to you.

Workforce Identity Cloud

Okta Workforce Identity Cloud (WIC) provides easy, secure access for your workforce so you can focus on other strategic priorities—like reducing costs, and doing more for your customers.

If you like to be challenged and have a passion for solving large-scale automation, testing, and tuning problems, we would love to hear from you. The ideal candidate is someone who exemplifies the ethics of, “If you have to do something more than once, automate it” and who can rapidly self-educate on new concepts and tools.

The Engineering Opportunity

We are seeking an exceptional SRE/Devops Engineer who is experienced in building software systems to manage and deploy reliable, performant infrastructure and product code at scale on a cloud infrastructure. 

This engineer will join our group responsible for designing, implementing and maintaining services/frameworks that automates actions against production infrastructure. These services enable engineers across product and infrastructure engineering groups to safely, reliably and repeatedly execute runbooks and other actions on test, preview and production environments.

As part of this team, you will also work on new efforts to keep Okta’s infrastructure practices at par with the best industry standards. You will also interface with teams involved with deployments, operations, release engineering, product and data to address process bottlenecks with code and automate time consuming jobs. You will work hands-on with Kubernetes on GCP / AWS to help Okta’s services run seamlessly in both cloud environments. You will also be involved in the maintenance and debugging of team owned services as part of incident response.

What you’ll be doing 

Design, build, maintain and deploy tools that allow Okta’s engineers to execute infrastructure production changes and deploy code. Manage multiple environments spanning a globally distributed infrastructure. Improve environment visibility and management in a repeatable and automatable way. Collaborate with all engineering and operations teams to improve overall product health and reliability. Respond to production incidents and determine how we can prevent them in the future. Triage and troubleshoot complex production issues to ensure reliability and performance. Design and build scalable and extensible platforms/services/tools in Java, Python, Go with a focus on automation and reliability. Work cross functionally with Operations and Product teams to identify bottlenecks and manual processes. Build solutions that provide scale and reliability to address these issues. Leverage industry best practices in infrastructure, automation, orchestration to explore greenfield opportunities that will form the basis of future infrastructure improvements. Identify areas for automation that are self-serviceable to reduce manual onboarding. Develop tools and processes to address these areas. Work on improving the security posture of team owned services and infrastructure. This would involve base image maintenance, updating hosts with newer library versions from vendors as well as services with vulnerability free libraries if and when they are identified.

What we are looking for

5+ years of Experience with Java, Go, Python or similar backend languages 5+ years of experience building, maintaining and debugging services, internal tools and frameworks  3+ years experience automating and deploying large scale production services in AWS, GCP or similar 3+ years of hands on experience working with Kubernetes, with a good understanding of Kuberentes fundamentals

#LI-Hybrid

What you can look forward to as an Full-Time Okta employee

Amazing Benefits Making Social Impact Fostering Diversity, Equity, Inclusion and Belonging at Okta 

Okta cultivates a dynamic work environment, providing the best tools, technology and benefits to empower our employees to work productively in a setting that best and uniquely suits their needs. Each organization is unique in the degree of flexibility and mobility in which they work so that all employees are enabled to be their most creative and successful versions of themselves, regardless of where they live. Find your place at Okta today .



  • Bengaluru, Karnataka, India AMEX Full time

    About the Role: We are seeking an experienced Senior Site Reliability Engineer to lead our team in delivering high-quality, reliable technology solutions. The ideal candidate will have a deep understanding of observability tools and methodologies, as well as strong leadership and people management skills. About Us: American Express is a global leader in...


  • Bengaluru, India Barracuda Full time

    Job ID: 25-251Come Join Our Passionate Team! At Barracuda, we make the world a safer place. We believe every business deserves access to cloud-enabled, enterprise-grade security solutions that are easy to buy, deploy, and use. We protect email, networks, data and applications with innovative solutions that grow and adapt with our customers’ journey. More...


  • Bengaluru, India Tech Mahindra (formerly Mahindra Satyam) Full time

    We are looking for Senior SRE who can join with us in Pan India Location.Skill - SRE with GCP platformExp - 6- 13 YrsLocation - Pan IndiaHybrid work locationAs an Off-Shore Site Reliability Engineer, you will have the following preferred skillsets:- Experience with cloud platforms such as Azure, or GCP- Proficiency in scripting languages such as Python,...


  • Bengaluru, India Tech Mahindra Full time

    We are looking for Senior SRE who can join with us in Pan India Location.Skill - SRE with GCP platformExp - 6- 13 YrsLocation - Pan IndiaHybrid work locationAs an Off-Shore Site Reliability Engineer, you will have the following preferred skillsets:- Experience with cloud platforms such as Azure, or GCP- Proficiency in scripting languages such as Python,...


  • Bengaluru, India Ushur Full time

    Location: BangaloreExperience: 6-8 YearsWork Mode: Hybrid/RemoteThe RoleSenior Site Reliability Engineers at Ushur perform a unique blend of customer support engineering, solution engineering, and operational engineering. You will work on our largest customers’ most complex problems and craft intuitive, elegant solutions. You’ll also proactively work...


  • Bengaluru, India Hirextra -World's First Staffing Aggregator Full time

    Role: Site Reliability EngineerExperiences: 6+ yearsLocation: BangaloreMode: HybridBudget: 14 LPAJob Description:Highly skilled Cloud Site Reliability Engineer to ensure high availability, reliability and performance of cloud infrastructure and services.Experience in cloud platforms (AWS, GCP), automation, monitoring, and incident management.Experience in...


  • Bengaluru, India IntraEdge Full time

    Job Title: Site Reliability Engineer (SRE)Location:Gurgaon / BangaloreJob Type:Full-TimeExperience :5+ YearsAbout the Role:We are seeking a skilled Site Reliability Engineer (SRE) to ensure the reliability, scalability, and performance of our applications. You will work with development teams on microservices architecture, cloud platforms (AWS, GCP, Azure),...


  • Bengaluru, India Hirextra -World's First Staffing Aggregator Full time

    Role: Site Reliability Engineer Experiences: 6+ yearsLocation: Bangalore Mode: HybridBudget: 14LPA Job Description: Highly skilled Cloud Site Reliability Engineer to ensure high availability, reliability and performance of cloud infrastructure and services. Experience in cloud platforms (AWS, GCP), automation, monitoring, and incident management. ...


  • Bengaluru, India Hirextra -World's First Staffing Aggregator Full time

    Role: Site Reliability Engineer Experiences: 6+ yearsLocation: Bangalore Mode: HybridBudget: 14LPA Job Description: Highly skilled Cloud Site Reliability Engineer to ensure high availability, reliability and performance of cloud infrastructure and services. Experience in cloud platforms (AWS, GCP), automation, monitoring, and incident management. ...


  • Bengaluru, India Tech Mahindra (formerly Mahindra Satyam) Full time

    We are looking for Senior SRE who can join with us in Pan India Location.Skill - SRE with GCP platformExp - 6- 13 YrsLocation - Pan IndiaHybrid work locationAs an Off-Shore Site Reliability Engineer, you will have the following preferred skillsets: - Experience with cloud platforms such as Azure, or GCP - Proficiency in scripting languages such as Python,...


  • Bengaluru, India Tech Mahindra (formerly Mahindra Satyam) Full time

    We are looking for Senior SRE who can join with us in Pan India Location.Skill - SRE with GCP platformExp - 6- 13 YrsLocation - Pan IndiaHybrid work locationAs an Off-Shore Site Reliability Engineer, you will have the following preferred skillsets: - Experience with cloud platforms such as Azure, or GCP - Proficiency in scripting languages such as Python,...


  • Bengaluru, India Oracle Full time

    Building off our Cloud momentum, Oracle has formed a new organization - Oracle Health Applications & Infrastructure. This team will focus on product development and product strategy for Oracle Health while building out a complete platform supporting modernized, automated healthcare. This is a net new line of business, constructed with an entrepreneurial...


  • Bengaluru, India Tranzeal Incorporated Full time

    Job Title: Site Reliability Engineer (SRE)Location: BangaloreWe're hiring a Site Reliability Engineer to join our team in Bangalore! If you have a strong background in maintaining and scaling cloud services and love automating infrastructure at scale, this is for you.Experience with Ansible and Kubernetes is a MUST-HAVEKey Responsibilities:Manage and scale...


  • Bengaluru, India Tranzeal Incorporated Full time

    Job Title: Site Reliability Engineer (SRE)Location: BangaloreWe're hiring a Site Reliability Engineer to join our team in Bangalore! If you have a strong background in maintaining and scaling cloud services and love automating infrastructure at scale, this is for you.Experience with Ansible and Kubernetes is a MUST-HAVEKey Responsibilities:Manage and scale...


  • Bengaluru, India Tranzeal Incorporated Full time

    Job Title: Site Reliability Engineer (SRE)Location: BangaloreWe're hiring aSite Reliability Engineerto join our team in Bangalore! If you have a strong background in maintaining and scaling cloud services and love automating infrastructure at scale, this is for you.Experience with Ansible and Kubernetes is a MUST-HAVEKey Responsibilities:Manage and scale...


  • Bengaluru, India Tranzeal Incorporated Full time

    Job Title: Site Reliability Engineer (SRE)Location: Bangalore, KAWork Mode: Office (5Days/Week)Position Type: Contract basedWe're hiring aSite Reliability Engineerto join our team in Bangalore! If you have a strong background in maintaining and scaling cloud services and love automating infrastructure at scale, this is for you.Experience with Ansible and...


  • Bengaluru, India Laerdal Bangalore Full time

    As a Senior Site Reliability Engineer, you’ll play a pivotal role in ensuring the reliability and performance of our cloud-based applications and solutions. Collaborating closely with our team, you will cultivate a culture of SRE breaking down silos and managing incidents and problems. Your role will involve developing and implementing innovative solutions...


  • Bengaluru, India Tranzeal Incorporated Full time

    Job Title: Site Reliability Engineer (SRE)Location: Bangalore, KAWork Mode: Office (5Days/Week)Position Type: Contract basedWe're hiring a Site Reliability Engineer to join our team in Bangalore! If you have a strong background in maintaining and scaling cloud services and love automating infrastructure at scale, this is for you.Experience with Ansible and...


  • Bengaluru, India Tranzeal Incorporated Full time

    Job Title: Site Reliability Engineer (SRE)Location: Bangalore, KAWork Mode: Office (5 Days/Week)Position Type: Contract basedWe're hiring a Site Reliability Engineer to join our team in Bangalore! If you have a strong background in maintaining and scaling cloud services and love automating infrastructure at scale, this is for you.Experience with Ansible...


  • Bengaluru, India Laerdal Bangalore Full time

    As a Senior Site Reliability Engineer, you’ll play a pivotal role in ensuring the reliability and performance of our cloud-based applications and solutions. Collaborating closely with our team, you will cultivate a culture of SRE breaking down silos and managing incidents and problems. Your role will involve developing and implementing innovative solutions...