Senior Site Reliability Engineer
4 days ago
Position Overview:
We are seeking an experienced and technically influential Senior Site Reliability Engineer to join our Cloud Tooling and Pipelines team. This pivotal team drives the strategy, development, and operation of our core Continuous Delivery (CD) platform (leveraging Spinnaker and custom tooling), Infrastructure as Code (IaC) executions (primarily Terraform), and a suite of supporting microservices. These systems are critical for managing our extensive resource footprint across AWS ECS and EKS.
As a Senior Site Reliability Engineer, you will be a key technical leader, setting the architectural vision and driving the implementation of scalable and reliable solutions for automating infrastructure and application deployments. Your deep expertise in both software engineering and cloud operations will be essential in building and maintaining our critical tooling, enhancing our capabilities in infrastructure provisioning, vulnerability management, and IaC deployments. You will also play a vital role in mentoring other engineers and influencing the team's technical roadmap. If you have a strong passion for both building software and managing infrastructure at scale and are driven to solve complex operational challenges through automation, we encourage you to apply.
Key Responsibilities:
- Maintain Platform Strategy: Be a core technical leader in shaping the strategic direction and future evolution of Okta's CD platform (including Spinnaker, Terraform, and custom tools) and related infrastructure automation.
- Architect End-to-End Automation: Lead the design and architecture of robust CD pipelines, Terraform-based IaC workflows, and application deployment processes, ensuring scalability, reliability, and security.
- Design, Build, and Maintain Critical Tooling: Architect, build, maintain, and deploy sophisticated tools and microservices that empower Okta's engineering teams to provision infrastructure, execute production changes, and deploy code with high reliability and efficiency.
- Develop High-Quality Automation Software: Design and build scalable and reliable microservices (potentially in Java, Python, or Go) with a strong focus on automation, operational excellence, and self-service capabilities.
- Drive Cross-Functional Collaboration: Partner closely with Software Engineering, SREs, and Product teams to proactively identify operational bottlenecks and manual processes, leading the design and implementation of scalable and reliable automation solutions.
- Champion DevOps Best Practices: Research and advocate for the adoption of industry best practices in infrastructure automation, continuous delivery, and orchestration to drive innovation and continuous improvement.
- Integrate Security: Apply and promote security best practices throughout the development lifecycle of our tooling and infrastructure automation to ensure a secure and compliant operational environment.
- Deliver Self-Service Capabilities: Proactively identify opportunities to create self-service automation for infrastructure provisioning, application deployments, and other operational tasks, reducing manual effort and improving developer velocity and onboarding.
- Provide Technical Guidance and Mentorship: Serve as a technical mentor and role model for other engineers on the team, fostering a culture of collaboration, innovation, and technical excellence.
Required Qualifications:
- 4+ years of combined experience in Software Engineering and Site Reliability Engineering roles.
- 3+ years of software development experience in Go, or similar backend languages, with a focus on building scalable and reliable applications.
- 4+ years of hands-on experience automating and managing large-scale production infrastructure and services in AWS, GCP, or similar cloud environments.
- Deep understanding and practical experience with containerization and orchestration technologies such as Kubernetes and ECS.
- Strong working knowledge of Continuous Integration/Continuous Delivery (CI/CD) platforms, with experience in Spinnaker and a strong interest in exploring other industry-standard tools.
- Solid understanding of Infrastructure-as-Code (IaC) principles and experience with tools such as Terraform.
- Proficient in using Docker and supporting infrastructure, with strong Linux and networking fundamentals.
- Experience with database technologies (MySQL, MongoDB, etc.) in the context of application development and operational management.
- A strong passion for automation and solving complex operational challenges through software solutions.
- Excellent communication, collaboration, and leadership skills.
- Bachelor's degree in Computer Science or a related field, or equivalent professional experience.
-
Senior Site Reliability Engineer
2 weeks ago
Bengaluru, Karnataka, India Josys Full time
Senior Site Reliability Engineer (SRE)
About JOSYS
Josys, a dynamic B2B SaaS platform startup, has embarked on a mission to revolutionize IT operations globally, following an exceptional launch in Japan and securing $125 million in Series A and B funding. Our platform enables businesses to conquer the complexities of work-from-anywhere setups, rapid digital...
-
Senior Site Reliability Engineer
1 week ago
Bengaluru, Karnataka, India Zorba Consulting Full time ₹ 8,00,000 - ₹ 16,00,000 per year
Description: Senior Site Reliability Engineer (SRE)
Location: Bangalore, India
Experience: 6+ Years
Domain: DevOps/Cloud Infrastructure
About the Role: We are looking for a Senior Site Reliability Engineer (SRE) to join our core engineering team. You will be instrumental in ensuring the reliability, scalability, and performance of our global microservices...
-
Senior Site Reliability Engineer
2 weeks ago
Bengaluru, Karnataka, India Saviynt Full time ₹ 12,00,000 - ₹ 36,00,000 per year
About the job
Saviynt's AI-powered identity platform manages and governs human and non-human access to all of an organization's applications, data, and business processes. Customers trust Saviynt to safeguard their digital assets, drive operational efficiency, and reduce compliance costs. Built for the AI age, Saviynt is today helping organizations safely...
-
Senior Site Reliability Engineer
2 weeks ago
Bengaluru, Karnataka, India Atlassian Full time ₹ 15,00,000 - ₹ 20,00,000 per year
Overview
We are looking for a Senior Site Reliability Engineer who is passionate about scaling Cloud services to join our growing SRE team. The SRE team owns the infrastructure, tooling and automation that Jira and Confluence Cloud runs on, and has a deep understanding of how Atlassian products leverage cloud infrastructure to meet customer reliability...
-
Senior Site Reliability Engineer
4 days ago
Bengaluru, Karnataka, India Okta Full time
Join our team
We're building a world where Identity belongs to you.
Okta's Workforce Identity Cloud Security Engineering group is looking for a Senior Site Reliability Engineer with a passion for DevSecOps, Infrastructure Security, and SRE. Join a team that is not just building solutions but redefining the standards for cloud security. If you have a proven...
-
Senior Site Reliability Engineer
2 weeks ago
Bengaluru, Karnataka, India Aerospike Full time ₹ 15,00,000 - ₹ 1,00,00,000 per year
About Aerospike
Aerospike is the real-time database for mission-critical use cases and workloads, including machine learning, generative, and agentic AI. Aerospike powers millions of transactions per second with millisecond latency, at a fraction of the total cost of ownership compared to other databases. Global leaders, including Adobe, Airtel,...
-
Site Reliability Engineer
7 days ago
Bengaluru, Karnataka, India Acceldata Full time ₹ 20,00,000 - ₹ 25,00,000 per year
You will join a team of highly skilled Hadoop engineers who are responsible for delivering Acceldata's support services in vendor-agnostic environments. As a Site Reliability Engineer, you will actively learn from experienced team members, contributing to improving the availability, scalability, performance, and reliability of our products and our customers'...
-
Senior Site Reliability Engineer
4 days ago
Bengaluru, Karnataka, India Tata Consultancy Services Full time
TCS is Hiring for Senior Site Reliability Engineer (SRE)
Role & responsibilities
Key Responsibilities
Infrastructure & Application Support
- Design, implement, and support infrastructure and services for large-scale production workloads.
- Troubleshoot across the full stack: network, server, OS, application, database, storage, and identity/access management.
- Provide...
-
Site Reliability Engineer
7 days ago
Bengaluru, Karnataka, India Capgemini Full time ₹ 10,00,000 - ₹ 25,00,000 per year
At Capgemini Engineering, the world leader in engineering services, we bring together a global team of engineers, scientists, and architects to help the world's most innovative companies unleash their potential. From autonomous cars to life-saving robots, our digital and software technology experts think outside the box as they provide unique R&D and...
-
Site Reliability Engineering
6 days ago
Bengaluru, Karnataka, India Thakral One Full time US$ 60,000 - US$ 1,20,000 per year
Company Description
Thakral One, headquartered in Singapore, is a technology consulting and services company with a strong presence across Asia. The company specializes in technology-driven consulting, custom solution development, data analytics, and leveraging cloud capabilities to deliver enhanced decision support and practical outcomes. Collaborating...