Senior Staff Site Reliability Engineer
8 hours ago
Job Description

Job Title: Senior Staff Site Reliability Engineer
Location: Bangalore

About Movius

At Movius, we solve a critical gap companies face with employee-to-client communication over voice and messaging. We are the leading global provider of Secure Communication as a Service (SCaaS). Our flagship solution, MultiLine, enhances workflows, resolves compliance gaps and unifies cross-channel messaging. Movius AI-powered solutions enable businesses to build strong and lasting relationships with their customers in a company-owned, controllable system. Welcome to Phone 3.0.

Headquartered in Alpharetta, GA, with offices in Silicon Valley, Bangalore, India, New York, and London, Movius partners with leading global wireless carriers like T-Mobile, Vodafone, TELUS, BT, Singtel & more. To learn more about Movius, visit www.movius.ai.

Your Opportunity

We are looking for a Senior Staff Site Reliability Engineer (SRE) with strong technical expertise in distributed systems, cloud infrastructure, observability, and automation. In this role, you will be responsible for improving the reliability, scalability, and performance of our production and pre-production systems. You will work hands-on in designing and implementing SRE frameworks, automating key reliability workflows, and building a culture of operational excellence. You will also work closely with product engineering, QA, and DevOps teams to define SLOs/SLIs, enhance monitoring and alerting, and strengthen our overall reliability practices.

What You'll Do

- Reliability Engineering & Architecture
  - Design and maintain highly available, fault-tolerant systems on AWS.
  - Implement service reliability models based on SLOs, SLIs, and error budgets.
  - Continuously improve system performance, scalability, and resilience.
- Automation & Infrastructure-as-Code (IaC)
  - Build and maintain automation pipelines using Terraform, Ansible, Bitbucket, and Jenkins.
  - Develop reusable IaC modules for multi-account and multi-environment AWS setups.
  - Automate operational processes for provisioning, scaling, monitoring, and recovery.
- Observability & Monitoring
  - Define observability standards and create dashboards using Elastic Stack, Grafana, or Prometheus.
  - Implement intelligent alerting using AIOps and anomaly detection tools.
  - Work with development teams to ensure proper telemetry and trace coverage.
- Incident Management & RCA
  - Lead major incident response and ensure quick service restoration.
  - Conduct blameless post-incident reviews and implement preventive actions.
  - Create and maintain runbooks, escalation matrices, and reliability playbooks.
- Performance & Capacity Planning
  - Analyse performance bottlenecks and propose tuning or optimization strategies.
  - Lead capacity forecasting and ensure the system can handle growth demands.
- Collaboration & Mentorship
  - Partner with development, QA, and DevOps teams to embed SRE principles.
  - Coach and mentor junior engineers on reliability engineering and automation.
- Documentation & Knowledge Management
  - Maintain detailed architecture diagrams, design documents, and operational procedures.
  - Document SLOs, automation workflows, and change management reports.
- Technical Leadership
  - Lead technical discussions, reliability reviews, and performance retrospectives.
  - Promote a code-driven, automation-first reliability culture across teams.

What You Bring

Education
- Bachelor's degree in Computer Science, Information Technology, or equivalent experience.

Experience
- 8+ years in SRE or DevOps roles managing large-scale distributed systems.
- Proven hands-on experience in cloud operations (AWS preferred), automation, and CI/CD pipelines.
- Experience in the Telecom domain is an added advantage.

Technical Skills
- Deep knowledge of AWS (EKS, EC2, RDS, IAM, VPC, Kafka, CloudWatch, API Gateway, Lambda, WAF, KMS).
- Strong Linux administration and networking fundamentals.
- Skilled in Terraform, Jenkins, Git, and scripting (Python, Bash).
- Solid understanding of observability tools (Grafana, Elastic Stack, Prometheus).
- Experience with container orchestration (Kubernetes) and microservices-based systems.

Certifications (Preferred)
- AWS Certified DevOps Engineer / Solutions Architect Associate.
- Terraform Associate or Certified Kubernetes Administrator (CKA).
- SRE Foundation or Google SRE certification is desirable.

Why Join Movius

- Work on a global-scale platform serving enterprise customers.
- Be part of a high-performing, innovation-driven engineering team.
- Competitive pay, benefits, and opportunities for professional growth.

Ready to build the future of reliable, secure, and intelligent communication? Apply now at www.movius.ai
-
Senior Staff Site Reliability Engineer
23 hours ago
Bengaluru, KA, India Movius Interactive Corporation Full time
Job Description Job Title: Senior Staff Site Reliability Engineer Location: Bangalore About Movius At Movius, we solve a critical gap companies face with employee-to-client communication over voice and messaging. We are the leading global provider of Secure Communication as a Service (SCaaS). Our flagship solution, MultiLine, enhances workflows,...
-
Staff Site Reliability Engineer
1 day ago
India SentinelOne Full time
What are we looking for? The SRE organization's mission at SentinelOne (S1) is to keep our uptime promise to our customers by ensuring we meet our SLOs/SLAs, helping our engineering teams ship software to our customers fast and with quality, and ensuring our customers are successful. As a Staff Site Reliability Engineer, you will be a technical leader within the SRE...
-
Staff Site Reliability Engineer
2 weeks ago
Bengaluru, Karnataka, India Okta Full time ₹ 8,00,000 - ₹ 24,00,000 per year
Join our team. We're building a world where Identity belongs to you. Okta's Workforce Identity Cloud Security Engineering group is looking for a Staff Site Reliability Engineer with a passion for DevSecOps, Infrastructure Security, and SRE. Join a team that is not just building solutions but redefining the standards for cloud security. If you have a proven...
-
Senior Site Reliability Engineer
3 weeks ago
Bengaluru, India Atlassian Full time
Job Description We are looking for a Senior Site Reliability Engineer who is passionate about scaling Cloud services to join our growing SRE team. The SRE team owns the infrastructure, tooling and automation that Jira and Confluence Cloud run on, and has a deep understanding of how Atlassian products leverage cloud infrastructure to meet customer...
-
Senior Site Reliability Engineer
1 week ago
India Capgemini Engineering Full time
Job Description Job Title: Senior Site Reliability Engineer (SRE) Experience: 4+ years Location: Mysuru Employment Type: Full-time About the Role We are seeking a highly skilled Senior Site Reliability Engineer to join our team. The ideal candidate will have deep expertise in Microsoft Azure or AWS, strong experience in building and maintaining reliable,...
-
Senior Staff Site Reliability Engineer
2 weeks ago
Bengaluru, Karnataka, India Zscaler Full time ₹ 15,00,000 - ₹ 20,00,000 per year
About Zscaler Zscaler accelerates digital transformation so our customers can be more agile, efficient, resilient, and secure. Our cloud native Zero Trust Exchange platform protects thousands of customers from cyberattacks and data loss by securely connecting users, devices, and applications in any location. Here, impact in your role matters more than title...
-
Senior Site Reliability Engineer
8 hours ago
Hyderabad, India GSPANN Technologies, Inc Full time
Job Description Unified Dashboards, Elastic Stack (ELK), Loki, Splunk, Dynatrace, Datadog, Grafana, New Relic, Azure, Python, GitLab, Jenkins, Ansible, Terraform, DevOps, SLO/SLAs Monitoring, Incident Response, Root Cause Analysis (RCA), E2E Implementation
Description GSPANN is hiring a Senior Site Reliability Engineer (SRE) to join our team in Pune or...
-
Site Reliability Engineer
3 weeks ago
India Pagos Consultants Full time
We are looking for experienced site reliability engineers to join a founding team of startup-minded individuals that will lay the groundwork for our new fintech offering. This team will play a pivotal role in spearheading innovation. As such, you will have the opportunity to shape the early architecture and design of the system and set the trajectory for its...