
Site Reliability Engineering Specialist
14 hours ago
Platform Stability and Reliability
• Ensure the platform meets performance, availability, and reliability SLAs.
• Proactively identify and resolve performance bottlenecks and risks in production environments.
• Maintain and improve monitoring, logging, and alerting frameworks to detect and prevent incidents.
Incident Management
• Act as the primary responder for critical incidents, ensuring rapid mitigation and resolution.
• Conduct post-incident reviews and implement corrective actions to prevent recurrence.
• Develop and maintain detailed runbooks and playbooks for operational excellence.
Automation and Efficiency
• Build and maintain tools to automate routine tasks, such as deployments, scaling, and failover.
• Contribute to CI/CD pipeline improvements for faster and more reliable software delivery.
• Write and maintain Infrastructure as Code (IaC) using tools like Pulumi or Terraform to provision and manage resources.
Collaboration and Mentorship
• Collaborate with SRE, CI/CD, Developer Experience, and Templates teams to improve the platform’s reliability and usability.
• Mentor junior engineers by sharing knowledge and best practices in SRE and operational excellence.
• Partner with developers to integrate observability and reliability into their applications.
Observability and Metrics
• Implement and optimize observability tools like Dynatrace, Prometheus, or Grafana for deep visibility into system performance.
• Define key metrics and dashboards to track the health and reliability of platform components.
• Continuously analyze operational data to identify and prioritize areas for improvement.
Required:
• 8+ years of experience in site reliability engineering, software engineering, or a related field.
• Demonstrated expertise in managing and optimizing cloud-based environments, with 3+ years of experience in AWS.
• Strong programming skills in one or more languages: Python, Java, Node.js, or TypeScript.
• Hands-on experience with containerization and orchestration technologies (e.g., Kubernetes, Docker).
• Proficiency in CI/CD practices and tools, such as GitLab, Jenkins, or similar.
• Familiarity with monitoring, logging, and alerting tools; experience with Dynatrace is a plus.
Preferred:
• Hands-on experience with Kubernetes (K8s) for container orchestration and deployment.
• Familiarity with monitoring and observability tools like Dynatrace, Prometheus, or similar.
• Exposure to agile development practices and collaborative environments.
• Experience working with other cloud platforms (e.g., Azure or Google Cloud) is a plus.
-
Site Reliability Engineer
1 day ago
Bangalore, India Synechron Full timeWe have immediate opportunity for SRE (Senior Site Reliability Engineer) 5 to 9 years. Synechron – Bangalore Job Role: - SRE (Senior Site Reliability Engineer) Job Location: - Bangalore Notice Period: Within 30days About Synechron We began life in 2001 as a small, self-funded team of technology specialists. Since then, we’ve grown...
-
Site Reliability Engineer
1 day ago
Bangalore, India ViewSonic Full timeJob Requirements: Bachelor's degree in Computer Science, Engineering, or a related field. 3+ year of experience in a relevant role, such as Site Reliability Engineer, DevOps Engineer, or similar, is preferred but not mandatory. Basic understanding of AWS solutions including EC2, S3, CloudWatch, Lambda, and RDS. Interest and understanding of...
-
Site Reliability Engineer
6 hours ago
Bangalore, India ViewSonic Full timeJob Requirements: Bachelor's degree in Computer Science, Engineering, or a related field. 3+ year of experience in a relevant role, such as Site Reliability Engineer, DevOps Engineer, or similar, is preferred but not mandatory. Basic understanding of AWS solutions including EC2, S3, CloudWatch, Lambda, and RDS. Interest and understanding of...
-
Site Reliability Engineer
14 hours ago
bangalore, India WhiteLotus Talent Partners Full timeWe are looking for a L0 and L1 Site Reliability Engineer (SRE) Support to join our Krutrim Cloud Site Reliability operations team and ensure the smooth functioning of our cloud infrastructure powered by OpenStack and Kubernetes. In this role, you will focus on monitoring, basic troubleshooting, and incident response, helping to maintain high system...
-
Site Reliability Engineer
11 hours ago
bangalore, India Synechron Full timeWe have immediate opportunity for SRE (Senior Site Reliability Engineer) 5 to 9 years.Synechron – BangaloreJob Role: - SRE (Senior Site Reliability Engineer)Job Location: - BangaloreNotice Period: Within 30daysAbout SynechronWe began life in 2001 as a small, self-funded team of technology specialists. Since then, we’ve grown our organization to 14,500+...
-
▷ Urgent! Site Reliability Engineer
9 hours ago
Bangalore, India Synechron Full timeWe have immediate opportunity for SRE (Senior Site Reliability Engineer) 5 to 9 years. Synechron – Bangalore Job Role: - SRE (Senior Site Reliability Engineer) Job Location: - Bangalore Notice Period: Within 30days About Synechron We began life in 2001 as a small, self-funded team of technology specialists. Since then, we’ve grown our organization...
-
Site Reliability Engineer
1 day ago
Bangalore, India Synechron Full timeWe have immediate opportunity for SRE (Senior Site Reliability Engineer) 5 to 9 years. Job Role: - SRE (Senior Site Reliability Engineer) We began life in 2001 as a small, self-funded team of technology specialists. Innovative tech solutions for business We're now a leading global digital consulting firm, providing innovative technology solutions for...
-
Site Reliability Engineer
13 hours ago
bangalore, India Employ Full timeRole - Site Reliability Engineer (SRE)/ Platform Engineering/ or DevOps Engineering rolesLocation – Fully RemoteType - 6 months ContractWork Ex - 5+ YrsWe’re working with a AI product company that’s building the next generation of GenAI powered developer platforms.We’re looking for an experienced Site Reliability Engineer to join their Platform...
-
Site Reliability Engineer
1 day ago
Bangalore, India Xebia Full timeWe are seeking an experienced AWS DevOps Engineer with strong expertise in Observability and Site Reliability Engineering (SRE) to design, build, and manage scalable, reliable, and secure cloud environments. The role requires hands-on experience with AWS services, Infrastructure as Code (IaC), CI/CD, monitoring & observability frameworks, and incident...
-
Site Reliability Engineer
1 day ago
Bangalore, India Tavant Full timeAbout Tavant: With 25+ years of experience building innovative digital products and solutions, Tavant provides impactful results to its customers. It has been the frontrunner in driving digital innovation and tech-enabled transformation across a wide range of industries such as Consumer Lending, Manufacturing, Agtech, Media & Entertainment, and Retail in...