
Site Reliability Engineer
1 week ago
Job Description
What You'll Do
- Collaborate with engineering teams to provide feedback and contribute code where needed, enhancing product functionality and resilience.
- Participate in on-call rotations to ensure 24x7 availability of services.
- Design and develop tools to support 24x7 follow-the-sun operations for critical production systems.
- Automate deployment tasks for core products and infrastructure, maintaining a robust automation framework.
- Monitor and optimize the performance of applications on the Guidewire Cloud Platform, ensuring reliability and efficiency.
- Develop and maintain observability tools, metrics, and dashboards, including self-healing mechanisms for increased reliability.
- Foster a culture of reliability by promoting blameless postmortems, SLO tracking, and continuous learning from incidents.
- Proactively identify and address infrastructure issues to minimize business impact.
- Develop system documentation and training materials to empower and educate team members.
Who You Are
- Skilled in programming with Python or Go for building internal tools, CLIs, and APIs; familiarity with Java and Spring Boot is a plus.
- Exceptional troubleshooting skills, with a proactive, critical approach to solving complex issues.
- Proficient in containerization technologies, with hands-on expertise in Docker, Helm, Kubernetes (EKS), CNI, and Ingress networking.
- Strong knowledge of Kubernetes concepts (Pods, Deployments, Services, StatefulSets, Ingress, etc.) and the Operator pattern.
- Experienced with Terraform, including developing and testing complex modules.
- Advanced experience with AWS, including custom tool development using the AWS SDK.
- Solid understanding of Single Sign-On (SSO), SAML, and OAuth protocols; experience with Okta is a bonus.
- Skilled in using observability tools such as Prometheus, OpenTelemetry, or Datadog for proactive monitoring.
- Background in supporting production systems at scale in a heavily microservice-based environment.
- Familiar with agile methodologies, including Scrum and Kanban, to enhance software development processes.
- Excellent communication skills, with the ability to explain complex technical concepts to diverse audiences.
Other Requirements
- Bachelor's degree in Computer Science or a related field.
- Ability to read, write, and speak English.
- We provide 24x7 support to our customers, so we expect you to take turns with your teammates being on call for weekend production emergencies or providing rotating weekend operational support.
- Travel: expect occasional travel (less than 5%) to other Guidewire offices for training and team meetings.
Bonus Points
- Kubernetes or AWS certifications
- Contributions to open source projects
- Familiarity with KubeVela (OAM) or Crossplane for Kubernetes-native infrastructure management
- Experience managing large-scale Aurora PostgreSQL clusters and Aurora Serverless
- Experience with TeamCity CI or GitHub Actions
-
Site Reliability Engineer
22 hours ago
Bengaluru, Karnataka, India AppHelix Full time ₹ 9,00,000 - ₹ 12,00,000 per year
Role Description: This is a full-time on-site role located in Bengaluru for a Site Reliability Engineer. The Site Reliability Engineer will be responsible for maintaining and improving the reliability of AppHelix's systems. Daily tasks include monitoring system performance, troubleshooting issues, managing infrastructure, and supporting software development....
-
Site Reliability Engineer
1 week ago
Bengaluru, India Programming Full time
Role: Site Reliability Engineering. Location: Bengaluru. Years of Experience: 4+ years. Professional & Technical Skills: Must-Have Skills: Proficiency in Site Reliability Engineering. Good-to-Have Skills: Experience with cloud service providers such as AWS, Azure, or Google Cloud. Strong understanding of CI/CD tools and practices. Experience with...
-
Site Reliability Engineer
24 hours ago
Bengaluru, Karnataka, India FIS Full time ₹ 12,00,000 - ₹ 36,00,000 per year
About the Role: Site Reliability Engineer (SRE) with deep expertise in Mainframe technologies like COBOL, JCL, etc. to support and enhance our Card Management & Payment processing functions. This role will be responsible for ensuring reliability, high availability, scalability, stability and performance of mission-critical mainframe software applications and...
-
Site Reliability Engineer
2 weeks ago
Bengaluru, India ViewSonic Full time
Job Requirements: Bachelor's degree in Computer Science, Engineering, or a related field. 3+ years of experience in a relevant role, such as Site Reliability Engineer, DevOps Engineer, or similar, is preferred but not mandatory. Basic understanding of AWS solutions including EC2, S3, CloudWatch, Lambda, and RDS. Interest and understanding of Platform Engineering...
-
Site reliability engineer
1 week ago
Bengaluru, India ViewSonic Full time
Job Requirements: Bachelor's degree in Computer Science, Engineering, or a related field. 3+ years of experience in a relevant role, such as Site Reliability Engineer, DevOps Engineer, or similar, is preferred but not mandatory. Basic understanding of AWS solutions including EC2, S3, CloudWatch, Lambda, and RDS. Interest and understanding of Platform...
-
Site Reliability Engineer
5 days ago
Bengaluru, India HDFC Limited Full time
Hiring for Lead / Sr Site Reliability Engineer for Mumbai & Bangalore locations. Experience: 8 - 14 years. Job Purpose: Analysing, troubleshooting, and designing vital services, platforms, and infrastructure on GCP while always thinking about reliability, scalability, resilience, security, and performance. Job Responsibilities: Help build a Site Reliability...
-
Site Reliability Engineer
1 week ago
Bengaluru, India ViewSonic Full time
Job Requirements: 1. Bachelor's degree in Computer Science, Engineering, or a related field. 2. 3+ years of experience in a relevant role, such as Site Reliability Engineer, DevOps Engineer, or similar, is preferred but not mandatory. 3. Basic understanding of AWS solutions including EC2, S3, CloudWatch, Lambda, and RDS. 4. Interest and understanding of...
-
Site Reliability Engineer
1 week ago
Bengaluru, India FOSS United Full time
Site Reliability Engineer at ZEISS India. Posted on September 11, 2025. ZEISS India, Kadubeesanahalli, Bengaluru. Full time.
Job Description: ZEISS in India is headquartered in Bengaluru and present in the fields of Industrial Quality Solutions, Research Microscopy Solutions, Medical Technology, Vision Care...
-
Site Reliability Engineer
3 days ago
Bengaluru, India HDFC Limited Full time
Hiring for Lead / Sr Site Reliability Engineer for Mumbai & Bangalore locations. Experience: 8 - 14 years. Job Purpose: Analysing, troubleshooting, and designing vital services, platforms, and infrastructure on GCP while always thinking about reliability, scalability, resilience, security, and performance. Job Responsibilities: Help build a Site Reliability...
-
Site Reliability Engineer
4 weeks ago
Bengaluru, Karnataka, India Enterprise Minds, Inc Full time
We're Hiring | Site Reliability Engineer | 8-10 years