
Only 24h Left Senior Site Reliability Engineer
3 weeks ago
Job Description
Summary
- We re searching for people who are as passionate about working together to deliver quality products and support as we are. Join us and enjoy a career where you can make an impact. You ll be inspired by those around you, and you ll be trusted and empowered to go further.
- As a Site Reliability Engineer, you will be part of a team that is passionately automating everything possible to make Guidewire systems run more efficiently. The Platform team is dedicated full-time to creating and running software that improves the reliability of systems in production, serving hundreds of customers and supporting millions of transactions each day. You will be ensuring the reliability of Guidewire s flagship cloud platform and InsuranceSuite products and building tooling to help ensure efficient operations and optimal availability of all SaaS multi-tenant and customer-focused systems. Platform SREs collaborate closely with Guidewire s core product developers to ensure that the Guidewire core cloud products address functional and non-functional requirements such as availability, performance, observability, and maintainability.
ESSENTIAL DUTIES AND RESPONSIBILITIES
- Collaborate with development teams to enhance the reliability and efficiency of microservices applications.
- Engage with product development (PD) teams by participating in design reviews and production readiness checks.
- Collaborate with engineering teams, providing product feedback and where necessary contribute code to the product
- Work closely with cross-functional teams to ensure seamless integration of new features and services.
- Analyze data from observability and monitoring tools to improve operational metrics of microservices as well as the entire platform.
- Leverage end-to-end technical expertise gained by engagement with multiple PD teams and analyzing observability data to propose improvements in code and design to improve SLO and prevent incidents.
- Create system documentation and training materials to empower and educate our fellow team members
- Take a purist SRE approach to shared multi-tenant infrastructure for a resilient SaaS microservice-based containerized systems in addition to customer-centric application environments
- Oversee and automate the team s growing presence in AWS
- Creatively build and develop tooling to aid in driving 24x7x365 follow-the-sun operations of critical production systems
- Build and maintain observability tooling, metrics, and dashboarding for a global platform product infrastructure
- Improve our incident management lifecycle to identify, mitigate, and learn from reliability risks and issues
- Collaborate with engineering teams, providing product feedback and where necessary contribute code to the product
REQUIRED SKILLS AND EXPERIENCE
Education and Work Experience
- Bachelor s Degree in Computer Science or related field with 10+ Years of experience
- Software engineering and task automation skills with Bash, Python, and/or Go are a must
- Experience in developing and maintaining Java-based web applications, including deployment and support on Apache/Tomcat servers in a live production environment.
- Familiarity with the Agile software development lifecycle
- Deep background with Linux systems and engineering
- Highly experienced with engineering and automating on Amazon Web Services (AWS)
- Prior experience with IaC tools like Terraform/Terragrunt/Terraspace
- Prior experience with devops/gitops tools (Git, Bitbucket, Flux CD, Teamcity) for gate promotions
- Production-At-Scale support background in a heavily microservice-based world
- Hands-on engineering and ops expertise in containerization (Docker, Helm, Kubernetes/EKS, CNI and Ingress networking)
- Strong understanding of Single-Sign On, SAML, OAuth (Bonus if hands-on experience with Okta)
- Seasoned expertise around x.509 certificate technology and basic concepts of encryption
- Experience working with Relational Databases such as Aurora Postgres and/or Oracle RDS
- Advanced exposure to application development, web UI (design and development), JSON, application architecture
- Experience strongly utilizing observability tools (logging/APM) like Datadog, CloudWatch, and PagerDuty.
- Familiarity with event store/stream-processing technologies like Kafka or AWS SQS
- Understanding of Open Application Model systems such as KubeVela or Crossplane
Personal Qualities and Soft Skills
- You greatly prefer writing code than clicking a GUI.
- You enjoy teaching, being a mentor to others, and working across boundaries
- Outstanding troubleshooting skills; ability to think critically and display an aptitude for problem solving
- Strong analytical mind with a penchant for process development and enhancement
- A highly positive can-do attitude with desire for being a team player
- Great communication skills and ability to explain complex technical concepts to a varied audience
- Demonstrate strong follow-through, a strong work ethic and consistently keep and meet commitments
Other Requirements
- Ability to read, write, and speak English
- We provide 24x7 support to our customers, so we expect you to take turns with your teammates being on-call for weekend production emergencies or to provide rotating weekend operational support
- Travel - Expect occasional travel (less than 5%) to other Guidewire offices for training and team meetings
-
Senior Site Reliability Engineer
4 weeks ago
Bengaluru, Karnataka, India Akamai Full timeJob Category Site Reliability Would you like to lead modernization initiatives while building a public cloud platform from scratch Would you like to own critical services in a new public cloud platform Join our IaaS Site Reliability Engineering SRE team We design develop and operate infrastructure and services that power the backbone of our...
-
Bengaluru, Karnataka, India Commonwealth Bank Full timeJob DescriptionOrganization: At CommBank, we never lose sight of the role we play in other people's financial wellbeing. Our focus is to help people and businesses move forward to progress. To make the right financial decisions and achieve their dreams, targets, and aspirations. Regardless of where you work within our organisation, your initiative, talent,...
-
Senior Site Reliability Engineer
4 days ago
Bengaluru, Karnataka, India LanceSoft, Inc. Full time ₹ 6,00,000 - ₹ 8,00,000 per yearRole DescriptionThis is a full-time on-site role for a Senior Site Reliability Engineer based in Bangalore/Chennai/Pune. The Senior Site Reliability Engineer will be responsible for maintaining and enhancing the reliability and performance of the company's IT infrastructure & Development. Daily tasks include troubleshooting system issues, ensuring system...
-
Senior Site Reliability Engineer
4 days ago
Bengaluru, Karnataka, India Quantaleap Full time ₹ 12,00,000 - ₹ 36,00,000 per yearJob Title: Senior Site Reliability EngineerLocation: Remote (occasional travel for team meetings)Experience Required: 5+ YearsDomain: Release Engineering / SRE / DevOpsRole OverviewWe are seeking a Senior Site Reliability Engineer (SRE) to ensure the reliability, scalability, and performance of our systems. The role requires strong expertise in...
-
Senior Site Reliability Engineer
2 days ago
Bengaluru, Karnataka, India Bottomline Full time ₹ 15,00,000 - ₹ 28,00,000 per yearWhy Choose Bottomline?Are you ready to transform the way businesses pay and get paid? Bottomline is a global leader in business payments and cash management, with over 35 years of experience and moving more than $16 trillion in payments annually. We're looking for passionate individuals to join our team and help drive impactful results for our customers. If...
-
Site Reliability Engineer
4 weeks ago
Bengaluru, Karnataka, India WhiteLotus Talent Partners Full timeWe are looking for a L0 and L1 Site Reliability Engineer (SRE) Support to join our Krutrim Cloud Site Reliability operations team and ensure the smooth functioning of our cloud infrastructure powered by OpenStack and Kubernetes. In this role, you will focus on monitoring, basic troubleshooting, and incident response, helping to maintain high system...
-
Senior Site Reliability Engineer
12 hours ago
Bengaluru, Karnataka, India Procore Technologies Full time ₹ 12,00,000 - ₹ 36,00,000 per yearJob DescriptionWe're looking for aSenior Site Reliability Engineerto join Procore's Product & Technology Team. Procore software solutions aim to improve the lives of everyone in construction and the people within Product & Technology are the driving force behind our innovative, top-rated global platform. We're a customer-centric group that encompasses...
-
Site Reliability Engineer 3 Days Left
4 weeks ago
Bengaluru, Karnataka, India WhiteLotus Talent Partners Full timeWe are looking for a L0 and L1 Site Reliability Engineer (SRE) Support to join our Krutrim Cloud Site Reliability operations team and ensure the smooth functioning of our cloud infrastructure powered by OpenStack and Kubernetes. In this role, you will focus on monitoring, basic troubleshooting, and incident response, helping to maintain high system...
-
Senior Site Reliability Engineer
2 weeks ago
Bengaluru, Karnataka, India Saviynt Full time ₹ 12,00,000 - ₹ 36,00,000 per yearAbout the job Saviynt's AI-powered identity platform manages and governs human and non-human access to all of an organization's applications, data, and business processes. Customers trust Saviynt to safeguard their digital assets, drive operational efficiency, and reduce compliance costs. Built for the AI age, Saviynt is today helping organizations safely...
-
Senior Site Reliability Engineer
13 hours ago
Bengaluru, Karnataka, India Procore Full time ₹ 12,00,000 - ₹ 36,00,000 per yearJob DescriptionWe're looking for a Senior Site Reliability Engineer to join Procore's Product & Technology Team. Procore software solutions aim to improve the lives of everyone in construction and the people within Product & Technology are the driving force behind our innovative, top-rated global platform. We're a customer-centric group that encompasses...