
Site Reliability Engineer
2 weeks ago
SailPoint is the leader in identity security for the cloud enterprise. Our identity security solutions secure and enable thousands of companies worldwide, giving our customers unmatched visibility into the entirety of their digital workforce, ensuring workers have the right access to do their job – no more, no less.
IdentityNow is SailPoint's Identity as a Service (IDaaS) product, and the Site Reliability Engineer will be a key player on our Reliability Engineering team servicing the IdentityNow product suite. We are looking for engineers with broad experience in building and running distributed systems at global scale. If you enjoy analyzing complicated problems, innovating creative solutions, and collaborating across teams to build reliable, scalable, and impactful solutions, come join our Reliability Engineering team. We are a team of people that write software to solve scalability, observability, security, reliability, and operability problems.
What You'll Make Happen:
- Make it easy for everyone to create, consume, manage, and scale reliable cloud production services to achieve more.
- Work independently or collaboratively on SailPoint SaaS services to design, develop, and improve end-to-end reliability and maintainability for all services
- Coach engineering teams on observability best practices such as setting up well defined Service Level Objectives (SLOs).
- Lead engineering teams through post-incident reviews to define effective preventive actions.
- Collaborate effectively with developers to increase system reliability through short-term embedding programs.
- Enable our engineering teams to scale our enterprise operations by providing guidance, best practices and support as part of an SRE Center of Excellence
- Manage cross-functional requirements working with Engineering, Product, Services, and other departments.
- Develop and implement automation tools and processes to streamline operations and enhance system performance.
- Be a mentor of quality for design reviews, code, test cases, automation, observability, root cause analysis, and self-healing.
- Influence architectural design, implementation, consolidation, and simplification for global scale
- Focuses on expanding own skills and looking at improving their teammates' skills..
- Drive operational excellence to deliver frictionless operation, happy on call, and optimal customer experience.
Requirements
- 5 + years experience in SRE or DevOps production operations supporting a highly available environment for SaaS software or cloud service provider.
- Experience with cloud infrastructure environments, preferably AWS, and Infrastructure as code , preferably Terraform .
- Experience with containerization technology and/or Kubernetes.
- Experience with metrics, tracing, and logging observability tools such as Prometheus, Grafana, Honeycomb , and Kibana.
- Experience with incident management, including conducting incident reviews.
- Experience with programming languages (Java, Python, Go, etc). Strong understanding of Linux, software development, systems, networking, and Cloud concepts.
- Strong interpersonal and teaming skills - ability to set and enforce process and influence engineers who are not direct reports.
- Have excellent communication skills- English fluency of C1 or higher preferred
- Bachelor's degree in Computer Science or other technical discipline, or equivalent experience is preferred, not required.
Within the first 30 days you will:
- Onboard into your new role, get familiar with our product offering and technology stack.
- If applicable, come up to speed on Identity Access Management space .
- Get to know your peers, leaders and other engineers to understand current state, c hall enges and motivations.
Get to understand the current state of our reliability practices
By 90 days:
- Contribute to the technical architecture of our reliability and capacity planning practices, providing architectural ideas.
- Look beyond the immediate backlog to create and share a forward-thinking technical vision for your team .
You are prioritizing projects and defining scopes of work and developing solutions.
By 6 months:
- You are regularly mentoring and coaching members of your team .
- You own multiple significant projects .
- You provide technical leadership to your team while also delivering high quality code on your own.
- Lead significant and thoughtful critiques of others' design document.
- Consistently achieve targets and meet deadlines; you ensure the quality of deliverables exceeds expectations.
- You are flexing your lifelong learning muscles, staying abreast of emerging external technologies to understand when to introduce them to your team.
SailPoint is an equal opportunity employer and we welcome all qualified candidates to apply to join our team. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, protected veteran status, or any other category protected by applicable law.
Alternative methods of applying for employment are available to individuals unable to submit an application through this site because of a disability. Contact or mail to 11120 Four Points Dr, Suite 100, Austin, TX 78726, to discuss reasonable accommodations. NOTE: Any unsolicited resumes sent by candidates or agencies to this email will not be considered for current openings at SailPoint.
-
Site Reliability Engineer
1 week ago
Remote, India Immersive Infotech Pvt Ltd Full time ₹ 11,600 - ₹ 13,920 per yearSite Reliability Engineer (SRE)Location: RemoteDuration: Long-Term ContractExperience: 5–7 YearsWe are seeking an experienced Site Reliability Engineer (SRE) to join our team and contribute to developing robust, scalable, and automated systems.Key Skills & Responsibilities:Strong proficiency in Jenkins and Groovy scriptingHands-on experience with Python...
-
Site Reliability Engineer
2 weeks ago
Remote, India Immersive Infotech Pvt Ltd Full time ₹ 7,92,000 - ₹ 13,20,000 per yearJob Title: Site Reliability Engineer (SRE)Experience: 6+ YearsWork Hours: European Time Zone (till 9:30 PM IST)Location: Remote/Offshore (India)Key ResponsibilitiesManage and optimize Windows and Linux server environments in Azure.Ensure system reliability, uptime, and performance aligned with defined SLOs/SLAs/SLIs.Lead incident management and drive root...
-
Site Reliability Engineer
3 weeks ago
Remote, India Rackspace Technology Full timeJob DescriptionSite Reliability Engineer / Observability EngineerPublic Cloud - Offerings and Delivery - Workforce Mgmt & Delivery Ops /Full - Time / RemoteRackspace is building up its Professional Services Center of Excellence on Application Performance Monitoring Suites.If you enjoy solving complex business problems and can contribute to building next...
-
Site Reliability Engineer 3
12 hours ago
Remote, India Granicus Full time ₹ 15,00,000 - ₹ 25,00,000 per yearJob Summary:Opening from Default - All locations The Company Serving the People Who Serve the People Granicus is driven by the excitement of building, implementing, and maintaining technology that is transforming the Govtech industry by bringing governments and their constituents together. We are on a mission to support our customers by meeting the needs of...
-
Senior Site Reliability Engineer
1 week ago
Remote, India Techolution Full time US$ 1,50,000 - US$ 2,00,000 per year1 day agoRemote, India|Full Time|SeniorSkills RequiredNon-Negotiable Skills:Cloud Platforms (AWS, GCP, Azure)Infrastructure as Code (Terraform, CloudFormation)Containerization (Docker, Kubernetes)Monitoring and ObservabilityScripting (Python, Bash)CI/CD PipelinesOwnershipSeeker MindsetPassionate Towards WorkExtremely AmbitiousUnbeatable Work EthicsAbility to...
-
Site Reliability Engineer
2 weeks ago
India Remote Cyberhaven Full time ₹ 15,00,000 - ₹ 20,00,000 per yearAbout the role We're looking for an experienced Site Reliability engineer for making sure systems are reliable, scalable, and performing well especially in production environments. Our technology is new and rapidly evolving as an early member on the team, you'll play a key role in shaping the reliability architecture, building scalable infrastructure, and...
-
Site Reliability Engineer
4 days ago
Remote, India Rackspace Technology Full timeJob Description Site Reliability Engineer / Observability Engineer Public Cloud - Offerings and Delivery - Workforce Mgmt & Delivery Ops / Full - Time / Remote Rackspace is building up its Professional Services Center of Excellence on Application Performance Monitoring Suites. If you enjoy solving complex business problems and can contribute to building...
-
Senior Site Reliability Engineer
2 weeks ago
Remote, India OutSystems Full time US$ 1,20,000 - US$ 2,00,000 per yearThere are NO limits to your career: come shape the future and be part of a truly unique global culture at OutSystemsAbout the roleSite Reliability Engineering (SRE) is a discipline that incorporates aspects of software engineering and applies them to infrastructure and operations problems. The main goals of SRE are to create scalable and highly reliable...
-
Site Reliability Engineer
1 day ago
Pacific Remote Islands Marine National Monument, India Immersive Infotech Pvt Ltd Full timeJob Title: Site Reliability Engineer (SRE) Experience: 6+ Years Work Hours: European Time Zone (till 9:30 PM IST) Location: Remote/Offshore (India) Key Responsibilities Manage and optimize Windows and Linux server environments in Azure. Ensure system reliability, uptime, and performance aligned with defined SLOs/SLAs/SLIs. Lead incident management and drive...
-
Site Reliability Engineer 3
1 day ago
Pacific Remote Islands Marine National Monument, India Granicus Full timeJob Summary: Opening from Default - All locations The Company Serving the People Who Serve the People Granicus is driven by the excitement of building, implementing, and maintaining technology that is transforming the Govtech industry by bringing governments and their constituents together. We are on a mission to support our customers by meeting the needs of...