Sr. Site Reliability Engineer
2 days ago
We are looking for someone with experience leading small teams and has a technical leadership mindset as we grow the team. We are building the next generation data security platform for the multi-cloud era - will you join us?
You will:
- Deploy software for Cloud Prem and SAAS customers.
- Respond to and diagnose system incidents in a timely and efficient manner, minimizing downtime and impact on users.
- Collaborate with other engineers to establish root causes and implement effective resolutions.
- Continuously improve incident response processes and documentation for future occurrences.
- Proactively monitor and maintain the health and performance of our infrastructure and services.
- Perform routine administrative tasks such as system configuration, user management, and data backups.
- Identify and implement operational improvements to ensure ongoing system reliability and efficiency.
- Develop and implement scripts and automated solutions to streamline operational tasks and reduce manual workload.
- Participate in the on-call rotation to address critical incidents outside of regular business hours.
- Ensure effective handoff between on-call engineers and document post-incident information for future reference.
- Document processes for support and create, maintain and execute run-books for identified situations
- Provide tier 2/3 technical support to customers experiencing platform issues or requiring advanced troubleshooting
- Work directly with customer technical teams to resolve complex deployment, configuration, and integration challenges
- Conduct technical onboarding sessions and provide guidance on best practices for customer implementations
- Collaborate with customer success teams to ensure smooth customer experiences and rapid issue resolution
- Create and maintain customer-facing technical documentation, troubleshooting guides, and knowledge base articles
- Escalate customer feedback and feature requests to product and engineering teams
- Participate in customer calls and technical discussions to provide expert-level platform guidance
- Track and analyze customer support metrics to identify trends and areas for improvement
- Education:BS degree in Computer Science or related field
- Experience:3+ years of experience in Site Reliability Engineering
- 2+ years experience working with cloud platform and cloud automation tools especially in AWS
- Strong experience with Kubernetes, Linux, AWS networking(VPC) and Terraform
- Experience with the GitOps model for deployment
- Familiarity with distributed version control
- Other:
- Experience with monitoring and alerting tools (e.g., Prometheus, Grafana).
- Bazel and Helm experience a plus
- Understanding of software configuration best practices
- Ability to wear multiple hats in a fast-paced environment
- Hands-on, "can do" attitude and a bias for action
- Low ego and high intellectual curiosity
- Comfortable working across time zones to support global customer base
- Excellent communication skills with ability to explain technical concepts to both technical and non-technical audiences
- Strong customer service orientation with patience and empathy when working with frustrated customers
Our Culture
We're driven to build a strong company culture and are looking for individuals with solid alignment with the following:
- Ownership Mindset
- Act with Integrity
- Guardians of our Customers
- Opinionated Humility
- Build Trust, Earn Trust
At Veza, your base pay is one part of your total compensation package. For this position, the reasonably expected pay range can be discussed with your recruiter for the level at which this job has been scoped. Your base pay will depend on several factors, including your experience, qualifications, education, location, and skills. In the event that you are considered for a different level, a higher or lower pay range would apply. This position is also eligible for equity and a competitive benefits package.
Veza is proud to be an equal opportunity employer. We are committed to equal employment opportunities regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, or other applicable legally protected characteristics. We also consider qualified applicants according to applicable federal, state, and local laws. If a candidate with a disability requires an accommodation during the recruitment process, please email
About Veza
Veza is the identity security company. Identity and security teams use Veza to secure identity access across SaaS apps, on-prem apps, data systems, and cloud infrastructure. Veza solves the blind spots of traditional identity tools with its unique ability to ingest and organize permissions metadata in the Veza Authorization Graph. Global enterprises like Blackstone, Wynn Resorts, and Expedia trust Veza to visualize access permissions, monitor permissions activity, automate access reviews, and remediate privilege violations. Founded in 2020, Veza is headquartered in Redwood City, California, and is funded by Accel, Bain Capital, Ballistic Ventures, GV, Norwest Venture Partners, and True Ventures. Visit us at and follow us on LinkedIn, Twitter, and YouTube.
-
Site Reliability Engineer
2 hours ago
Remote (India) Luma Financial Technologies Full time ₹ 30,000 - ₹ 60,000 per yearAbout the roleAt Luma, our Site Reliability Engineer (SRE) team keeps our platform reliable, secure, and lightning fast. They own everything from AWS infrastructure and Kubernetes clusters to CI/CD pipelines, monitoring, and alerting. If you're passionate about tackling big challenges, automating at scale, and making systems more resilient, we'd love to have...
-
Site Reliability Engineer
2 weeks ago
Remote, India Redlinux Full time ₹ 24,00,000 - ₹ 48,00,000 per yearFreelancing OpportunityJob Title: Site Reliability EngineerExperience Level: 8+ YearsAbout the RoleWe are seeking an experienced Site Reliability Engineer (SRE) to ensure the reliability, performance, and scalability of our production systems. The ideal candidate will have deep expertise in AWS, automation, and infrastructure as code, along with a strong...
-
Site Reliability Engineer
2 weeks ago
Bengaluru, India Relanto Full timeJob Description Job Title: Site Reliability Engineer Summary We are looking for a Site Reliability Engineer to join our Digital & Transformation department. The ideal candidate will have 2-3 years of experience in this field and will be responsible for ensuring the reliability, availability, and performance of our systems and applications. Roles And...
-
Site Reliability Engineer
6 hours ago
India InOrg Full time ₹ 12,00,000 - ₹ 36,00,000 per yearAbout VivaOps :VivaOps is a leading DevSecOps platform company specializing in GitLab - The comprehensive DevOps platform, to transform and secure software development processes. We help organizations to streamline their DevSecOps journey by offering a complete range of GitLab services, from advisory, to implementation and managed services, to accelerate...
-
Site Reliability Engineer
4 weeks ago
, India, IN Sonata Software Full timeWe're Hiring: Senior Site Reliability Engineer Location: Onsite (Office: Hyderabad – Mandatory from Day 1) Employment Type: Full-time Notice Period: Immediate to 15 Days Only Experience: 8+ Years About the RoleWe’re looking for a Senior Site Reliability Engineer (SRE) to lead reliability initiatives across our production systems. This is a high-impact...
-
Site Reliability Engineer
7 days ago
India Akamai Technologies Full timeJob Description Job Description Do you like collaborating across teams to solve complex problems Do you enjoy solving large scale distributed content delivery challenges Join our highly skilled Compute Site Reliability team Our team designs, develops, and manages applications and infrastructure that support Akamai's Compute products and services. We...
-
Site Reliability Engineer, Contract
2 days ago
Remote, India 66degrees Full time ₹ 12,00,000 - ₹ 36,00,000 per yearOverview of 66degrees66degrees is a leading consulting and professional services company specializing in developing AI-focused, data-led solutions leveraging the latest advancements in cloud technology. With our unmatched engineering capabilities and vast industry experience, we help the world's leading brands transform their business challenges into...
-
Senior Site Reliability Engineer
2 weeks ago
Hyderabad, India IntraEdge Full timeJob Description Strong leadership and people management skills. Exceptional technical proficiency in Pearson's technology stack. Strategic thinking with a focus on long-term operational excellence. Champion operational excellence by directing initiatives that elevate system reliability, availability, and overall efficiency. Function as the diplomatic link...
-
Site Reliability Engineer
4 days ago
India Akamai Full time ₹ 8,00,000 - ₹ 24,00,000 per yearDo you like collaborating across teams to solve complex problems?Do you enjoy solving large scale distributed content delivery challenges?Join our highly skilled Compute Site Reliability teamOur team designs, develops, and manages applications and infrastructure that support Akamai's Compute products and services. We specialize in creating solutions that...
-
Site Reliability Engineer
2 days ago
India LivePerson Full time ₹ 9,00,000 - ₹ 12,00,000 per yearLivePerson (NASDAQ: LPSN) is a leading customer engagement company, creating digital experiences powered by Curiously Human AI. Every person is unique, and our technology makes it possible for companies, including leading brands like HSBC, Orange, and GM Financial, to treat their audiences that way at scale. Nearly a billion conversational interactions are...