Site Reliability Engineer
21 hours ago
LivePerson (NASDAQ: LPSN) is a leading customer engagement company, creating digital experiences powered by Curiously Human AI. Every person is unique, and our technology makes it possible for companies, including leading brands like HSBC, Orange, and GM Financial, to treat their audiences that way at scale. Nearly a billion conversational interactions are powered by our Conversational Cloud each month.
You'll be successful at LivePerson if you are excited to build something from the ground up. You excel by finding daily opportunities to grow at the same pace as the technology we're building, and you build partnerships that improve our business. Likewise, you're someone who sees feedback as a chance to learn and grow and believes decisions powered by data are the norm. You care about the well-being of others and yourself.
Job Description: Site Reliability Engineer (Platform Engineer) Mid Level (L2)
Location: India (Remote / Hybrid – Bengaluru, Hyderabad, Pune, or Chennai)
Overview:
We are seeking a Mid-Level Site Reliability Engineer (SRE) to join our global Platform Engineering team. As an SRE, your primary responsibility is to ensure that our platform is reliable, scalable, and performant. You'll be the bridge between development and operations — designing automation, improving observability, and maintaining the health of our production systems. You should have what it takes to ask the right questions, identify potential risks early, and raise flags when necessary to maintain a culture of reliability and continuous improvement.
You will:
- Collaborate closely with Developers, QA, and Product teams during sprint planning to understand release plans, dependencies, and infrastructure requirements.
- Participate in the application release cycle, ensuring deployments are automated, consistent, and reliable.
- Manage and operate Kubernetes clusters in Google Kubernetes Engine (GKE) and Amazon Elastic Kubernetes Service (EKS).
- Develop and manage Terraform modules for provisioning and configuring cloud infrastructure across GCP and AWS.
- Standardize service deployments using Helm for templating and versioned releases.
- Build and enhance observability with Prometheus, Grafana, and Datadog to monitor application and platform performance.
- Design, implement, and maintain GitLab CI/CD pipelines for build, test, and deployment automation.
- Drive an automation-first culture by developing scripts and tooling in Python, Go, or Shell to minimize manual effort and improve efficiency.
- Participate in a 24/7 on-call rotation, ensuring quick detection, mitigation, and resolution of incidents.
- Perform root cause analysis (RCA) and contribute to post-incident reviews to prevent recurrence.
- Proactively identify reliability or scalability gaps, raise early warnings, and partner with teams to address systemic risks.
You have:
- 5-8 years of experience as a Site Reliability Engineer, Platform Engineer, or DevOps Engineer.
- Hands-on experience managing Kubernetes clusters (GKE, EKS) in GCP and AWS.
- Strong knowledge of Terraform, Helm, and GitLab CI/CD pipelines.
- Proficiency in Python, Go, or Shell scripting for automation and tooling.
- Experience implementing and managing observability stacks (Prometheus, Grafana, Datadog).
- Deep understanding of Linux systems, cloud networking, and container orchestration concepts.
- Experience working in Agile/Scrum environments and partnering closely with developers.
- Excellent analytical skills with a proactive attitude — able to question assumptions and escalate potential risks early.
Good to Have
- Experience with ArgoCD or Flux (GitOps-based workflows).
- Familiarity with service mesh (Istio, Linkerd) or API gateways.
- Knowledge of cloud cost optimization, autoscaling, or security best practices.
- Experience with incident management tools such as PagerDuty or ServiceNow.
Why Join Us
- Build and operate modern cloud-native platforms using Kubernetes, Terraform, GitLab, Datadog, and Grafana.
- Be part of a global SRE team that values automation, reliability, and innovation.
- Work in a collaborative culture that encourages ownership, learning, and continuous improvement.
- Enjoy flexible working arrangements, competitive compensation, and career growth opportunities including certifications and mentorship.
Why you'll love working here:
As leaders in enterprise customer conversations, we celebrate diversity, empowering our team to forge impactful conversations globally. LivePerson is a place where uniqueness is embraced, growth is constant, and everyone is empowered to create their own success. We're also very proud to have earned recognition from Fast Company, Newsweek, and BuiltIn for being a top innovative, beloved, and remote-friendly workplace.
- Benefits: 15 Days PTO + Casual & Sick Leave
- Insurance: 8 Lakhs Family Floater Coverage; Personal Accident & Life Insurance: 3x of Gross Annual Salary*
The talent acquisition team at LivePerson has recently been notified of a phishing scam targeting candidates applying for our open roles. Scammers have been posing as hiring managers and recruiters in an effort to access candidates' personal and financial information. This phishing scam is not isolated to only LivePerson and has been documented in news articles and media outlets. Please note that any communication from our hiring teams at LivePerson regarding a job opportunity will only be made by a LivePerson employee with an email address.
LivePerson does not ask for personal or financial information as part of our interview process, including but not limited to your social security number, online account passwords, credit card numbers, passport information, and other related banking information. If you have any questions or concerns, please feel free to contact recruiting-