Site Reliability Engineer- Platform Engineering

1 week ago


India Weekday AI Full time ₹ 15,00,000 - ₹ 25,00,000 per year

This role is for one of Weekday's clients
Min Experience: 4 years
JobType: full-time

We are looking for an experienced and motivated Site Reliability Engineer (SRE) – Platform Engineering to join our growing technology team. In this role, you will be responsible for designing, building, and maintaining scalable, resilient, and secure infrastructure platforms that support business-critical applications and services. The SRE will work at the intersection of software development and systems engineering to ensure the availability, performance, and reliability of our platforms.

This role requires deep expertise in automation, cloud-native technologies, monitoring, and platform operations. The ideal candidate is passionate about solving complex infrastructure challenges, streamlining deployment pipelines, and building highly reliable systems.

Key Responsibilities
  • Platform Engineering: Design, implement, and optimize platform services and infrastructure to ensure high availability, scalability, and performance.
  • Reliability & Resilience: Build self-healing and fault-tolerant systems while proactively identifying and eliminating reliability risks.
  • Automation: Develop Infrastructure as Code (IaC) solutions using tools like Terraform, Ansible, or CloudFormation to automate infrastructure provisioning and configuration.
  • Monitoring & Observability: Implement monitoring, logging, and alerting systems using tools such as Prometheus, Grafana, ELK, or Datadog to track platform health and performance.
  • Incident Management: Troubleshoot incidents, perform root cause analysis, and ensure timely resolution while minimizing downtime and customer impact.
  • DevOps & CI/CD: Collaborate with development teams to enhance CI/CD pipelines for seamless deployment and integration, ensuring reliability in production environments.
  • Cloud Infrastructure: Manage cloud environments (AWS, Azure, or GCP) and optimize for cost, security, and performance.
  • Security & Compliance: Implement security best practices, monitor vulnerabilities, and ensure compliance with industry standards across infrastructure and platforms.
  • Collaboration: Partner with software engineers, product teams, and IT operations to align infrastructure capabilities with business requirements.
  • Continuous Improvement: Analyze existing infrastructure and processes, identifying areas for improvement, and implementing best practices for operational efficiency.
  • Capacity Planning: Forecast infrastructure requirements, ensuring the platform is always prepared to handle current and future workloads.
Qualifications & Skills
  • Bachelor's degree in Computer Science, Information Technology, or related field. Equivalent practical experience may be considered.
  • 4+ years of experience in Site Reliability Engineering, DevOps, or Platform Engineering.
  • Strong proficiency with cloud platforms (AWS, Azure, or GCP).
  • Hands-on experience with Infrastructure as Code (Terraform, Ansible, or CloudFormation).
  • Solid understanding of Linux systems administration, networking, and container orchestration (Docker, Kubernetes).
  • Experience with CI/CD pipelines (Jenkins, GitLab CI, or similar tools).
  • Proficiency in scripting/programming languages such as Python, Go, Bash, or Java.
  • Strong knowledge of monitoring and observability tools (Prometheus, Grafana, ELK, Datadog, Splunk).
  • Familiarity with incident response and on-call support practices.
  • Knowledge of security best practices and compliance frameworks.
  • Excellent problem-solving, debugging, and analytical skills.
  • Strong communication and collaboration abilities to work effectively across cross-functional teams.


  • Bengaluru, India Relanto Full time

    Job Description Job Title: Site Reliability Engineer Summary We are looking for a Site Reliability Engineer to join our Digital & Transformation department. The ideal candidate will have 2-3 years of experience in this field and will be responsible for ensuring the reliability, availability, and performance of our systems and applications. Roles And...


  • India InOrg Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    About VivaOps :VivaOps is a leading DevSecOps platform company specializing in GitLab - The comprehensive DevOps platform, to transform and secure software development processes. We help organizations to streamline their DevSecOps journey by offering a complete range of GitLab services, from advisory, to implementation and managed services, to accelerate...


  • India Grootan Technologies Full time

    About the Role We are seeking a skilled Site Reliability Engineer (SRE) with 4–5 years of hands-on experience to join our engineering team. In this role, you will be responsible for building and maintaining reliable, scalable, and secure infrastructure to support our applications. You will leverage your expertise in automation, cloud platforms, and...


  • India LivePerson Full time ₹ 9,00,000 - ₹ 12,00,000 per year

    LivePerson (NASDAQ: LPSN) is a leading customer engagement company, creating digital experiences powered by Curiously Human AI. Every person is unique, and our technology makes it possible for companies, including leading brands like HSBC, Orange, and GM Financial, to treat their audiences that way at scale. Nearly a billion conversational interactions are...


  • India LivePerson Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    LivePerson (NASDAQ: LPSN) is a leading customer engagement company, creating digital experiences powered by Curiously Human AI. Every person is unique, and our technology makes it possible for companies, including leading brands like HSBC, Orange, and GM Financial, to treat their audiences that way at scale. Nearly a billion conversational interactions are...


  • India Akamai Technologies Full time

    Job Description Job Description Do you like collaborating across teams to solve complex problems Do you enjoy solving large scale distributed content delivery challenges Join our highly skilled Compute Site Reliability team Our team designs, develops, and manages applications and infrastructure that support Akamai's Compute products and services. We...


  • Hyderabad, India UBS Full time

    Job Description Job Reference # 322870BR Job Type Full Time Your role Are you an analytic thinker Do you enjoy Site Reliability Engineering initiatives and proactive problem management across on-premises & Cloud Database ensuring high availability & stability of Database infrastructure services Do you want to play a key role in transforming our firm into an...


  • India Akamai Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    Description Do you like collaborating across teams to solve complex problems? Do you enjoy solving large scale distributed content delivery challenges?Join our highly skilled Compute Site Reliability teamOur team designs, develops, and manages applications and infrastructure that support Akamai's Compute products and services. We specialize in creating...


  • India CitNOW Group Full time

    About us Founded in 2008, CitNOW is an innovative, enterprise-level software product suite that allows automotive dealerships globally to sell more vehicles and parts more profitably. CitNOW’s app-based platform provides a secure, brand-compliant solution – for dealers to build trust, transparency and long-lasting relationships. CitNOW Group was formed...


  • India CitNOW Group Full time

    About us Founded in 2008, CitNOW is an innovative, enterprise-level software product suite that allows automotive dealerships globally to sell more vehicles and parts more profitably. CitNOW’s app-based platform provides a secure, brand-compliant solution – for dealers to build trust, transparency and long-lasting relationships. CitNOW Group was formed...