Site Reliability Engineer

6 hours ago


Hyderabad, India Jigya Software Services Full time

Job Title:
Senior Site Reliability Engineer (SRE) - AWS/Kubernetes

Location:
Hyderabad - Onsite

Job Type:
Full-Time

About the Role:

We are looking for a highly skilled and motivated Site Reliability Engineer to design, build, and maintain our high-performance, scalable cloud infrastructure. You will play a critical role in ensuring the reliability, performance, and efficiency of our systems by applying core SRE principles. You will work closely with development teams to automate deployment, monitoring, and operational processes, turning manual toil into automated solutions.

Key Responsibilities:

  • Design, implement, and manage Kubernetes clusters (EKS) for running containerized applications.
  • Build and maintain reliable, scalable infrastructure on AWS using infrastructure-as-code.
  • Develop automation scripts (Shell/Python) for provisioning, deployment, and self-healing capabilities.
  • Implement comprehensive monitoring and alerting solutions (e.g., Prometheus, Grafana, CloudWatch) to ensure system health and performance.
  • Troubleshoot complex issues across the entire stack: hardware, software, application, and network.
  • Participate in capacity planning, performance analysis, and system tuning.
  • Champion SRE best practices, including blameless post-mortems and error budget management.
  • Collaborate with software engineering teams to improve services and achieve higher levels of reliability.

Required Qualifications:

  • Bachelor's degree in Computer Science, Engineering, or a related field.
  • 5+ years of experience in a Site Reliability, DevOps, or Infrastructure Engineering role.
  • Strong hands-on experience with Kubernetes in production environments.
  • In-depth knowledge of AWS cloud services (EC2, IAM, EKS, S3, VPC, CloudWatch).
  • Proficiency in scripting with Shell and/or Python.
  • Solid understanding of Linux operating systems and networking fundamentals.
  • Experience with infrastructure as code tools like Terraform or CloudFormation.
  • Excellent problem-solving skills and a keen attention to detail.
  • Strong interpersonal and collaboration skills.

Preferred Qualifications:

  • AWS Certified Kubernetes Specialist or AWS DevOps Engineer Professional.
  • Experience with monitoring tools like Prometheus, Grafana, or Datadog.
  • Knowledge of CI/CD pipelines and tools.


  • Hyderabad, Telangana, India Talent Worx Full time US$ 1,20,000 - US$ 2,00,000 per year

    Talent Worx is seeking a talented SRE (Site Reliability Engineer) to enhance our technology team. In this role, you will be pivotal in ensuring the reliability, performance, and availability of our applications and services.Your work will involve both software engineering and systems operations as you strive to improve customer experiences and operational...


  • Hyderabad, Telangana, India Jigya Software Services Full time ₹ 1,50,000 - ₹ 28,00,000 per year

    Job Title:Senior Site Reliability Engineer (SRE) - AWS/KubernetesLocation:Hyderabad - OnsiteJob Type:Full-TimeAbout the Role:We are looking for a highly skilled and motivated Site Reliability Engineer to design, build, and maintain our high-performance, scalable cloud infrastructure. You will play a critical role in ensuring the reliability, performance, and...


  • Hyderabad, Telangana, India IntraEdge Full time

    Site Reliability EngineerExperience: 7+ YearsLocation: HyderabadHybrid 4-day office and 1 Day remoteSkills for Principal:- Strong leadership and people management skills.- Exceptional technical proficiency in Pearson's technology stack.- Advanced project management capabilities.- Excellent communication and collaboration skills.- Adept at risk assessment and...


  • Hyderabad, Telangana, India IntraEdge Full time US$ 1,20,000 - US$ 2,00,000 per year

    Site Reliability EngineerExperience: 7+ YearsLocation: HyderabadSkills for Principal:Strong leadership and people management skills.Exceptional technical proficiency in Pearson's technology stack.Advanced project management capabilities.Excellent communication and collaboration skills.Adept at risk assessment and crisis management.Strategic thinking with a...


  • Hyderabad, India IntraEdge Full time

    Site Reliability Engineer Experience: 7+ Years Location: Hyderabad Skills for Principal: Strong leadership and people management skills. Exceptional technical proficiency in Pearson's technology stack. Advanced project management capabilities. Excellent communication and collaboration skills. Adept at risk assessment and crisis management. Strategic thinking...


  • Hyderabad, Telangana, India ServiceNow Full time

    Site Reliability Engineer (SRE)Experience : 6+ YearsAbout the Role : We are seeking a seasoned SRE to ensure the reliability, availability, and performance of our critical services. You will combine software engineering with systems administration to create scalable and highly reliable software systems.Responsibilities : - Design, build, and maintain...


  • Hyderabad, Telangana, India IntraEdge Full time

    Site Reliability Engineer Experience: 7+ Years Location: Hyderabad Hybrid 4-day office and 1 Day remote Skills for Principal: Strong leadership and people management skills. Exceptional technical proficiency in Pearson's technology stack. Advanced project management capabilities. Excellent communication and collaboration skills. Adept at risk assessment...


  • Hyderabad, Telangana, India IntraEdge Full time

    Site Reliability EngineerExperience: 7+ YearsLocation: HyderabadHybrid 4-day office and 1 Day remoteSkills for Principal:Strong leadership and people management skills.Exceptional technical proficiency in Pearson's technology stack.Advanced project management capabilities.Excellent communication and collaboration skills.Adept at risk assessment and crisis...


  • Hyderabad, Telangana, India INDIGLOBE IT SOLUTIONS PRIVATE LIMITED Full time

    Job Summary :We are looking for a Senior Site Reliability Engineer (SRE) to join our growing Engineering team. As an SRE, you will play a key role in ensuring the reliability, scalability, and performance of our production systems across a multi-cloud environment (GCP & AWS). Youll be responsible for owning application support, maintaining our microservices...


  • Hyderabad, India Employ Full time

    Role - Site Reliability Engineer (SRE)/ Platform Engineering/ or DevOps Engineering rolesLocation – Fully RemoteType - 6 months ContractWork Ex - 5+ YrsWe’re working with a AI product company that’s building the next generation of GenAI powered developer platforms.We’re looking for an experienced Site Reliability Engineer to join their Platform...