Site Reliability Engineer

2 weeks ago


Bengaluru, Karnataka, India | Awign Expert | Full time

We are seeking a skilled and proactive engineer with expertise in Kubernetes, Java-based applications, and cloud platforms (AWS/Azure/GCP), along with experience in ServiceNow for support ticket management. The ideal candidate will be responsible for maintaining cloud-native applications, troubleshooting production issues, and ensuring smooth operations through effective ticket handling and resolution.

Key Responsibilities:

Kubernetes & Cloud Operations:

  • Deploy, manage, and monitor containerized applications using Kubernetes.
  • Maintain and optimize cloud infrastructure (AWS, Azure, or GCP).
  • Automate deployments and infrastructure using CI/CD pipelines and Infrastructure as Code (IaC) tools like Terraform or Helm.
  • Monitor system performance, availability, and security.

Java Application Support:

  • Troubleshoot and debug Java-based microservices and APIs.
  • Collaborate with development teams to resolve application issues.
  • Participate in code reviews and suggest performance improvements.

ServiceNow (SNOW) Support:

  • Handle incident, problem, and change management via ServiceNow.
  • Raise, track, and resolve support tickets in coordination with internal and external teams.
  • Document root cause analysis (RCA) and resolution steps for recurring issues.

Collaboration & Documentation:

  • Work closely with DevOps, QA, and development teams.
  • Maintain technical documentation, runbooks, and knowledge base articles.
  • Participate in on-call rotations and provide timely support for critical issues.

Required Skills:

  • Strong hands-on experience with Kubernetes and container orchestration.
  • Proficiency in Java and related frameworks (Spring Boot, REST APIs).
  • Experience with cloud platforms (AWS, Azure, or GCP).
  • Familiarity with ServiceNow or similar ITSM tools.
  • Good understanding of CI/CD tools (Jenkins, GitLab CI, etc.).
  • Knowledge of monitoring tools (Prometheus, Grafana, ELK, etc.).

Qualifications:

  • Bachelor's or Master's degree in Computer Science, Computer Engineering, or a related technical discipline.
  • Ability to work independently and to adapt to a fast-changing environment.
  • Creative, self-disciplined, and capable of identifying and completing critical tasks independently and with a sense of urgency.

