Site Reliability Engineer

4 weeks ago


Hyderabad, Telangana, India 5100 Kyndryl Solutions Private Limited Full time

About the Role

We are seeking a highly skilled Site Reliability Engineer to join our team at 5100 Kyndryl Solutions Private Limited. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, resiliency, and innovation of our information systems and ecosystems.

Key Responsibilities

  • Develop and maintain key reliability metrics such as Service Level Indicators (SLIs), Service Level Objectives (SLOs), and Service Level Agreements (SLAs) in line with business goals.
  • Continuously assess and improve the reliability of our systems, ensuring they meet defined metrics.
  • Deliver reliable and scalable solutions by following SRE principles such as automation, infrastructure as code (IaC), and proactive cost monitoring and failure detection.
  • Collaborate with engineering and operations teams to ensure the alignment of architecture and design with reliability goals.
  • Incorporate security best practices into all aspects of SRE work, ensuring systems are resilient against vulnerabilities and threats.
  • Establish and document best practices for system monitoring, backup, restore, disaster recovery (DR), and resiliency processes.
  • Ensure comprehensive coverage of monitoring, including vulnerability checks, application performance, infrastructure health, and user experience.
  • Own Incident Management Process and responsible for Continuous Improvement.

About You

We are looking for a highly motivated and experienced Site Reliability Engineer who possesses a strong background in systems engineering, programming, and cloud computing. You should have a minimum of 6 years of experience in Site Reliability Engineering and a deep understanding of SRE concepts such as Service Level Indicators and Service Level Objectives.

Requirements

  • Total experience of 6 to 9 years as Site Reliability Engineer.
  • At least 6 years of professional experience in Systems engineering.
  • Minimum 4 years of experience in programming in Shell script, JavaScript, Python, Amazon Web Services cloud components, including Compute, Network, and Storage (IaaS and PaaS) and services.
  • Minimum 4 years of experience in delivering infrastructure as code with CloudFormation and other frameworks like SAM, Terraform, etc.
  • Minimum 4 years of experience in supporting production applications, release management, and providing escalated on-call support.
  • Experience in SRE best practices, Application Performance Monitoring, Deep understanding of SRE concepts such as Service Level Indicators and Service Level Objectives, Change management best practices, Version Control Systems - Git, Trunk-based development & AWS Systems.
  • Familiarity with containerization, especially Kubernetes.

Preferred Requirements

  • Bachelor's Degree or equivalent practical experience.
  • AWS Certified DevOps - Professional.
  • AWS Certified Solutions Architect – Associate.
  • Hands-on experience with Observability Platforms.
  • Experience in Automation Testing Frameworks like Selenium or Playwright, DevOps, SecOps, AWS Cloud Development Kit (CDK), AWS Management, Security, Scalability, Reliability, and Cost Optimization.

About Kyndryl

Kyndryl has a global footprint, which means that as a Site Reliability Engineer at Kyndryl, you will have opportunities to work on projects and collaborate with colleagues from around the world. This role is dynamic and influential – offering a wide range of professional and personal growth opportunities that you won't find anywhere else.

What We Offer

With state-of-the-art resources and Fortune 100 clients, every day is an opportunity to innovate, build new capabilities, new relationships, new processes, and new value. Kyndryl cares about your well-being and prides itself on offering benefits that give you choice, reflect the diversity of our employees, and support you and your family through the moments that matter – wherever you are in your life journey. Our employee learning programs give you access to the best learning in the industry to receive certifications, including Microsoft, Google, Amazon, Skillsoft, and many more. Through our company-wide volunteering and giving platform, you can donate, start fundraisers, volunteer, and search over 2 million non-profit organizations. At Kyndryl, we invest heavily in you, we want you to succeed so that together, we will all succeed.



  • Hyderabad, Telangana, India SID Global Solutions Full time

    Site Reliability EngineerAt SID Global Solutions, we are seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and performance of our cloud-based systems.Key Responsibilities:Design, implement, and maintain scalable and highly available cloud...


  • Hyderabad, Telangana, India RealPage, Inc. Full time

    Job SummaryRealPage, Inc. is seeking a highly skilled Site Reliability Engineer to join our SRE & Systems team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and scalability of our multiple open-source application environments.Key ResponsibilitiesProvision, de-provision, and support multiple open-source application...


  • Hyderabad, Telangana, India Experian Full time

    Job Title: Site Reliability EngineerJob Summary:Experian is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, performance, and scalability of our AWS platform.Key Responsibilities:Optimize microservice and serverless processes on robust distributed...


  • Hyderabad, Telangana, India Live Connections Full time

    We are looking for Manager Site Reliability Engineer in Hyderabad locationRoles and Responsibilities :Position will manage 5 to 10 engineers both directly and indirectly. The engineers will include Site Reliability Engineers, Observability Engineers, Performance Engineers, DevSecOps Engineers, and others These individuals will vary from entry level to senior...


  • Hyderabad, Telangana, India FactSet Full time

    Job SummaryWe are seeking a skilled Site Reliability Engineer to join our team at FactSet. The ideal candidate will have a strong background in designing, implementing, and maintaining highly available and scalable architectures for our applications and infrastructure.Key ResponsibilitiesCollaborate with cross-functional teams to define, review, and...


  • Hyderabad, Telangana, India Virtusa Full time

    Job SummaryVirtusa is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for designing, building, and maintaining reliable and scalable infrastructure solutions to support our applications and services.Key ResponsibilitiesDesign and implement robust monitoring and alerting systems to...


  • Hyderabad, Telangana, India Virtusa Full time

    Job SummaryVirtusa is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for designing, building, and maintaining reliable and scalable infrastructure solutions to support our applications and services.Key ResponsibilitiesDesign and implement robust monitoring and alerting systems to...


  • Hyderabad, Telangana, India RiskInsight Consulting Pvt Ltd Full time

    Job Title: Site Reliability EngineerWe are seeking a highly skilled Site Reliability Engineer to join our team at RiskInsight Consulting Pvt Ltd. As a Site Reliability Engineer, you will be responsible for ensuring the smooth operation of our banking applications and infrastructure.Key Responsibilities:Manage a 24/7 production support team in the Banking...


  • Hyderabad, Telangana, India Tata Consultancy Services Full time

    About the RoleTata Consultancy Services is a global leader in the technology arena, and we're looking for talented individuals to join our team. As a Site Reliability Engineer, you'll play a crucial role in ensuring the stability and performance of our applications.Key ResponsibilitiesDesign, develop, and test Java applications using standard frameworks and...


  • Hyderabad, Telangana, India Experian Full time

    Job Title: Cloud and Site Reliability EngineerJob Summary:We are seeking a highly skilled Cloud and Site Reliability Engineer to join our team at Experian. As a Cloud and Site Reliability Engineer, you will be responsible for designing, implementing, and maintaining our cloud infrastructure, ensuring high availability, scalability, and security.Key...


  • Hyderabad, Telangana, India Microsoft Full time

    About the RoleWe are seeking a talented Senior Site Reliability Engineer to join our Cloud Infrastructure Health team at Microsoft. As a key member of our team, you will be responsible for designing, developing, and delivering software solutions that reduce operational burden and improve the reliability of our cloud infrastructure.ResponsibilitiesDesign and...


  • Hyderabad, Telangana, India Thomson Reuters Full time

    About the RoleIn this opportunity as Site Reliability Engineer, you will be responsible for overseeing the operational aspects of cloud-based systems, ensuring their efficiency, reliability, and scalability. Key responsibilities include managing change and problem management, application and configuration management, and production support of strategic...


  • Hyderabad, Telangana, India Zenoti Full time

    Join Zenoti's Tech Team:Zenoti, a leading cloud-based software solution for the beauty and wellness industry, is seeking an experienced Site Reliability Engineering Manager to join our team. As a key member of our technology team, you will be responsible for ensuring the reliability and scalability of our cloud-based platform.Your Key...


  • Hyderabad, Telangana, India Virtusa Full time

    Job Summary: We are seeking a skilled Site Reliability Engineer to join our team at Virtusa. In this role, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based systems.Responsibilities:Troubleshoot recurring failures and participate in incident triages to minimize downtime and ensure system...


  • Hyderabad, Telangana, India Zenoti Full time

    About ZenotiZenoti is a leading provider of cloud-based software solutions for the beauty and wellness industry. Our comprehensive platform enables businesses to manage every aspect of their operations, from online appointment bookings to inventory management and marketing programs.With a presence in over 50 countries and a portfolio of global brands, Zenoti...


  • Hyderabad, Telangana, India Thomson Reuters Full time

    About the RoleAs an AWS Site Reliability Engineer, you will work with application teams to manage and support applications into production. This includes continuous improvement to an ongoing support model, including release and change management for maintaining strategic environments. You will provide well-written documentation and technical presentations on...


  • Hyderabad, Telangana, India Zenoti Full time

    Zenoti is seeking a seasoned Site Reliability Engineering Manager to join our team. As a key member of our engineering organization, you will be responsible for leading the adoption of DevOps practices and architecture across various services in the company.The ideal candidate will be a self-starter with a zeal to own things from start to end with little...


  • Hyderabad, Telangana, India RiskInsight Consulting Pvt Ltd Full time

    Job SummaryAt RiskInsight Consulting Pvt Ltd, we are seeking a skilled Site Reliability Engineer to join our team. As the ideal candidate, you will be responsible for managing our 24/7 production support team and ensuring seamless operation of our banking and credit card applications.Key ResponsibilitiesManage a team of L2, L2, and L3 support engineers in...


  • Hyderabad, Telangana, India UnitedHealth Group Full time

    At UnitedHealth Group, we're committed to helping people live healthier lives and making the health system work better for everyone. As a Senior Site Reliability Engineer, you'll play a critical role in ensuring the reliability and performance of our cloud-based systems. Your expertise will help us deliver high-quality care to millions of people around the...


  • Hyderabad, Telangana, India Unison Consulting Pte Ltd Full time

    Job Title: Site Reliability Engineer - Cloud ExpertAbout the Role:We are seeking a highly skilled Site Reliability Engineer to join our team at Unison Consulting Pte Ltd. As a Site Reliability Engineer, you will be responsible for ensuring the high availability and performance of our cloud-based applications.Key Responsibilities:Support Java (J2EE/Spring...