Senior Site Reliability Engineer

2 days ago


Hyderabad, Telangana, India Ivanti Full time US$ 1,00,000 - US$ 1,50,000 per year

Why We Need You

Site Reliability Engineering (SRE) is a growing team that partners closely with Product Engineering, Security, and Support. We are responsible for the reliability, deployment, and continuous operation of the Ivanti Cloud services.  We need your help to take our existing platform to the next level with observability, release automation, chaos engineering, and more.

The Senior SRE role is a blend of infrastructure, networking, operating systems, automation, development, and application administration. It is a hands-on technical position in a fast-paced atmosphere. The ideal candidate has prior experience managing cloud-based SaaS applications and strives to solve traditional operations problems through automation and software. More so, the candidate must possess a high standard of excellence, have a strong customer focus, and is capable of technical deep dives into code, app servers, databases, load balancers, operating systems, and networks.

What You Will Be Doing

  • Deploying, managing, and securing Ivanti's production Software-as-a-Service (SaaS) environments in AWS and Azure
  • Working with geographically dispersed, cross-departmental teams to solve difficult problems
  • Automating common and repetitive tasks
  • Write documentation and training material
  • Train other colleagues.
  • Participate in on-call rotations for 24x7 coverage (follow-the-sun model) for incident response, issue triage, and problem resolution

To Be Successful in The Role, You Will Have

  • A BSc in Computer Science, a related field, or equivalent practical experience
  • 5+ years of relevant industry experience.
  • Proficiency with Python and experience with one of the following languages:
    • Java
    • Golang
    • C#
  • Proficiency working with Bash or PowerShell programmatically
  • Familiarity with public cloud platforms (AWS or Azure preferred)
  • Experience troubleshooting Java and .NET applications
  • Experience troubleshooting network and storage infrastructure issues
  • Experience working with core Linux distributions (Debian, RHEL, SUSE, Slackware).
  • Experience working with Windows.
  • Experience working with one or more: SQL Server, PostgreSQL, Redis, Kafka, MongoDB, Elasticsearch, or similar
  • Ability to configure and fine tune at least one: HA Proxy, Apache, Nginx, IIS, or similar
  • Ability to configure: New Relic, DataDog, Splunk, or similar monitoring tools
  • Familiarity with container orchestration technologies (AWS EKS or Azure AKS preferred)
  • Experience with deployment pipeline tools such as Ansible, Jenkins, and/or GitHub Actions
  • Proficiency working and developing Infrastructure as Code (IaC)
  • A desire to adopt and implement emergent technologies and best practices
  • Strong verbal and written communication skills in English for the purposes of global collaboration

'Nice-to-haves' include:

  • Prior experience as a Site Reliability Engineer or DevOps Engineer
  • Certificates in one or more of the following categories, or demonstrated certificate-equivalent knowledge:
    • Cloud Development and architecture
    • Kubernetes Administration
    • Linux Administration
    • Software engineering disciplines
  • Experience with compliance frameworks such as SOC 2 Type 2, ISO-27001, FedRAMP, or IRAP and privacy regulations such as GDPR and PIPEDA

Roadmap for Success

90 Days:

  • Onboarding and role-training is complete
  • You're building foundational knowledge of the SRE-run product portfolio
  • You hold general knowledge of how SRE manages our SaaS environments
  • You've gotten to know the team and are building relationships with SRE peer teams

6 Months:

  • Self-sufficiency in core job functions and existing processes
  • Participating in SRE on-call rotations
  • Contributing to handling SRE tickets to fulfillment and responsible for individual SRE tasks
  • Active participation in SRE stability discussions with direct interaction with SRE peers

1 Year:

  • Contribute independently to improve reliability and compliance in our SaaS environments
  • Demonstrate ownership of SRE ticket management including triage and resolution
  • Lead one or more well-defined projects.
  • Identify areas where performance, scalability, security, and reliability can be improved in production systems and environments
  • Mentor junior team members and contribute to internal knowledge-sharing sessions.


  • Hyderabad, Telangana, India Microsoft Full time ₹ 9,00,000 - ₹ 12,00,000 per year

    The Windows Cloud division is looking for a Senior Site Reliability Engineer that will help us take the Windows Cloud platform, as well as the Windows 365 Cloud PC and Azure Virtual Desktop business to the next level.Windows 365 Cloud PC (W365) and Azure Virtual Desktop (AVD) have recently been recognized as leaders in the Gartner Magic Quadrant for Desktop...


  • Hyderabad, Telangana, India Insight Global Full time

    Join a mission-critical SCADA reliability team —now hiring Lead, Senior, and Junior Site Reliability Engineers in HITECH Hyderabad Telangana.Step into a high-impact role with cutting-edge technologies, a flexible hybrid schedule, and a growth-driven culture backed by Evergreen, the professional services division of Insight Global.Key Technologies &...


  • Hyderabad, Telangana, India Talent Worx Full time ₹ 9,00,000 - ₹ 12,00,000 per year

    Site Reliability Engineer (SRE)At Talent Worx, we are looking for a dedicated Site Reliability Engineer (SRE) to join our team. This role involves maintaining high availability and reliability of our services through the application of software engineering practices and systems administration skills. The ideal candidate will bridge the gap between...


  • Hyderabad, Telangana, India Talent Worx Full time

    Talent Worx is seeking a talented SRE (Site Reliability Engineer) to enhance our technology team. In this role, you will be pivotal in ensuring the reliability, performance, and availability of our applications and services.Your work will involve both software engineering and systems operations as you strive to improve customer experiences and operational...


  • Hyderabad, Telangana, India Chase Bank Full time

    Job DescriptionElevate your engineering prowess to unprecedented levels by joining a team of exceptionally gifted professionals and position yourself among the top echelon in site reliability.As a Principal Site Reliability Engineer at JPMorgan Chase within the Consumer & Community Banking, youwork with your fellow stakeholders to define non-functional...


  • Hyderabad, Telangana, India IntraEdge Full time

    Site Reliability EngineerExperience: 7+ YearsLocation: HyderabadHybrid 4-day office and 1 Day remoteSkills for Principal:Strong leadership and people management skills.Exceptional technical proficiency in Pearson's technology stack.Advanced project management capabilities.Excellent communication and collaboration skills.Adept at risk assessment and crisis...


  • Hyderabad, Telangana, India Cubic Corporation Full time US$ 1,50,000 - US$ 2,00,000 per year

    Business Unit:Cubic Transportation SystemsCompany Details:When you join Cubic, you become part of a company that creates and delivers technology solutions in transportation to make people's lives easier by simplifying their daily journeys, and defense capabilities to help promote mission success and safety for those who serve their nation. Led by our...


  • Hyderabad, Telangana, India Cubic Corporation Full time ₹ 15,00,000 - ₹ 20,00,000 per year

    Business Unit:Cubic Transportation SystemsCompany Details:When you join Cubic, you become part of a company that creates and delivers technology solutions in transportation to make people's lives easier by simplifying their daily journeys, and defense capabilities to help promote mission success and safety for those who serve their nation. Led by our...


  • Hyderabad, Telangana, India IntraEdge Full time

    Position - SRE (Site Reliability Engineer)Experience - 5+ YearsLocation - HyderabadSkills for Principal:Strong leadership and people management skills.Exceptional technical proficiency in Pearson's technology stack.Advanced project management capabilities.Excellent communication and collaboration skills.Adept at risk assessment and crisis...


  • Hyderabad, Telangana, India IntraEdge Full time

    Site Reliability EngineerExperience: 7+ YearsLocation: HyderabadHybrid 4-day office and 1 Day remoteSkills for Principal:- Strong leadership and people management skills.- Exceptional technical proficiency in Pearson's technology stack.- Advanced project management capabilities.- Excellent communication and collaboration skills.- Adept at risk assessment and...