Senior Site Reliability Engineer

1 month ago


bangalore, India Nexthink Full time

Company Description

Hi, we’re Nexthink. We’re not just the leader in the digital employee experience category, we invented the category. Our solutions combine real-time analytics, automation and employee feedback across all endpoints to help IT teams delight people at work. Our cloud-native platform pinpoints issues and solutions, automates response, and helps companies continuously improve their employees’ experience, making them more productive, efficient, and happy at work. We have millions of endpoints deployed, we’ve surpassed $100M in ARR, and we’ve recently secured $180M in Series D financing for a company valuation of $1.1B, but we’re just getting started.

Job Description

· Manage and maintain our Kubernetes clusters, including deployment, configuration, and upgrades. Ensure the stability and scalability of the clusters to accommodate increasing demands.

· Utilize your hands-on knowledge to automate routine tasks and streamline operations. Implement infrastructure as code (IaC) practices to facilitate rapid and reliable deployments, ensuring efficient resource provisioning and management.

· Participate in an on-call rotation, providing prompt responses and resolution to critical incidents. Your commitment to keeping the cloud infrastructure up and running will be crucial to maintaining high availability.

· Continuously assess the performance of our cloud infrastructure (AWS) and applications. Implement optimizations to enhance system efficiency and reduce response times.

 · Stay up to date with best practices, tools, and market trends. Evaluate and recommend innovative solutions to be applied in the company.

· Participate in incident handling

· Work closely with cloud architects and team’s technical lead to validate new system architecture proposals to support new features in the cloud

· Proactively identify potential issues and troubleshoot system anomalies. Collaborate with other teams to address incidents and implement preventive measures to reduce downtime.

· Set up and maintain comprehensive monitoring and alerting systems to detect anomalies, capacity constraints, and potential performance bottlenecks. Ensure timely responses to alerts and alarms.

· Maintain accurate and up-to-date documentation of processes, procedures, and troubleshooting guides to facilitate knowledge sharing and standardization.

Qualifications

· Bachelor’s degree in Computer Science, Computer Engineering, or related field, or 7+ years relevant work experience.

· Strong hands-on experience in managing Kubernetes clusters in a production environment.

· Excellent communication skills and teamwork

· Knowledge in config automation (Ansible), CI/CD (Jenkins), IaC (Terraform, Crossplane) for infrastructure management. Also proficient in at least one scripting language (bash, python)

· Extensive experience in Linux container technologies (e.g., Docker, LXC)

· Good knowledge of Linux, especially Debian and CentOS,

· Familiar with source code management solutions (GitHub, Bitbucket) and the Atlassian suite (JIRA, Confluence)

· Experience working in an on-call rotation environment and running operations.

· Proven problem-solving skills and the ability to troubleshoot complex technical issues.

· Deep commitment to maintaining high system reliability and availability.

· Extensive experience with AWS cloud computing platform and related services.

· Intense motivation/curiosity to learn new things and discover new technologies,

· Be able to work autonomously

· Knowledge of monitoring systems (e.g., ELK, Prometheus, Kibana, New relic, Datadog, Pagerduty)

· Speak professional-level English.

#LI-Hybrid

Additional Information

We are 900+ employees strong in 21 countries across 8 different time zones speaking 60+ languages. We are positive, we get things done, we keep growing, and we are one team, we are Nexthink. We believe actions are stronger than words when it comes to diversity, inclusion, and equity in the workplace. Nexthinkers are multinational and multilingual, and come from all walks of life. We are committed to hiring a genuinely representative workforce that can create solutions and foster innovation for the modern digital employee experience.

If you are looking for a change and like a nice atmosphere, lots of challenges, and having fun while working, this is a great opportunity for you



  • bangalore, India Ultrabot Innovations Full time

    Position Overview :As a Senior Site Reliability Engineer with 5-8 years of experience, you will play a key role in ensuring the reliability, scalability, and performance of our systems and infrastructure. You will leverage your expertise in Site Reliability Engineering (SRE) to implement best practices and methodologies, effectively troubleshoot complex...


  • Bangalore, Karnataka, India Ultrabot Innovations Full time

    Position Overview :As a Senior Site Reliability Engineer with 5-8 years of experience, you will play a key role in ensuring the reliability, scalability, and performance of our systems and infrastructure. You will leverage your expertise in Site Reliability Engineering (SRE) to implement best practices and methodologies, effectively troubleshoot complex...


  • Bangalore, Karnataka, India Ultrabot Innovations Full time

    Position Overview :As a Senior Site Reliability Engineer with 5-8 years of experience, you will play a key role in ensuring the reliability, scalability, and performance of our systems and infrastructure. You will leverage your expertise in Site Reliability Engineering (SRE) to implement best practices and methodologies, effectively troubleshoot complex...


  • Bangalore, India Ultrabot Innovations Full time

    Position Overview :As a Senior Site Reliability Engineer with 5-8 years of experience, you will play a key role in ensuring the reliability, scalability, and performance of our systems and infrastructure. You will leverage your expertise in Site Reliability Engineering (SRE) to implement best practices and methodologies, effectively troubleshoot complex...


  • Bangalore, India Ultrabot Innovations Full time

    Position Overview :As a Senior Site Reliability Engineer with 5-8 years of experience, you will play a key role in ensuring the reliability, scalability, and performance of our systems and infrastructure. You will leverage your expertise in Site Reliability Engineering (SRE) to implement best practices and methodologies, effectively troubleshoot complex...


  • bangalore, India Oracle Full time

    Title: Senior Site Reliability Engineering Job Description :  Building off our Cloud momentum, Oracle has formed a new organization - Oracle Health Applications & Infrastructure. This team will focus on product development and product strategy for Oracle Health while building out a complete platform supporting modernized, automated healthcare....


  • bangalore, India Oracle Full time

    Title: Senior Site Reliability Engineering Job Description :  Building off our Cloud momentum, Oracle has formed a new organization - Oracle Health Applications & Infrastructure. This team will focus on product development and product strategy for Oracle Health while building out a complete platform supporting modernized, automated healthcare....


  • Bangalore, Karnataka, India SWAI TECHNOLOGIES PRIVATE LIMITED Full time

    Role : Senior Site reliability Engineer Exp : 5 to 10 Years of experience Remote Opportunity Company Description :Tech recruitment is broken Companies say there is a shortage of talent and it's hard to find good developers, while developers find it hard to find companies that value the skill, experience and passion they bring to the table.Quite the...


  • Bangalore, India SWAI TECHNOLOGIES PRIVATE LIMITED Full time

    Role : Senior Site reliability Engineer Exp : 5 to 10 Years of experience Remote Opportunity Company Description : Tech recruitment is broken Companies say there is a shortage of talent and it's hard to find good developers, while developers find it hard to find companies that value the skill, experience and passion they bring to the table.Quite...


  • Bangalore, Karnataka, India SWAI TECHNOLOGIES PRIVATE LIMITED Full time

    Role : Senior Site reliability Engineer Exp : 5 to 10 Years of experience Remote Opportunity Company Description :Tech recruitment is broken Companies say there is a shortage of talent and it's hard to find good developers, while developers find it hard to find companies that value the skill, experience and passion they bring to the table.Quite the...


  • Bangalore, India SWAI TECHNOLOGIES PRIVATE LIMITED Full time

    Role : Senior Site reliability Engineer Exp : 5 to 10 Years of experience Remote Opportunity Company Description : Tech recruitment is broken Companies say there is a shortage of talent and it's hard to find good developers, while developers find it hard to find companies that value the skill, experience and passion they bring to the table.Quite...


  • bangalore, India SWAI TECHNOLOGIES PRIVATE LIMITED Full time

    Role : Senior Site reliability Engineer Exp : 5 to 10 Years of experience Remote Opportunity Company Description : Tech recruitment is broken Companies say there is a shortage of talent and it's hard to find good developers, while developers find it hard to find companies that value the skill, experience and passion they bring to the table.Quite the...


  • Bangalore, India Squareroot Consulting Pvt Ltd. Full time

    Job Title : Senior Site Reliability Engineer (SRE)Location : Bangalore (Hybrid)Company Overview :We are Hiring for a dynamic and innovative FinTech company committed to delivering cutting-edge solutions to their clients. As part of our growth strategy, we are seeking a talented and experienced Hands-On Site Reliability Engineer (SRE) to join our...


  • bangalore, India Oracle Full time

    Building off our Cloud momentum, Oracle has formed a new organization - Oracle Health Applications & Infrastructure. This team will focus on product development and product strategy for Oracle Health while building out a complete platform supporting modernized, automated healthcare. This is a net new line of business, constructed with an entrepreneurial...


  • bangalore, India Laerdal Bangalore Full time

    As a Senior Site Reliability Engineer, you’ll play a pivotal role in ensuring the reliability and performance of our cloud-based applications and solutions. Collaborating closely with our team, you will cultivate a culture of SRE breaking down silos and managing incidents and problems. Your role will involve developing and implementing innovative solutions...


  • bangalore, India Mimecast Full time

    Site Reliability Engineers - Senior & Principal (Hybrid)   We are recruiting for a number of Site Reliability Engineers to work cross-functionally on the latest cloud infrastructure and platforms to build services providing security for collaboration suites in Bangalore, India.  We’re expanding our global footprint and Bangalore offers a clear...


  • bangalore, India Mimecast Full time

    Site Reliability Engineers - Senior & Principal (Hybrid)   We are recruiting for a number of Site Reliability Engineers to work cross-functionally on the latest cloud infrastructure and platforms to build services providing security for collaboration suites in Bangalore, India.  We’re expanding our global footprint and Bangalore offers a clear...


  • bangalore, India Mimecast Full time

    Senior Devops/Site Reliability Engineer (Cloud and Containerization) – Platform Devops Team   The driving force behind Platform Devops Team at Mimecast Dive into Platform DevOps team to drive efficiency and excellence across our platforms. Our team collaborates with engineering teams to expedite end-to-end delivery lifecycles and streamline workload...


  • Bangalore/Hyderabad, India Nilasu consulting Full time

    Job Title : Senior Site Reliability Engineer (SRE)Department : Cloud EngineeringJob Type : Full-timeJob Description:We are seeking a highly skilled Senior Site Reliability Engineer (SRE) with extensive experience in Cloud Engineering, particularly in AWS. The ideal candidate should have hands-on expertise in developing Cloud solutions using Terraform or...


  • Bangalore/Hyderabad, India Nilasu consulting Full time

    Job Title : Senior Site Reliability Engineer (SRE)Department : Cloud EngineeringJob Type : Full-timeJob Description:We are seeking a highly skilled Senior Site Reliability Engineer (SRE) with extensive experience in Cloud Engineering, particularly in AWS. The ideal candidate should have hands-on expertise in developing Cloud solutions using Terraform or...