SRE Lead
2 weeks ago
Company Description
Nexthink is the leader in digital employee experience management software. The company provides IT leaders with unprecedented insight allowing them to see, diagnose and fix issues at scale impacting employees anywhere, with any application or network, before employees notice the issue. As the first solution to allow IT to progress from reactive problem solving to proactive optimization, Nexthink enables its more than 1,200 customers to provide better digital experiences to more than 15 million employees. Dual headquartered in Lausanne, Switzerland and Boston, Massachusetts, Nexthink has 9 offices worldwide.
#LI-Hybrid
Job Description
Nexthink is looking for a Lead Site Reliability Engineer who is passionate about building and running a high-performance cloud platform and enabling best-in-class site reliability and operations practices. This role will support Nexthink operations globally. The candidate will drive the development of modern, cloud-native SRE processes and the management and operations for Nexthink's multi-tenant, microservices-based cloud platform. The platform has multiple instances deployed across the globe.
This role involves working closely with cross-functional teams to integrate reliability and security into our systems, ensuring they meet standards. The ideal candidate will have extensive experience in both software engineering and systems administration, with a strong understanding of SRE concepts, requirements and security practices.
Leadership and Team Management:
- Lead, mentor, and develop a team of India-based Site Reliability Engineers.
- Foster a culture of continuous improvement, collaboration, and innovation.
Infrastructure Management:
- Oversee the design, deployment, and management of scalable and secure cloud infrastructure.
- Drive automation of infrastructure provisioning, configuration, and management using Infrastructure as Code (IaC) tools.
Monitoring and Performance:
- Develop and maintain comprehensive monitoring, logging, and alerting systems to ensure high availability and performance.
- Lead efforts in performance tuning and optimization for applications and infrastructure.
Security and Compliance:
- Ensure implementation and maintenance of security controls and best practices to achieve compliance with standards and certifications.
- Conduct and oversee regular security assessments, vulnerability scans, and penetration testing.
- Collaborate with the compliance team to prepare for and respond to audits.
Incident Management:
- Lead incident management efforts, ensuring rapid resolution and thorough root cause analysis.
- Develop and implement strategies for improving incident response and minimizing downtime.
Collaboration and Communication:
- Work closely with development, operations, and security teams to integrate reliability and security into the software development lifecycle.
- Communicate effectively with stakeholders, providing regular updates on system performance, reliability, and compliance status.
Qualifications
- Bachelor's degree in Computer Science, Engineering, or a related field (or equivalent experience).
- 5 years of experience in site reliability engineering, DevOps, or a related role, with at least 2 years in a leadership position.
- Proficiency in cloud platforms (AWS, Azure, GCP) and cloud-native services.
- Strong scripting and programming skills (Python, Bash, Go, or similar).
- Experience with Infrastructure as Code (IaC) tools such as Terraform, CrossPlane, CloudFormation, or Ansible.
- Knowledge of containerization and orchestration (Docker, Kubernetes).
- Familiarity with CI/CD pipelines and tools (Jenkins, GitLab, GitHub, etc.).
- In-depth knowledge of standards (ISO, SOC2...) requirements and best practices.
- Experience with security tools and practices (SIEM, IDS/IPS, firewalls).
- Understanding of network security, encryption, and secure software development practices.
- Ability to collaborate with and foster effective communication with global and multicultural engineering teams in EU and US timezones.
- Ability to report timely and effectively to the upper engineering management.
#LI-Hybrid
Additional Information
We are the pioneers and trailblazers of a global IT Market Category (DEX) that is shaping the future of how the world works, giving our customers' IT Teams total digital visibility across their enterprise. Our innovative solutions integrate real-time analytics, automation, and employee feedback across all endpoints. This enables our IT teams to solve complex technical challenges, create ever more productive workplaces, and deliver happy, satisfied employees in the digital workplace.
With over 1000 employees across 5 continents, Nexthink operates as One Team, connecting, collaborating and innovating to continuously grow. We call our employees 'Nexthinkers' and our commitment to diversity, inclusion, and equity is second to none. We currently have over 75 nationalities working with us, from all cultures and backgrounds, speaking many different languages.
If you are looking for a change and like a nice atmosphere, lots of challenges, and having fun while working, this is a great opportunity for you Check what we offer:
- Permanent Contract and a competitive compensation package (including stock options).
- Hybrid work model balancing office and remote work, with a structured approach for new hires to foster connections and onboarding.
- Flexible Hours and unlimited vacation (employees have unlimited paid time off on top of the 22 days of holidays we offer) plus 3 company-paid volunteer days.
- Fresh fruit, cookies, and soft drinks as well.
- Regular company and team events like Voluntary Days, Pizza talks, Team Building activities, hosting Meetups at the office and more
- Bonuses for referring successful hires after three months of continuous employment.
Please note that not all the benefits listed above are available for temporary, contract, and internship roles. To ensure you have the most up-to-date information, we recommend checking with your Recruitment Partner.
-
SRE Lead
2 days ago
Bengaluru, Karnataka, India WOW Softech Full time ₹ 20,00,000 - ₹ 25,00,000 per yearAbout the RoleWe are looking for an experienced Site Reliability Engineering (SRE) Lead with strong hands-on experience in Java, Kubernetes, and modern cloud-native infrastructure. You will lead reliability initiatives across our platform, build automation, optimize performance, and work closely with development teams to ensure scalable, highly available,...
-
SRE Lead
2 weeks ago
Bengaluru, Karnataka, India Nexthink Full time ₹ 20,00,000 - ₹ 25,00,000 per yearNexthink is looking for a Lead Site Reliability Engineer who is passionate about building and running a high-performance cloud platform and enabling best-in-class site reliability and operations practices. This role will support Nexthink operations globally. The candidate will drive the development of modern, cloud-native SRE processes and the management and...
-
Chief SRE
2 days ago
Bengaluru, Karnataka, India Credence HR Services Full time ₹ 20,00,000 - ₹ 25,00,000 per yearJob Title:Chief SRE(IC Role)Location:BengaluruYour responsibilities:As a matured Big Thinker, you'll work closely with senior leaders on the strategic development of the SRE practiceCreating, developing, installing and implementing tools required to support the operational management (including security) of software applications and systemsTesting,...
-
SRE Project Lead
1 day ago
Bengaluru, Karnataka, India Persistent Systems Full time ₹ 20,00,000 - ₹ 25,00,000 per yearAbout PositionWe are looking for an experiencedSREProject Leadto join our team. The ideal candidate will lead the initiatives to enhance the reliability, scalability, and performance of our cloud-native infrastructure across AWS and GCP. As a senior technical leader, you will collaborate with cross-functional engineering teams to define infrastructure...
-
SRE Engineer
2 days ago
Bengaluru, Karnataka, India AMERICAN EXPRESS Full time ₹ 12,00,000 - ₹ 36,00,000 per yearDescription - ExternalYou Lead the Way. We've Got Your Back.With the right backing, people and businesses have the power to progress in incredible ways. When you join Team Amex, you become part of a global and diverse community of colleagues with an unwavering commitment to back our customers, communities and each other. Here, youll learn and grow as we help...
-
Principal SRE
1 week ago
Bengaluru, Karnataka, India Palo Alto Networks Full time ₹ 20,00,000 - ₹ 25,00,000 per yearWe are looking for a highly motivated Principal DevOps / Senior SRE Engineer to join the Cortex DevOps Production group at our India Development Center (IDC). In this role, you will work side by side with the Cortex Cyber Security Research Group to build, operate, and scale the production environment for our SaaS platform, deployed across tens of thousands...
-
SRE Engineering Manager
4 days ago
Bengaluru, Karnataka, India hackajob Full time ₹ 20,00,000 - ₹ 25,00,000 per yearhackajob*is collaborating withOneAdvanced*to connect them with exceptional tech professionals for this role.Job Description: SRE Engineering Manager - DevOps & ReliabilityOverview: We are seeking a highly skilled and experienced SRE Engineering Manager to lead our Site Reliability Engineering (SRE) and DevOps teams. This leader will play a crucial role in...
-
SRE Cloud Security
2 weeks ago
Bengaluru, Karnataka, India Xebia It Architects Full time ₹ 9,00,000 - ₹ 12,00,000 per yearSRE Cloud Security & ObservabilityLocation: Bangalore (Hybrid 3 days office per week)We are looking for a Cloud Site Reliability Engineer (SRE) with strong expertise in Cloud Security and Observability to design, build, and scale resilient cloud platforms.ResponsibilitiesArchitect and optimize Terraform modules for multi-environment deployments.Drive...
-
Principal SRE
1 week ago
Bengaluru, Karnataka, India Red Hat Full time ₹ 15,00,000 - ₹ 30,00,000 per yearAbout the Job:The IT AI Application Platform team is seeking a Principal Senior Site Reliability Engineer (SRE) to design, develop, scale, and operate our AI Application Platform based on Red Hat technologies, including OpenShift AI (RHOAI) and Red Hat Enterprise Linux AI (RHEL AI). As a Principal SRE you will contribute to running core AI services at scale...
-
Senior SRE
1 week ago
Bengaluru, Karnataka, India Red Hat Full time ₹ 15,00,000 - ₹ 25,00,000 per yearThe IT AI Application Platform team is seeking a Senior Site Reliability Engineer (SRE) to develop, scale, and operate our AI Application Platform based on Red Hat technologies, including OpenShift AI (RHOAI) and Red Hat Enterprise Linux AI (RHEL AI). As an SRE you will contribute to running core AI services at scale by enabling customer self-service, making...