Lead Site Reliability Engineer

4 weeks ago


Delhi, India HCLSoftware Full time

About the job
Greetings from HCL Software, the product development division of HCL Tech.

HCL Software operates HCL Tech’s primary software business. At HCL Software we develop, market, sell, and support over 20 product families in the areas of Customer Experience, Digital Solutions, Secure DevOps, and Security & Automation.

About BigFix Product:
HCL BigFix is the only Endpoint Management Platform that enables IT Operations and Security teams to fully automate discovery, management, and remediation – whether it’s on-premises, virtual, or cloud – regardless of operating system, location, or connectivity. BigFix can find and fix endpoints faster than any other solution.
Our strengths come from our solid background in SW development practices like Agile methodologies and Design Thinking. We are focused on innovation and new technologies, and we continue to grow year on year.

Job Description: What we are looking for:
HCL BigFix is looking for a Site Reliability Engineer to work on infrastructure for a new product that will help keep our customers’ endpoints secure. You will be part of a team that leverages modern technological solutions to drive growth and efficiency. Your daily responsibilities will center on HCL BigFix’s cloud infrastructure, with tasks aimed at improving scalability, reliability, and observability.
The ideal candidate will have a strong background in software engineering and systems administration, proficiency in modern infrastructure tools (e.g., Kubernetes, Docker, AWS/GCP/Azure), and a passion for designing, implementing, and maintaining reliable and scalable systems. This role involves on-call duties.
What You Bring:
BS in Computer Science or a related technical field, or proof of exceptional skills in related fields, with practical software engineering experience.
Expert knowledge of cloud operating system internals, filesystems, disk/storage technologies, storage protocols, and the networking stack.
3+ years of experience managing services in distributed systems.
3+ years of experience with common containerization tools, such as Kubernetes or Docker.
Expert knowledge of at least one higher-level language such as Python or Go.
Experience leading troubleshooting and full-cycle incident response, including mitigation, correction, and prevention.
Expert knowledge of CI/CD tools such as Jenkins or GitHub Actions.
What You Do:
Collaborate with development and operations teams to design, implement, and maintain scalable and reliable infrastructure solutions.
Implement and manage monitoring, alerting, and logging systems to ensure proactive identification and resolution of issues.
Work on the automation of infrastructure provisioning.
Perform regular system and application performance analysis, tuning, and capacity planning.
Ensure cost efficiency and efficacy of complex, multi-cloud products and tackle ongoing cost minimization efforts.
Ensure the availability of new and existing developer tools.
Drive the migration of large-scale, distributed diagnostics applications towards cloud-native microservices.
Analyse and plan for capacity management and lead infrastructure change management for cloud-based services.
Work with SWE counterparts to identify and mitigate production issues.
Document and implement failover/disaster recovery plans.
Participate in code reviews and contribute to technical architecture documents.
Participate in team on-call rotations.



  • Delhi, India SID Global Solutions Full time

    Dear Candidates, we are looking for immediate joiners with 8 to 9 years of experience, for the Hyderabad location, for a talented Site Reliability Engineer-Manager to join our dynamic team and contribute to the development of our cutting-edge web applications. If you're passionate about the role and have experience in SRE, GCP, and Kubernetes, send me your updated CV: Please...


  • Delhi, India IDFC FIRST Bank Full time

    Role/Job Title: Site Reliability Engineering Lead. Function/Department: Information Technology. Job Purpose: The Site Reliability Engineering (SRE) department plays a pivotal role in providing a seamless experience for our customers. With state-of-the-art technology and tools, we are transforming the overall application development and maintenance lifecycle. If you hate...


  • Delhi, India Cricbuzz.com Full time

    Site Reliability Engineer. We are looking for a highly skilled and motivated Web Server Site Reliability Engineer to join our team. As a Web Server Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our web server infrastructure and CDN services. Experience: 3-5 years. Responsibilities: ● Design,...


  • Delhi, India Quiktrak, LLC Full time

    Job Title: Azure Site Reliability Engineer (SRE) / DevOps Engineer. Job Description: Summary: As an Azure Site Reliability Engineer (SRE) / DevOps Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud infrastructure on the Azure platform. This role involves managing deployments, implementing continuous...


  • Delhi, India WaferWire Cloud Technologies Full time

    Role: SRE (Site Reliability Engineer). Experience: 4+ years. About WaferWire Cloud Technologies: WaferWire Cloud Technologies is a leading provider of innovative cloud solutions aimed at transforming businesses and driving digital growth. With a focus on cutting-edge technology and customer-centric approaches, we empower organizations to thrive in the digital...


  • Delhi, India ViewSonic Full time

    Job Requirements: Bachelor’s degree in Computer Science, Engineering, or a related field. 3+ years of experience as a Site Reliability Engineer, DevOps Engineer, or similar role. Proficient in AWS solutions including but not limited to EC2, S3, CloudWatch, Lambda, and RDS. Strong understanding of Platform Engineering concepts and principles. Experience with...


  • Delhi, India Korn Ferry Full time

    Role: Site Reliability Engineer. Experience: 5+ years required. Location: Hyderabad (work from office, hybrid). Shift timings: 5 AM to 1 PM IST. We are looking for a Site Reliability Engineer with a strong development background to join our team. In this role, you will be responsible for ensuring the reliability and performance of our systems. You will work closely with our...


  • Delhi, India Daxko Full time

    Company Description: Daxko powers health & wellness throughout the world. Every day our team members focus their passion and expertise in helping health & wellness facilities operate efficiently and engage their members. Whether a neighborhood yoga studio, a national franchise with locations in every city, a YMCA or JCC--and every type of organization in...


  • Delhi, India SID Global Solutions Full time

    Dear Candidates, we are looking for immediate joiners with 8 to 9 years of experience, for the Hyderabad location, for a talented Site Reliability Engineer-Manager to join our dynamic team and contribute to the development of our cutting-edge web applications. If you're passionate about the role and have experience in SRE, GCP, and Kubernetes, send me your updated CV: find below the...


  • Delhi, India SLK Full time

    **Immediate Joiners only** We are hiring a Senior Engineer with expertise in Observability and Site Reliability Engineering (SRE), emphasizing strong experience with Prometheus, Grafana, ELK Stack, Jaeger, and OpenTelemetry (3+ years of experience). If you meet these qualifications and are passionate about driving innovative solutions, we encourage you to...


  • Delhi, India System Soft Technologies Full time

    Title: Site Reliability Engineer (100% remote). The Site Reliability Engineer (SRE) is a technician who utilizes an array of skills to enhance reliability in critical customer-facing digital assets. The SRE is responsible for maintaining the availability and performance of relevant systems through supporting, building, and enhancing applications, tools and...


  • Delhi, India CloudBees Full time

    Job Title: Manager, Site Reliability Engineer. Location: Bangalore and Chennai. Years of Experience: 10+ years. About CloudBees: CloudBees is the leading software delivery platform that enables enterprises to deliver scalable, compliant, and secure software, empowering developers to do their best work. Seamlessly integrating into any hybrid and heterogeneous...


  • Delhi, India Career Stone Consultant Full time

    PRINCIPAL ACCOUNTABILITIES: 1. AWS Infrastructure Design: Lead the design and implementation of scalable, reliable, and secure AWS infrastructure. Provide expertise in architecting solutions that maximize the benefits of AWS services. Lead the upgrade of Apache web servers for improved performance and security. Oversee the database (DB) upgrade process,...


  • New Delhi, India dentsu Full time

    The purpose of this role is to ensure the availability and stability of production and test platforms. Job Title: Site Reliability Engineer. Job Description: Key responsibilities: Troubleshoots and owns issues in our development, test, and production environments, including performance optimisation and continuous tuning. Works alongside the DevOps team in...


  • Delhi, India Akamai Full time

    Do you like collaborating across teams to solve complex problems? Do you enjoy solving large-scale distributed content delivery challenges? Join our highly skilled Site Reliability team. Our team designs, develops, and manages applications and infrastructure that support Akamai's Compute products and services. We do this while maintaining Akamai's mission at the...


  • Delhi, India Infogain Full time

    You can send your applications on This job is available at multiple locations in India like Mumbai, Pune, Bangalore, Noida & Gurgaon. Title: "SRE developers responsible for Design and implementation details reviewed/approved by SRE / Reliability Engineer (Lead): An SRE/Reliability Engineer at a Lead level is responsible for maintaining the reliability,...


  • Delhi, India System Soft Technologies Full time

    Job Summary: The Site Reliability Engineer (SRE) is a technician who utilizes an array of skills to enhance reliability in critical customer-facing digital assets. The SRE is responsible for maintaining the availability and performance of relevant systems through supporting, building, and enhancing applications, tools and engaging with infrastructure teams....


  • Delhi, India Next-Link Full time

    Job Description: Senior Site Reliability Engineer. Desirable Skills: Experience with additional programming languages and technologies beyond Python and Ruby. Familiarity with cloud platforms such as AWS, Azure, or GCP. Proficiency in additional logging and monitoring tools. Experience with other Infrastructure as Code (IaC) tools and practices. Knowledge of...