Lead Software Engineer, Cloud Site Reliability

2 days ago


Pune, Maharashtra, India Icertis Full time ₹ 20,00,000 - ₹ 25,00,000 per year

Job Description
About CloudOps Team:
CloudOps team is responsible for availability, reliability, performance, monitoring, emergency response, and capacity planning of Icertis SaaS applications and related services. CloudOps executes infra & access provisioning, upgrades, deployments, and change management to drive faster time to market. This team plays a critical role in building and executing the cloud strategy for the company, driving architectural improvements to enhance scalability and optimize overall cost.

Responsibilities
Role Responsibilities
:

  • Lead and execute large-scale site reliability engineering initiatives to improve performance, reliability, and scalability in Azure, AWS, and GCP environments.
  • Implement and manage Azure AKS, AWS ECS/EKS environments, Kubernetes, and Docker-based container management platforms for mission-critical applications.
  • Build automation and operational workflows using cloud-native capabilities for provisioning, scaling, monitoring, and self-healing systems.
  • Collaborate with engineering teams to design and deploy cloud-native, containerized applications with robust CI/CD pipelines.
  • Drive early detection and prevention of incidents through improved telemetry, monitoring, and automated recovery mechanisms.
  • Work closely with cloud providers to optimize offerings and leverage new features for reliability and cost efficiency.
  • Mentor and guide junior engineers in best practices for SRE, Kubernetes management, and automation.

Qualifications
Required Skills:

  • 8–12 years of experience in Cloud Operations / SRE roles in mission-critical, 24x7 SaaS environments.
  • Strong hands-on experience with Azure Kubernetes Service (AKS), AWS ECS/EKS, Kubernetes, Docker, and container lifecycle management.
  • Proficiency in creating infrastructure automation using cloud-native tools, Helm charts, ARM templates, Terraform, or similar IaC frameworks.
  • Strong understanding of cloud compute, storage, networking, and container orchestration concepts.
  • Scripting skills in PowerShell, Python, Bash, or similar languages.
  • Experience with CI/CD pipelines, monitoring solutions (Prometheus, Grafana, Azure Monitor), and log management systems.
  • Proven expertise in cloud operations, SRE/DevOps practices, automation (IaC/CI-CD), observability, and AIOps leveraging AI/ML for predictive monitoring, incident correlation, anomaly detection, and self-healing in cloud-native environments
  • Excellent problem-solving, communication, and collaboration skills.

About Us
Icertis is the global leader in AI-powered contract intelligence. The Icertis platform revolutionizes contract management, equipping customers with powerful insights and automation to grow revenue, control costs, mitigate risk, and ensure compliance - the pillars of business success. Today, more than one third of the Fortune 100 trust Icertis to realize the full intent of millions of commercial agreements in 90+ countries.

About The Team
Who we a re: Icertis is the only contract intelligence platform companies trust to keep them out in front, now and in the future. Our unwavering commitment to contract intelligence is grounded in our FORTE values—Fairness, Openness, Respect, Teamwork and Execution—which guide all our interactions with employees, customers, partners, and stakeholders. Because in our mission to be the contract intelligence platform of the world, we believe how we get there is as important as the destination.

Icertis, Inc. provides Equal Employment Opportunity to all employees and applicants for employment without regard to race, color, religion, gender identity or expression, sex, sexual orientation, national origin, age, disability, genetic information, marital status, amnesty, or status as a covered veteran in accordance with applicable federal, state and local laws. Icertis, Inc. complies with applicable state and local laws governing non-discrimination in employment in every location in which the company has facilities. If you are in need of accommodation or special assistance to navigate our website or to complete your application, please send an e-mail with your request to or get in touch with your recruiter.



  • Pune, Maharashtra, India Boomi Software Full time

    Job Description- The Software Engineering team delivers next-generation application enhancements and new products for a changing world- Working at the cutting edge, we design and develop software for platforms, peripherals, applications and diagnostics all with the most advanced technologies, tools, software engineering methodologies and the collaboration of...


  • Pune, Maharashtra, India Ather Energy Full time ₹ 6,00,000 - ₹ 18,00,000 per year

    You'll be our: Site Reliability EngineerYou'll be based at: Pune Zonal OfficeYou'll be aligned with: Cloud and Data Platform Lead / Cloud ArchitectYou'll be a member of: Cloud and Data Platform TeamAther's fleet of smart scooters is growing rapidly, and so is the volume of data they generate. Our Vehicle Data Platform (VDP) is the core of this ecosystem, and...


  • Pune, Maharashtra, India Boomi Software Full time

    Job DescriptionAs a Principal Site Reliability Engineer, you will be responsible for developing sophisticated systems and software based on the customer s business goals, needs and general business environment. You will work with product management, other engineering teams, customer success and support on developing cutting edge new product features and...


  • Pune, Maharashtra, India Apex One Full time ₹ 6,00,000 - ₹ 18,00,000 per year

    Job Overview We are looking for a detail-oriented and experienced Site Reliability Engineer to join our team. The Site Reliability Engineer will be responsible for creating and implementing scalable software solutions in order to meet system and application performance goals. You will also be responsible for troubleshooting system errors and resolving any...


  • Pune, Maharashtra, India NiCE Full time US$ 1,00,000 - US$ 1,50,000 per year

    At NiCE, we don't limit our challenges. We challenge our limits. Always. We're ambitious. We're game changers. And we play to win. We set the highest standards and execute beyond them. And if you're like us, we can offer you the ultimate career opportunity that will light a fire within you.So, what's the role all about?NICE Public Safety has expanded...


  • Pune, Maharashtra, India Equifax Full time ₹ 10,00,000 - ₹ 25,00,000 per year

    Site Reliability Engineering (SRE)at Equifax is a discipline that combines software and systems engineering for building and running large-scale, distributed, fault-tolerant systems. SRE ensures that internal and external services meet or exceed reliability and performance expectations while adhering to Equifax engineering principles.SRE is also an...


  • Pune, Maharashtra, India Mastercard Full time US$ 1,25,000 - US$ 1,75,000 per year

    Job Title:Lead Site Reliability Engineer Overview:==Lead Site Reliability Engineer (Storage) – Pune, IndiaOur PurposeWe work to connect and power an inclusive, digital economy that benefits everyone, everywhere by making transactions safe, simple, smart and accessible. Using secure data and networks, partnerships and passion, our innovations and solutions...


  • Pune, Maharashtra, India Reveille Technologies Full time

    Job Summary :We are seeking a skilled and proactive Site Reliability Engineer (SRE) with a strong DevOps mindset and hands-on experience in application troubleshooting. The ideal candidate will be responsible for ensuring the reliability, scalability, and performance of our applications and infrastructure. This role requires a blend of software engineering,...


  • Pune, Maharashtra, India Equifax Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    What You'll DoWork in a DevSecOps environment responsible for the building and running of large-scale, massively distributed, fault-tolerant systems.Work closely with development and operations teams to build highly available, cost effective systems with extremely high uptime metrics.Work with cloud operations team to resolve trouble tickets, develop and run...


  • Pune, Maharashtra, India Barclays Full time ₹ 6,00,000 - ₹ 18,00,000 per year

    Step into the role of Senior Site Reliability Engineer - Database Specialist. At Barclays, we are more than a bank we are a force for progress. You will be the part of the central SRE (Site Reliability Engineer) core team within our wider Infrastructure team. You will act as a centre of excellence providing hands on consultancy to our different...