Senior Site Reliability Engineer

2 days ago

Mumbai, Maharashtra, India Pivotree Full time ₹ 10,00,000 - ₹ 25,00,000 per year

Introduction
Our goal at Pivotree is to help accelerate the future of frictionless commerce. We will help lead this change over the next decade because we believe a future where technology is embedded intimately into all aspects of our everyday lives can benefit everyone and will shape the interactions with the brands we love. We will help shape the future of frictionless commerce by working together with some of the best brands in the world and some of the best people in the industry to leverage converging technologies that will make it possible to accelerate frictionless commerce faster than ever.

Pivotree provides services focused on the design, implementation, management, and maintenance of complex ecommerce solutions for large enterprises. We provide the technical skills necessary to enable the effective use of technologies combined with the business context to leverage a solution to solve our clients' business challenges. We strive to fill the gaps in available technology with our own IP to reduce the barriers to adoption.

We enable inclusive, immersive and highly personalized experiences for our clients and their customers. We build our products with a view to productizing and scaling technology to lower the costs and reduce the risks of implementing and managing our integrated solutions. Each of our solutions starts with reliable and reputable e-commerce and MDM platforms, which run on enterprise-grade infrastructure and are customized to meet a variety of client needs, situations, and budgets. Over the next 10 years we will add new categories and capabilities that will define frictionless commerce ecosystems.

This is a journey of technology acceleration combined with consumer readiness and adoption. We are looking for people capable of adapting relentlessly to the rapidly evolving world around us.

Position Summary
We are currently seeking a Senior Site Reliability Engineer (SRE) to join our team. In this role you will contribute to the reliability and enhancement of the technology engine that powers multiple Pivotree solutions. The primary function of this role is the direct responsibility for the availability of platform solutions, focusing on several key areas, including availability, performance, change management, monitoring and emergency response. You will work with other members of the platform, solutions, operations, and application teams to understand and ultimately address changing and evolving requirements through extending and exposing capabilities in a simple and consistent fashion. You will be a member of a team who maintains expertise with Utility Computing services and will advise management and the organization as a whole on this mode of computing.

You will

Be responsible for the availability, performance, and reliability of platform services.
Design and manage infrastructure in AWS, especially across multi-account AWS Organization setups.
Lead efforts to automate cloud resource provisioning using IaC tools such as Terraform and AWS CloudFormation.
Contribute to ensuring pooled and independent utility services are highly available
Actively take part in and initiate continuous improvement: measure and reduce manual tasks and overhead
Be a subject matter expert for Utility Computing providers and respective services both existing and emerging - with particular focus on AWS
Complete systems development, administration, and engineering tasks including integration, documentation and testing
Develop and maintain tools, processes, and workflows for automated infrastructure resource(s) and application deployment, configuration management & maintenance
Own the responsibility for platform management, supporting services, and all related tooling and automation
Design, implement, and maintain monitoring, alerting, and observability solutions to ensure system reliability, performance, and timely incident detection.
Investigate and troubleshoot relevant platform-based issues and incidents (high availability, performance, security, etc.)
Collaborate across distributed teams and act as a technical liaison for various stakeholders.
Participate in change management, release automation, compliance, and audit-readiness practices.
Participate in recurring stand-ups with other team members located in different locations and time zones
Participate in on-call rotation, escalations, and shift work
Work with other team members to improve processes and advance relevant and related competencies
Provide technical mentorship, helping teammates grow through knowledge sharing and peer coaching.

You are

Experienced in production-grade AWS Cloud environments with a focus on scalability, security, and governance.
Well-versed in Linux (RHEL/Debian) environments and proficient in system administration.
Adept at supporting modern development workflows, Agile teams, and CI/CD tooling
A strong communicator, with the ability to interface with technical and non-technical stakeholders across geographies.
Motivated to mentor others and foster a collaborative engineering culture.
Capable of operating independently and making strategic infrastructure decisions.
Passionate about reliability engineering and operational excellence.
Experienced at working on large projects with deadlines
Committed to high quality and attention to detail
Focused and committed to delivering high quality services
A strategic thinker who is able to link business and technical objectives
Someone that can go wide and deep, who works with several disparate systems and services and ultimately acquires expert knowledge and who can navigate accordingly

You have (MUST HAVE)

5+ years of experience in Site Reliability Engineering, Cloud Engineering, or DevOps roles.
Minimum one Associate-level Amazon AWS certification.
3+ years mature, production level experience with infrastructure-as-code concepts and practices using Terraform, AWS CloudFormation or similar.
3+ years of hands-on experience managing Kubernetes clusters (EKS preferred), including container orchestration and troubleshooting.
Strong knowledge and practical experience in Linux systems administration (RHEL and Debian-based distros), networking, storage, and virtualization in production-grade environments.
Experience working with API-driven and/or Event-driven architectures at scale in AWS environments.
Demonstrated ability to manage and troubleshoot web applications, middleware, and databases in real-world deployments.
Expertise with observability stacks and performance monitoring using tools like Grafana, Prometheus, CloudWatch, and Loki or similar.
Advanced scripting proficiency in Python, Bash, and basic PowerShell.
Solid experience with CI/CD pipelines, version control, and automated testing using Git, Bitbucket, GitHub, Jenkins, or similar.
Proven track record in implementing security and compliance controls, particularly in regulated environments (SOC 2, PCI-DSS, or ISO
Strong understanding of systems security, including identity, permissions, network policies, and audit tooling.
Exceptional troubleshooting skills, attention to detail, and a strong drive for continuous improvement.
Excellent communication skills with the ability to clearly articulate complex concepts to both technical and non-technical audiences, and collaborate effectively across distributed teams.
Demonstrated ability to mentor junior engineers, share knowledge, and act as a regional technical leader.
Ability to work both independently and collaboratively, learn new technologies quickly, and help set standards and best practices.

Nice to Have

Experience and/or exposure to the Serverless Framework
Experience with APM tools such as AppDynamics, New Relic, Grafana, Dynatrace, or AWS X-Ray
Experience with the following AWS services in a production environment (API Gateway, Cognito, RDS, DynamoDB, ECS, EMR, Lambda)
AWS Certified Developer
AWS Certified SysOps Administrator
AWS Certified Solutions Architect

Pivotree is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive and accessible workplace.

