
Site Reliability Engineer/Lead
6 hours ago
Key Responsibilities :
- Own the availability, scalability, and performance of production systems and services.
- Design and manage distributed systems and microservices architectures at scale.
- Develop and implement incident response strategies, root cause analysis, and create actionable postmortems.
- Drive improvements in infrastructure automation, CI/CD pipelines, and deployment strategies.
- Collaborate with cross-functional teams including engineering, product, and QA to embed SRE best practices.
- Implement observability tools (e.g., Prometheus, Grafana, ELK, Datadog) to monitor system performance and proactively detect issues.
- Manage and optimize cloud infrastructure on AWS, including services such as EC2, ELB,
AutoScaling, S3, CloudFront, and CloudWatch.
- Utilize Infrastructure-as-Code tools such as Terraform, CloudFormation, or Pulumi for provisioning and maintaining infrastructure.
- Apply strong Linux, networking, load balancing, and security principles to ensure platform
resilience.
- Leverage Docker and Kubernetes for container orchestration and scalable deployments.
- Build internal tools and automation using Python, Go, or Bash scripting.
- Support event-driven architectures leveraging Kafka or RabbitMQ for high-throughput, real-time systems.
- Proactively contribute to reliability-focused architecture and design discussions.
Required Skills & Experience :
years of overall experience in backend engineering, infrastructure, DevOps, or SRE roles.- Minimum 3 years of experience leading SRE, DevOps, or Infrastructure teams.
- Proven track record managing distributed systems and microservices at scale.
- Deep understanding of Linux systems, networking fundamentals, load balancing, and infrastructure security.
- Strong hands-on experience with AWS services : EC2, ELB, AutoScaling, CloudFront, S3, and CloudWatch.
- Expert-level knowledge of Docker and Kubernetes in production environments.
- Proficient with Infrastructure-as-Code tools : Terraform, CloudFormation, or Pulumi.
- Hands-on experience with monitoring and observability tools : Prometheus, Grafana, ELK
Stack, or Datadog.
- Strong scripting or programming skills in Python, Go, Bash, or similar languages.
- Familiarity with Kafka or RabbitMQ for event-driven and messaging architectures.
- Excellent incident management skills, including triage, RCA, and communication.
- Ability to thrive in fast-paced environments and adapt to changing priorities.
Preferred Qualifications :
- Bachelors degree in Computer Science, Engineering, or a related field.- Experience in startup or high-growth environments.
- Contributions to open-source DevOps or SRE tools are a plus.
- Certifications in AWS, Kubernetes, or other cloud-native technologies are advantageous.
-
VP – Site Reliability Engineering
1 week ago
Mumbai, Maharashtra, India Natobotics Full timeJob DescriptionWere on an exciting journey with our client and we want you to join us. With our client, you will beexposed to the latest technologies and work with some of the brightest minds in the industry.Our client is leading Banking company so you will be playing a key role as a VP Site Reliability Engineering (SRE), who can assist with the below:Roles...
-
Senior Site Reliability Engineer
2 weeks ago
Mumbai, Maharashtra, India beBeeSiteReliability Full time US$ 1,00,000 - US$ 1,50,000Unlock Your Potential as a Senior Site Reliability EngineerWe are seeking a highly skilled and motivated Senior Site Reliability Engineer to join our team. As a key member of our Information Systems (IS) team, you will play a critical role in ensuring the smooth operation of our production services, supporting over 60 million Ubuntu users.The ideal candidate...
-
Site Reliability Engineer 2
1 week ago
Navi Mumbai, Maharashtra, India Uplers Full time ₹ 8,00,000 - ₹ 25,00,000 per yearExperience: 4+ yearsSalary: ConfidentialShift: (GMT+05:30) Asia/Kolkata (IST)Opportunity Type: Office (Mumbai)Placement Type: Full time Permanent Position(*Note: This is a requirement for one of Uplers' client--Gofynd)What do you need for this opportunity?Must have skills required: and AWS/Google Cloud and MongoDB/CI/CD/GrafanaJob descriptionFynd is Indias...
-
Site Reliability Engineer
2 weeks ago
Mumbai, Maharashtra, India Deqode Full timeProfile : Site Reliability Engineer (SRE)Experience Required : 6+ YearsLocations : Mumbai, Gurgaon, ChennaiWork Arrangement : HybridKey Responsibilities :- Design and implement scalable, resilient cloud-native infrastructure across AWS/Azure/GCP platforms- Own the SRE function including availability, latency, performance monitoring, emergency response,...
-
Site Reliability Engineer
2 hours ago
Mumbai, Maharashtra, India Search Synergy Pvt Ltd Full time ₹ 6,00,000 - ₹ 18,00,000 per yearNote - Location - Dadar/Kurla (Mumbai)Skill, Knowledge &Trainings : - Own and manage the CI/CD pipelines for automated build, test, and deployment. - Design and implement robust deployment strategies for microservices and web applications. - Set up and maintain monitoring, alerting, and logging frameworks (e.g., Prometheus, Grafana, ELK) - Build...
-
Lead Site Reliability Engineer
6 hours ago
Mumbai, Maharashtra, India Neemtree Tech Hiring Full time ₹ 12,00,000 - ₹ 36,00,000 per yearResponsibilities : - Team Leadership : Manage and mentor a team of SREs, assigning tasks, providing technical guidance, and fostering a culture of collaboration and continuous learning. - Design and Implement Monitoring and Alerting : Lead the implementation of reliable, scalable, and fault-tolerant systems, including infrastructure, monitoring, and...
-
Sr. Site Reliability Engineer
2 days ago
Mumbai, Maharashtra, India ETP Group Full time ₹ 1,04,000 - ₹ 1,30,878 per yearExperience Required7-10LocationMumbaiRole TypeFull timeJob Title: Senior Site Reliability Engineer (SRE) – MACH SaaS PlatformKey ResponsibilitiesEnsure uptime SLAs and overall reliability of production, staging, and test environments.Continuously assess all platform components for correct configuration — including instance sizes, memory allocation,...
-
Mumbai, Maharashtra, India beBeeInfrastructure Full time ₹ 5,00,000 - ₹ 8,00,000Job Title: Site Reliability EngineerThis is an exceptional opportunity to join a global team of skilled professionals as a Site Reliability Engineer. In this role, you will be responsible for ensuring the reliability and performance of our cloud-based services.Job DescriptionWe are seeking a highly skilled engineer with experience in IT operations...
-
Senior Site Reliability Engineer
14 minutes ago
Mumbai, Maharashtra, India Pivotree Full time ₹ 10,00,000 - ₹ 25,00,000 per yearIntroductionOur goal at Pivotree is to help accelerate the future of frictionless commerce. We will help lead this change over the next decade because we believe a future where technology is embedded intimately into all aspects of our everyday lives can benefit everyone and will shape the interactions with the brands we love. We will help shape the future of...
-
Site Reliability Engineer II
2 days ago
Mumbai, Maharashtra, India JPMorganChase Full time ₹ 15,00,000 - ₹ 25,00,000 per yearJOB DESCRIPTIONPlay a key role in ensuring system reliability at one of the world's most iconic and largest financial institutions.As a Site Reliability Engineer II at JPMorgan Chase within the Client Onboarding team which is aligned to Corporate Technology division, you will use technology to solve business problems and leverage software engineering best...