SRE - 3 (Site Reliability Engineer)
1 week ago
Fynd is India's largest omnichannel platform and a multi-platform tech company specializing in retail technology and products in AI, ML, big data, image editing, and the learning space. It provides a unified platform for businesses to seamlessly manage online and offline sales, store operations, inventory, and customer engagement. Serving over 2,300 brands, Fynd is at the forefront of retail technology, transforming customer experiences and business processes across various industries.
We're looking for an SRE 3 – Site Reliability Engineering (SRE) to join our Engineering Team. The Engineering Team forms the backbone of our core business. We build and operate critical systems that ensure the reliability, scalability, and performance of our services across the Fynd ecosystem. This includes infrastructure automation, monitoring, deployment pipelines, and building a culture of reliability throughout engineering. The SRE team works closely with product engineers, DevOps, and platform teams to keep our systems running smoothly and efficiently.
What will you do at Fynd?- Lead, mentor, and grow a team of 2-5 Site Reliability Engineers.
- Define, implement, and advocate SRE best practices like SLAs, SLOs, SLIs, error budgets, and chaos engineering.
- Build and maintain automated CI/CD pipelines and infrastructure using tools like Terraform, Jenkins, or GitHub Actions.
- Own the observability stack—monitoring, alerting, logging, and tracing across microservices and platforms.
- Improve reliability and scalability of services by proactively identifying bottlenecks and automating manual ops tasks.
- Drive incident response practices including on-call rotations, runbooks, and blameless postmortems.
- Ensure high availability and uptime across distributed systems hosted on AWS.
- Collaborate with cross-functional teams to ensure the architecture is cloud-native, secure, and fault-tolerant.
- Implement and optimize systems for cost-efficiency, auto-scaling, and performance.
- Contribute to open source or write technical blogs to share insights and practices with the broader tech community.
- This is a startup, so expect rapid changes and plenty of opportunities to take initiative and drive new initiatives.
- At least 5+ years of experience leading SRE/DevOps/Infrastructure teams, with 5+ years overall in backend, systems, or infrastructure roles.
- Strong experience managing distributed systems and microservices at scale.
- Good understanding of Linux, Networking, Load Balancing, and Security concepts.
- Hands-on experience with AWS services like EC2, ELB, AutoScaling, CloudFront, S3, CloudWatch.
- Experience with container technologies and orchestration—Docker and Kubernetes is a must.
- Strong proficiency with Infrastructure-as-Code tools like Terraform, CloudFormation, or Pulumi.
- Familiarity with observability tools like Prometheus, Grafana, ELK, or Datadog.
- Programming/scripting skills in Python, Go, Bash or similar for automation and tooling.
- Understanding of message queues and event-driven architectures using Kafka or RabbitMQ.
- Ability to manage incidents, write detailed postmortems, and improve reliability across teams and services.
- Comfortable working in a fast-paced environment with a strong culture of ownership and continuous improvement.
What do we offer?
Growth
Growth knows no bounds, as we foster an environment that encourages creativity, embraces challenges, and cultivates a culture of continuous expansion. We are looking at new product lines, international markets and brilliant people to grow even further. We teach, groom and nurture our people to become leaders. You get to grow with a company that is growing exponentially.
Flex University
We help you upskill by organising in-house courses on important subjects
Learning Wallet: You can also do an external course to upskill and grow, we reimburse it for you.
Culture
Community and Team building activities
Host weekly, quarterly and annual events/parties.
Wellness
Mediclaim policy for you + parents + spouse + kids
Experienced therapist for better mental health, improve productivity & work-life balance
We work from the office 5 days a week to promote collaboration and teamwork. Join us to make an impact in an engaging, in-person environment
-
Site Reliability Engineer L2
2 weeks ago
Mumbai, Maharashtra, India APTO SOLUTIONS - EXECUTIVE SEARCH & CONSULTANTS Full time ₹ 6,00,000 - ₹ 18,00,000 per year#Hiring Alert – Site Reliability Engineer L2 (SRE) Location: Mumbai - contractualExperience - 5+ YearsNotice - Immediate Joiners Apply Now: Skills & Experience:5+ years of proven tech experience.Hands-on in Data Center Operations (DCOps) – Linux installation, configuration & troubleshooting.Strong experience in Java, container technologies...
-
Sre
2 weeks ago
Mumbai, Maharashtra, India Weekday AI Full time ₹ 8,00,000 - ₹ 24,00,000 per yearThis role is for one of the Weekday's clientsMin Experience: 4 yearsLocation: IndiaJobType: full-timeWe are looking for a dedicated Site Reliability Engineer (SRE) to join our technology team. This position focuses on maintaining the reliability, availability, and performance of systems used across the organization. The role requires someone who can work...
-
Site Reliability Engineering
3 days ago
Navi Mumbai, Maharashtra, India Koantek Full timeAbout the Role:We are seeking a highly skilled and experienced SREDatabricks Platform Administrator to join our DataOperations Team. In this critical role, you will be responsible for the availability, performance,Reliability and scalability of our enterprise Databricks platform. You will blend deep expertise in Databricks administration with SRE principles...
-
Site Reliability Engineering Manager
1 day ago
Mumbai, Maharashtra, India equentis Full timeJob Title: Site Reliability Engineer (SRE)Company – Equentis Wealth Advisory LimitedLocation – Lower Parel, MumbaiJob Summary: We are seeking a talented Site Reliability Engineer (SRE) to join our team andplay a critical role in ensuring the reliability, scalability, and performance of our systems andapplications. The ideal candidate will have a strong...
-
Site Reliability Engineer
6 days ago
Mumbai, Maharashtra, India, Maharashtra Insight Global Full timeSite Reliability EngineerLocation: Mumbai, India - working onsite 1x a week Salary: 22-25 LPATarget Start Date: January 2026Join our dynamic and highly collaborative agile team, where you'll play a pivotal role in ensuring the reliability, scalability, and efficiency of our premier InsurTech solution. Our platform enables clients to obtain quotes and issue...
-
Senior Site Reliability Engineering II
1 week ago
Mumbai, Maharashtra, India RELX Full time ₹ 8,00,000 - ₹ 24,00,000 per yearJob DescriptionWould you like to be part of a team that delivers high-quality software to our customers?Are you a visible champion with a 'can do' attitude and enthusiasm that inspires others?About The BusinessLexisNexis Risk Solutions is the essential partner in the assessment of risk. Within our Business Services vertical, we offer a multitude of solutions...
-
Site Reliability Engineer
1 week ago
Mumbai, Maharashtra, India Hirexa Solutions Full time ₹ 4,00,000 - ₹ 12,00,000 per yearHI All, We are hiring for Site Reliability Engineer with one of our product-based client - Permanent hiring Skills: Should Have At least 7+ years of Experience on AWSShould have Good Hands-On Experience on Below skillsObservability/Monitoring*Python*Bash/Shell ScriptTerraform*Automation*Account PipelineService NowGitlabJira Exp: 7 to 14 Yrs CTC: Exp*2.5...
-
Senior Specialist
2 weeks ago
Mumbai, Maharashtra, India Datavail Career Site Full time ₹ 60,00,000 - ₹ 1,80,00,000 per yearJob Title: Senior Specialist – Cloud SRE Education: Bachelor's Degree Experience: 8+ years Location: Mumbai As a Senior SRE Engineer (Cloud SRE Specialist), you will be responsible for ensuring the reliability, scalability, performance, and cost optimization of cloud services across AWS, Azure, and multi-cloud environments. You will act as the primary...
-
Site Reliability Engineer
1 week ago
Mumbai, Maharashtra, India BNP Paribas Full time ₹ 6,00,000 - ₹ 18,00,000 per yearPosition Purpose The main responsibility of Stability & Resilience division is to support the IT strategy & Production and gathers activities contributing directly to the stability and integrity of the Production and to the Information Systems resilience.Within the division, the domain Control Tower Process & Framework oversees the Global Production...
-
Senior Site Reliability Engineering II
1 week ago
Mumbai, Maharashtra, India LexisNexis Risk Solutions Full time ₹ 6,00,000 - ₹ 18,00,000 per yearJob DescriptionWould you like to be part of a team that delivers high-quality software to our customers?Are you a visible champion with a 'can do' attitude and enthusiasm that inspires others?About the BusinessLexisNexis Risk Solutions is the essential partner in the assessment of risk. Within our Business Services vertical, we offer a multitude of solutions...