SRE - 3 (Site Reliability Engineer)

1 week ago


Mumbai, Maharashtra, India Fynd Full time ₹ 8,00,000 - ₹ 24,00,000 per year

Fynd is India's largest omnichannel platform and a multi-platform tech company specializing in retail technology and products in AI, ML, big data, image editing, and the learning space. It provides a unified platform for businesses to seamlessly manage online and offline sales, store operations, inventory, and customer engagement. Serving over 2,300 brands, Fynd is at the forefront of retail technology, transforming customer experiences and business processes across various industries.

We're looking for an SRE 3 – Site Reliability Engineering (SRE) to join our Engineering Team. The Engineering Team forms the backbone of our core business. We build and operate critical systems that ensure the reliability, scalability, and performance of our services across the Fynd ecosystem. This includes infrastructure automation, monitoring, deployment pipelines, and building a culture of reliability throughout engineering. The SRE team works closely with product engineers, DevOps, and platform teams to keep our systems running smoothly and efficiently.

What will you do at Fynd?
  • Lead, mentor, and grow a team of 2-5 Site Reliability Engineers.
  • Define, implement, and advocate SRE best practices like SLAs, SLOs, SLIs, error budgets, and chaos engineering.
  • Build and maintain automated CI/CD pipelines and infrastructure using tools like Terraform, Jenkins, or GitHub Actions.
  • Own the observability stack—monitoring, alerting, logging, and tracing across microservices and platforms.
  • Improve reliability and scalability of services by proactively identifying bottlenecks and automating manual ops tasks.
  • Drive incident response practices including on-call rotations, runbooks, and blameless postmortems.
  • Ensure high availability and uptime across distributed systems hosted on AWS.
  • Collaborate with cross-functional teams to ensure the architecture is cloud-native, secure, and fault-tolerant.
  • Implement and optimize systems for cost-efficiency, auto-scaling, and performance.
  • Contribute to open source or write technical blogs to share insights and practices with the broader tech community.
  • This is a startup, so expect rapid changes and plenty of opportunities to take initiative and drive new initiatives.
Some Specific Requirements
  • At least 5+ years of experience leading SRE/DevOps/Infrastructure teams, with 5+ years overall in backend, systems, or infrastructure roles.
  • Strong experience managing distributed systems and microservices at scale.
  • Good understanding of Linux, Networking, Load Balancing, and Security concepts.
  • Hands-on experience with AWS services like EC2, ELB, AutoScaling, CloudFront, S3, CloudWatch.
  • Experience with container technologies and orchestration—Docker and Kubernetes is a must.
  • Strong proficiency with Infrastructure-as-Code tools like Terraform, CloudFormation, or Pulumi.
  • Familiarity with observability tools like Prometheus, Grafana, ELK, or Datadog.
  • Programming/scripting skills in Python, Go, Bash or similar for automation and tooling.
  • Understanding of message queues and event-driven architectures using Kafka or RabbitMQ.
  • Ability to manage incidents, write detailed postmortems, and improve reliability across teams and services.
  • Comfortable working in a fast-paced environment with a strong culture of ownership and continuous improvement.

What do we offer?

Growth

Growth knows no bounds, as we foster an environment that encourages creativity, embraces challenges, and cultivates a culture of continuous expansion. We are looking at new product lines, international markets and brilliant people to grow even further. We teach, groom and nurture our people to become leaders. You get to grow with a company that is growing exponentially.

Flex University

We help you upskill by organising in-house courses on important subjects

Learning Wallet: You can also do an external course to upskill and grow, we reimburse it for you.

Culture

Community and Team building activities

Host weekly, quarterly and annual events/parties.

Wellness

Mediclaim policy for you + parents + spouse + kids

Experienced therapist for better mental health, improve productivity & work-life balance

We work from the office 5 days a week to promote collaboration and teamwork. Join us to make an impact in an engaging, in-person environment



  • Mumbai, Maharashtra, India APTO SOLUTIONS - EXECUTIVE SEARCH & CONSULTANTS Full time ₹ 6,00,000 - ₹ 18,00,000 per year

    #Hiring Alert – Site Reliability Engineer L2 (SRE) Location: Mumbai - contractualExperience - 5+ YearsNotice - Immediate Joiners Apply Now: Skills & Experience:5+ years of proven tech experience.Hands-on in Data Center Operations (DCOps) – Linux installation, configuration & troubleshooting.Strong experience in Java, container technologies...

  • Sre

    2 weeks ago


    Mumbai, Maharashtra, India Weekday AI Full time ₹ 8,00,000 - ₹ 24,00,000 per year

    This role is for one of the Weekday's clientsMin Experience: 4 yearsLocation: IndiaJobType: full-timeWe are looking for a dedicated Site Reliability Engineer (SRE) to join our technology team. This position focuses on maintaining the reliability, availability, and performance of systems used across the organization. The role requires someone who can work...


  • Navi Mumbai, Maharashtra, India Koantek Full time

    About the Role:We are seeking a highly skilled and experienced SREDatabricks Platform Administrator to join our DataOperations Team. In this critical role, you will be responsible for the availability, performance,Reliability and scalability of our enterprise Databricks platform. You will blend deep expertise in Databricks administration with SRE principles...


  • Mumbai, Maharashtra, India equentis Full time

    Job Title: Site Reliability Engineer (SRE)Company – Equentis Wealth Advisory LimitedLocation – Lower Parel, MumbaiJob Summary: We are seeking a talented Site Reliability Engineer (SRE) to join our team andplay a critical role in ensuring the reliability, scalability, and performance of our systems andapplications. The ideal candidate will have a strong...


  • Mumbai, Maharashtra, India, Maharashtra Insight Global Full time

    Site Reliability EngineerLocation: Mumbai, India - working onsite 1x a week Salary: 22-25 LPATarget Start Date: January 2026Join our dynamic and highly collaborative agile team, where you'll play a pivotal role in ensuring the reliability, scalability, and efficiency of our premier InsurTech solution. Our platform enables clients to obtain quotes and issue...


  • Mumbai, Maharashtra, India RELX Full time ₹ 8,00,000 - ₹ 24,00,000 per year

    Job DescriptionWould you like to be part of a team that delivers high-quality software to our customers?Are you a visible champion with a 'can do' attitude and enthusiasm that inspires others?About The BusinessLexisNexis Risk Solutions is the essential partner in the assessment of risk. Within our Business Services vertical, we offer a multitude of solutions...


  • Mumbai, Maharashtra, India Hirexa Solutions Full time ₹ 4,00,000 - ₹ 12,00,000 per year

    HI All, We are hiring for Site Reliability Engineer with one of our product-based client - Permanent hiring Skills: Should Have At least 7+ years of Experience on AWSShould have Good Hands-On Experience on Below skillsObservability/Monitoring*Python*Bash/Shell ScriptTerraform*Automation*Account PipelineService NowGitlabJira Exp: 7 to 14 Yrs CTC: Exp*2.5...

  • Senior Specialist

    2 weeks ago


    Mumbai, Maharashtra, India Datavail Career Site Full time ₹ 60,00,000 - ₹ 1,80,00,000 per year

    Job Title: Senior Specialist – Cloud SRE Education: Bachelor's Degree Experience: 8+ years Location: Mumbai As a Senior SRE Engineer (Cloud SRE Specialist), you will be responsible for ensuring the reliability, scalability, performance, and cost optimization of cloud services across AWS, Azure, and multi-cloud environments. You will act as the primary...


  • Mumbai, Maharashtra, India BNP Paribas Full time ₹ 6,00,000 - ₹ 18,00,000 per year

    Position Purpose The main responsibility of Stability & Resilience division is to support the IT strategy & Production and gathers activities contributing directly to the stability and integrity of the Production and to the Information Systems resilience.Within the division, the domain Control Tower Process & Framework oversees the Global Production...


  • Mumbai, Maharashtra, India LexisNexis Risk Solutions Full time ₹ 6,00,000 - ₹ 18,00,000 per year

    Job DescriptionWould you like to be part of a team that delivers high-quality software to our customers?Are you a visible champion with a 'can do' attitude and enthusiasm that inspires others?About the BusinessLexisNexis Risk Solutions is the essential partner in the assessment of risk. Within our Business Services vertical, we offer a multitude of solutions...