SRE

2 weeks ago


Mumbai, India Fynd Full time

Job Description What will you do at Fynd - Lead, mentor, and grow a team of 2-5 Site Reliability Engineers. - Define, implement, and advocate SRE best practices like SLAs, SLOs, SLIs, error budgets, and chaos engineering. - Build and maintain automated CI/CD pipelines and infrastructure using tools like Terraform, Jenkins, or GitHub Actions. - Own the observability stackmonitoring, alerting, logging, and tracing across microservices and platforms. - Improve reliability and scalability of services by proactively identifying bottlenecks and automating manual ops tasks. - Drive incident response practices including on-call rotations, runbooks, and blameless postmortems. - Ensure high availability and uptime across distributed systems hosted on AWS. - Collaborate with cross-functional teams to ensure the architecture is cloud-native, secure, and fault-tolerant. - Implement and optimize systems for cost-efficiency, auto-scaling, and performance. - Contribute to open source or write technical blogs to share insights and practices with the broader tech community. - This is a startup, so expect rapid changes and plenty of opportunities to take initiative and drive new initiatives. Some Specific Requirements - At least 3+ years of experience leading SRE/DevOps/Infrastructure teams, with 5+ years overall in backend, systems, or infrastructure roles. - Strong experience managing distributed systems and microservices at scale. - Good understanding of Linux, Networking, Load Balancing, and Security concepts. - Hands-on experience with AWS services like EC2, ELB, AutoScaling, CloudFront, S3, CloudWatch. - Experience with container technologies and orchestrationDocker and Kubernetes is a must. - Strong proficiency with Infrastructure-as-Code tools like Terraform, CloudFormation, or Pulumi. - Familiarity with observability tools like Prometheus, Grafana, ELK, or Datadog. - Programming/scripting skills in Python, Go, Bash or similar for automation and tooling. - Understanding of message queues and event-driven architectures using Kafka or RabbitMQ. - Ability to manage incidents, write detailed postmortems, and improve reliability across teams and services. - Comfortable working in a fast-paced environment with a strong culture of ownership and continuous improvement.


  • SRE Head

    4 weeks ago


    Mumbai, India SID Global Solutions Full time

    Job Title: SRE HeadExperience Level: ~10 yearsRole Type: Engineering / ReliabilityRole Overview:The SRE Head is responsible for leading and scaling the Site Reliability Engineering (SRE) function across the organization. This role defines the reliability strategy, standards, and practices to ensure high availability, performance, and resilience of critical...

  • SRE Head

    4 weeks ago


    Mumbai, India SID Global Solutions Full time

    Job Title: SRE Head Experience Level: ~10 years Role Type: Engineering / Reliability Role Overview: The SRE Head is responsible for leading and scaling the Site Reliability Engineering (SRE) function across the organization. This role defines the reliability strategy, standards, and practices to ensure high availability, performance, and resilience of...

  • SRE Lead

    4 weeks ago


    Mumbai, India SID Global Solutions Full time

    Job Title: SRE Lead Experience Level: ~10 years Role Type: Engineering / Reliability Role Overview: The SRE Lead is responsible for leading site reliability initiatives across assigned product or platform areas, ensuring systems are scalable, reliable, and performant. This role defines and manages reliability goals, drives operational excellence, and...

  • Devobs Sre

    1 week ago


    Mumbai, India Tekskills Inc Full time

    **Role**:Devops SRE - Experience**:4 years to 8 years** Strong development backgroundExperience in - IT Operations/support and SRE initiatives Experienced in Agile development lifecycle Hands-on experience with the following: Programming languages**:Java, Python, Shell, Perl(**strong into any one skill) Database platforms**:DB2/Sybase/MSSQL(**strong into...

  • SRE Lead

    4 days ago


    Mumbai, Maharashtra, India SID Global Solutions Full time ₹ 1,50,000 - ₹ 2,50,000 per year

    Job InformationJob Opening IDZR_1066_JOBDate Opened11/10/2025IndustryIT ServicesJob TypeFull timeCityMumbaiState/ProvinceMaharashtraCountryIndiaZip/Postal Code400001Job DescriptionLead SRE initiatives for product areas; Set SLOs, error budgets, instrumentation, alerting; Partner with dev / infra / QA for reliability design; Mentor SRE engineers; Participate...

  • SRE Lead

    2 days ago


    Mumbai, Maharashtra, , India SID Global Solutions Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    Lead SRE initiatives for product areas; Set SLOs, error budgets, instrumentation, alerting; Partner with dev / infra / QA for reliability design; Mentor SRE engineers; Participate in incident resolution, postmortems, improvements 10+ years in reliability / ops / SRE; Monitoring, observability, incident management skills; Understanding distributed systems,...

  • Sre Architect

    1 week ago


    Mumbai, Maharashtra, India Infogain Full time

    ROLES & RESPONSIBILITIES **Core Skills** 12 to 14 years of experience in Site Reliability Engineering, DevOps, or a related field, with at least 3 years in a senior or architect-level role. Strong expertise in system architecture, distributed systems, cloud computing (e.g., AWS, Azure, GCP), containerization (e.g., Docker, Kubernetes), and infrastructure...

  • SRE Coach

    3 weeks ago


    Mumbai, India Acme Services Full time

    Job Description - Bachelors or Masters degree in Computer Science, Engineering, or a related field. - 5+ years of experience as an SRE or in a related role with overall 10+ years of experience - Strong leadership skills and experience mentoring and coaching team members. - In-depth knowledge of site reliability engineering principles, best practices, and...

  • Api, Microservices

    2 days ago


    Mumbai, India DEQTAL Full time

    You will have hands-on experience taking requirements and designing, architecting and implementing reusable and robust solutions. - 5+ years of experience with AWS. Bonus points for experience with lambdas, API gateway, and other serverless technologies. - Strong experience with Typescript. Java and Spring Boot as well will be a distinct advantage. - 5+...

  • Production Support

    2 weeks ago


    Mumbai, Maharashtra, India People Connect Solutions Full time

    **Location**:Airoli**: - **New Mumbai **Position**: Production Support - SRE **Ideal Exp**: 2 - 5 Yrs **Primary Responsibilities**: - Proven ability to debug, optimize Live/Production system incidents, and automate mundane tasks - Expertise in analyzing, and troubleshooting large-scale OLTP/distributed systems - Systematic an individual ‘can do...