Staff site reliability engineer

1 week ago


Delhi, India | Pocket FM | Full time
Job Description: SRE Staff Engineer
Company: Pocket FM
Position Overview:
Pocket FM is seeking an experienced and dynamic SRE Staff Engineer to drive the reliability, scalability, and performance of our audio streaming systems. As a key leader in the engineering team, you will design robust infrastructure, ensure platform availability, and guide a team of SREs to uphold our commitment to delivering a seamless listening experience to millions of users.
Key Responsibilities
Leadership and Strategy
Lead and mentor a team of SREs, fostering a culture of reliability and excellence.
Define and implement SLOs, SLAs, and SLIs tailored to Pocket FM's business goals.
Drive cross-functional initiatives to embed reliability across the engineering lifecycle.
Infrastructure and Operations
Architect and maintain highly scalable, secure, and fault-tolerant infrastructure for Pocket FM's audio streaming platform.
Enhance CI/CD pipelines for efficient and reliable software delivery.
Manage system capacity planning and optimize performance for peak user loads.
Incident Management and Root Cause Analysis
Lead incident resolution efforts, ensuring timely communication and minimal user impact.
Conduct post-incident reviews, driving improvements to system reliability and operational processes.
Automation and Monitoring
Build and maintain automated tools to detect, monitor, and mitigate system anomalies.
Develop advanced observability systems to provide actionable insights into platform health.
Collaboration and Innovation
Partner with development, product, and QA teams to incorporate reliability principles from design to deployment.
Contribute to Pocket FM’s long-term engineering roadmap and reliability goals.
Required Skills and Experience
Education: Bachelor's or Master's degree in Computer Science, Engineering, or a related field.
Experience:
6+ years of experience in software engineering, DevOps, or SRE roles.
3+ years of experience in a leadership or senior technical position, managing teams or projects.
Technical Proficiency:
Hands-on expertise with cloud platforms (AWS, GCP, Azure).
Proficiency in containerization and orchestration tools (Docker, Kubernetes).
Strong programming skills in Python, Go, or Java.
Experience with monitoring and logging tools (Prometheus, Grafana, ELK, Datadog).
Expertise in infrastructure-as-code tools (Terraform, Ansible, etc.).
Soft Skills:
Strong analytical and problem-solving skills.
Excellent communication and team collaboration abilities.
Ability to thrive in a dynamic, fast-paced environment.
Preferred Qualifications
Certification in cloud platforms (e.g., AWS Solutions Architect, Google Cloud DevOps Engineer).
Familiarity with ITIL or similar incident management frameworks.
Experience managing large-scale distributed systems for consumer-facing applications.
Why Join Pocket FM?
Be a part of a rapidly growing, mission-driven team transforming audio entertainment.
Work on cutting-edge technologies at scale, impacting millions of users worldwide.
Enjoy competitive compensation, a collaborative work environment, and ample growth opportunities.
If you’re passionate about reliability engineering and love solving complex challenges in a high-impact environment, Pocket FM is the place for you.
