Staff Site Reliability Engineer
1 week ago
Company: Pocket FM
Position Overview:
Pocket FM is seeking an experienced and dynamic SRE Staff Engineer to drive the reliability, scalability, and performance of our audio streaming systems. As a key leader in the engineering team, you will design robust infrastructure, ensure platform availability, and guide a team of SREs to uphold our commitment to delivering a seamless listening experience to millions of users.
Key Responsibilities
Leadership and Strategy
Lead and mentor a team of SREs, fostering a culture of reliability and excellence.
Define and implement SLOs, SLAs, and SLIs tailored to Pocket FM's business goals.
Drive cross-functional initiatives to embed reliability across the engineering lifecycle.
Infrastructure and Operations
Architect and maintain highly scalable, secure, and fault-tolerant infrastructure for Pocket FM's audio streaming platform.
Enhance CI/CD pipelines for efficient and reliable software delivery.
Manage system capacity planning and optimize performance for peak user loads.
Incident Management and Root Cause Analysis
Lead incident resolution efforts, ensuring timely communication and minimal user impact.
Conduct post-incident reviews, driving improvements to system reliability and operational processes.
Automation and Monitoring
Build and maintain automated tools to detect, monitor, and mitigate system anomalies.
Develop advanced observability systems to provide actionable insights into platform health.
Collaboration and Innovation
Partner with development, product, and QA teams to incorporate reliability principles from design to deployment.
Contribute to Pocket FM’s long-term engineering roadmap and reliability goals.
Required Skills and Experience
Education: Bachelor's or Master's degree in Computer Science, Engineering, or a related field.
Experience:
6+ years of experience in software engineering, DevOps, or SRE roles.
3+ years of experience in a leadership or senior technical position, managing teams or projects.
Technical Proficiency:
Hands-on expertise with cloud platforms (AWS, GCP, Azure).
Proficiency in containerization and orchestration tools (Docker, Kubernetes).
Strong programming skills in Python, Go, or Java.
Experience with monitoring and logging tools (Prometheus, Grafana, ELK, Datadog).
Expertise in infrastructure-as-code tools (Terraform, Ansible, etc.).
Soft Skills:
Strong analytical and problem-solving skills.
Excellent communication and team collaboration abilities.
Ability to thrive in a dynamic, fast-paced environment.
Preferred Qualifications
Certification in cloud platforms (e.g., AWS Solutions Architect, Google Cloud DevOps Engineer).
Familiarity with ITIL or similar incident management frameworks.
Experience managing large-scale distributed systems for consumer-facing applications.
Why Join Pocket FM?
Be a part of a rapidly growing, mission-driven team transforming audio entertainment.
Work on cutting-edge technologies at scale, impacting millions of users worldwide.
Enjoy competitive compensation, a collaborative work environment, and ample growth opportunities.
If you’re passionate about reliability engineering and love solving complex challenges in a high-impact environment, Pocket FM is the place for you.
-
Staff Site Reliability Engineer
14 hours ago
Delhi, India Zscaler Full time
About the role: Our Engineering team built the world's largest cloud security platform from the ground up, and we keep building. With more than 100 patents and big plans for enhancing services and increasing our global footprint, the team has made us and our multitenant architecture today's cloud security leader, with more than 15 million users in 185...
-
Staff Site Reliability Engineer
1 day ago
Delhi, India Zscaler Full time
About the role: Our Engineering team built the world's largest cloud security platform from the ground up, and we keep building. With more than 100 patents and big plans for enhancing services and increasing our global footprint, the team has made us and our multitenant architecture today's cloud security leader, with more than 15 million users in 185...
-
Staff Site Reliability Engineer
1 week ago
Delhi, India Pocket FM Full time
Job Description: SRE Staff Engineer
Company: Pocket FM
Position Overview: Pocket FM is seeking an experienced and dynamic SRE Staff Engineer to drive the reliability, scalability, and performance of our audio streaming systems. As a key leader in the engineering team, you will design robust infrastructure, ensure platform availability, and guide a team of SREs to...
-
Site Reliability Engineer
4 weeks ago
Delhi, India Tata Consultancy Services Full time
Dear Candidate, Greetings from TCS! TCS is hiring for SRE; please find the JD below.
Experience range: 5+ years
Location: Bangalore, Pune, Hyderabad, Chennai
Skills Required: Site Reliability Engineer
Role & Responsibilities: Collaborates with cloud platform engineers and teams to design, develop, test, and implement availability, reliability,...
-
Site Reliability Engineer
1 week ago
Delhi, India IDEMIA Full time
We are hiring for a Site Reliability Engineer role at the Noida location.
Responsibility:
- Involved in deploying, managing, and operating medium to large scale production systems
- Understanding of Linux as a runtime environment
- Familiar with cloud-native concepts and virtualisation
- Familiar with CI/CD concepts and tools like Jenkins, GitLab, etc.
- Previous experience of working...
-
Site Reliability Engineer
3 weeks ago
Delhi, India InstaService Inc Full time
About Us: At InstaService, we are committed to delivering reliable, high-performance home services to our customers. As a fast-growing on-demand services platform, we are looking for a talented DevOps / Site Reliability Engineer (SRE) to join our dynamic team. This role is crucial to scaling and maintaining our infrastructure, ensuring our platform remains...
-
Site Reliability Engineer
1 day ago
Delhi, India Tata Consultancy Services Full time
TCS has been a great pioneer in feeding the fire of young techies like you. We are a global leader in the technology arena and there’s nothing that can stop us from growing together.
What we are looking for
Role: Site Reliability Engineer
Experience Range: 8 – 12 Years
Location: Pune & Chennai, Bangalore, Delhi
Must-Have: Exceptional skills in...
-
Site Reliability Engineer
4 weeks ago
Delhi, India PeopleLogic Full time
Job Responsibilities:
Ensure the 24/7 operations and reliability of data services in our production
Collaborate with the data engineering development team to design, build, and maintain scalable, reliable, and secure data pipelines and systems.
Develop and implement monitoring, alerting, and incident response strategies to proactively identify and resolve...
-
Site Reliability Engineer
1 week ago
New Delhi, India AIVID.AI Full time
Role Overview: We are seeking proactive and skilled Site Reliability Engineers (SREs) to manage client deployments, provide on-site support, and ensure the seamless functioning of our AI-based camera analytics systems. This hybrid role requires a mix of on-site visits and remote work. The selected candidates will operate from their respective regions: New...
-
Site Reliability Engineer
1 week ago
Delhi, India Systal Technology Solutions Full time
Site Reliability Engineer
Competitive Salary & Benefits
Bangalore
Systal is a global managed network and security service and transformation specialist. We consult, deploy, and integrate multi-vendor technologies which help enterprise businesses maximize the security and value of their complex IT infrastructure. Across our 24/7 Network and Security Operations...