Staff Site Reliability Engineer
2 weeks ago
Job Description: SRE Staff Engineer
Company: Pocket FM
Position Overview:
Pocket FM is seeking an experienced and dynamic SRE Staff Engineer to drive the reliability, scalability, and performance of our audio streaming systems. As a key leader in the engineering team, you will design robust infrastructure, ensure platform availability, and guide a team of SREs to uphold our commitment to delivering a seamless listening experience to millions of users.
Key Responsibilities
- Leadership and Strategy
  - Lead and mentor a team of SREs, fostering a culture of reliability and excellence.
  - Define and implement SLOs, SLAs, and SLIs tailored to Pocket FM's business goals.
  - Drive cross-functional initiatives to embed reliability across the engineering lifecycle.
- Infrastructure and Operations
  - Architect and maintain highly scalable, secure, and fault-tolerant infrastructure for Pocket FM's audio streaming platform.
  - Enhance CI/CD pipelines for efficient and reliable software delivery.
  - Manage system capacity planning and optimize performance for peak user loads.
- Incident Management and Root Cause Analysis
  - Lead incident resolution efforts, ensuring timely communication and minimal user impact.
  - Conduct post-incident reviews, driving improvements to system reliability and operational processes.
- Automation and Monitoring
  - Build and maintain automated tools to detect, monitor, and mitigate system anomalies.
  - Develop advanced observability systems to provide actionable insights into platform health.
- Collaboration and Innovation
  - Partner with development, product, and QA teams to incorporate reliability principles from design to deployment.
  - Contribute to Pocket FM's long-term engineering roadmap and reliability goals.
Required Skills and Experience
- Education: Bachelor's or Master's degree in Computer Science, Engineering, or a related field.
- Experience:
  - 6+ years of experience in software engineering, DevOps, or SRE roles.
  - 3+ years of experience in a leadership or senior technical position, managing teams or projects.
- Technical Proficiency:
  - Hands-on expertise with cloud platforms (AWS, GCP, Azure).
  - Proficiency in containerization and orchestration tools (Docker, Kubernetes).
  - Strong programming skills in Python, Go, or Java.
  - Experience with monitoring and logging tools (Prometheus, Grafana, ELK, Datadog).
  - Expertise in infrastructure-as-code tools (Terraform, Ansible, etc.).
- Soft Skills:
  - Strong analytical and problem-solving skills.
  - Excellent communication and team collaboration abilities.
  - Thrives in a dynamic, fast-paced environment.
Preferred Qualifications
- Certification in cloud platforms (e.g., AWS Solutions Architect, Google Cloud DevOps Engineer).
- Familiarity with ITIL or similar incident management frameworks.
- Experience managing large-scale distributed systems for consumer-facing applications.
Why Join Pocket FM?
- Be a part of a rapidly growing, mission-driven team transforming audio entertainment.
- Work on cutting-edge technologies at scale, impacting millions of users worldwide.
- Enjoy competitive compensation, a collaborative work environment, and ample growth opportunities.
If you're passionate about reliability engineering and love solving complex challenges in a high-impact environment, Pocket FM is the place for you.
-
Staff Site Reliability Engineer
1 day ago
India Zscaler Full time
About the role: Our Engineering team built the world's largest cloud security platform from the ground up, and we keep building. With more than 100 patents and big plans for enhancing services and increasing our global footprint, the team has made us and our multitenant architecture today's cloud security leader, with more than 15 million users in...
-
Site Reliability Engineer
4 weeks ago
India Tata Consultancy Services Full time
Dear Candidate, Greetings from TCS! TCS is hiring for SRE; please find the JD below. Experience range: 5+ years. Location: Bangalore, Pune, Hyderabad, Chennai. Skills required: Site Reliability Engineer. Role & Responsibilities: Collaborates with cloud platform engineers and teams to design, develop, test, and implement...
-
Site Reliability Engineer
1 month ago
India Tata Consultancy Services Full time
Dear Candidate, Greetings from TCS! TCS is hiring for SRE; please find the JD below. Experience range: 5+ years. Location: Bangalore, Pune, Hyderabad, Chennai. Skills required: Site Reliability Engineer. Role & Responsibilities: Collaborates with cloud platform engineers and teams to design, develop, test, and implement availability, reliability,...
-
Site Reliability Engineer
7 days ago
India IDEMIA Full time
We are hiring for a Site Reliability Engineer role at our Noida location. Responsibilities: Involved in deploying, managing, and operating medium- to large-scale production systems. Understanding of Linux as a runtime environment. Familiar with cloud-native concepts and virtualisation. Familiar with CI/CD concepts and tools like Jenkins, GitLab, etc. Previous...
-
Site Reliability Engineer
1 week ago
India IDEMIA Full time
We are hiring for a Site Reliability Engineer role at our Noida location. Responsibilities: Involved in deploying, managing, and operating medium- to large-scale production systems. Understanding of Linux as a runtime environment. Familiar with cloud-native concepts and virtualisation. Familiar with CI/CD concepts and tools like Jenkins, GitLab, etc. Previous...
-
Site Reliability Engineer
4 weeks ago
India PeopleLogic Full time
Job Responsibilities: Ensure the 24/7 operations and reliability of data services in our production. Collaborate with the data engineering development team to design, build, and maintain scalable, reliable, and secure data pipelines and systems. Develop and implement monitoring, alerting, and incident response strategies to proactively identify and...
-
Site Reliability Engineer
3 weeks ago
India InstaService Inc Full time
About Us: At InstaService, we are committed to delivering reliable, high-performance home services to our customers. As a fast-growing on-demand services platform, we are looking for a talented DevOps / Site Reliability Engineer (SRE) to join our dynamic team. This role is crucial to scaling and maintaining our infrastructure, ensuring our platform remains...
-
Site Reliability, Staff
6 months ago
India Synopsys Full time
49347BR - INDIA - India
Job Description and Requirements
Responsibilities:
- Collaborate with cross-functional teams and stakeholders to plan and execute infrastructure upgrades, migrations, and implementation projects. Conduct research to identify new technologies and solutions.
- Develop tools to automate administrative tasks. Optimize operation...
-
Site Reliability Engineer
2 months ago
India BCE Global Tech Full time
About the role: We are seeking a talented Site Reliability Engineer (SRE) to join our team. The ideal candidate will have a strong background in software engineering and systems administration, with a passion for building scalable and reliable systems. As an SRE, you will collaborate with development and operations teams to ensure our services are reliable,...
-
Site Reliability Engineer
4 weeks ago
India PeopleLogic Full time
Job Responsibilities: Ensure the 24/7 operations and reliability of data services in our production. Collaborate with the data engineering development team to design, build, and maintain scalable, reliable, and secure data pipelines and systems. Develop and implement monitoring, alerting, and incident response strategies to proactively identify and resolve...
-
Site Reliability Engineer
1 month ago
India Apex Systems Full time
DevOps Engineer, Bengaluru & Chennai, Remote. Looking for an immediate joiner.
- Overall 5+ years of experience as a Site Reliability Engineer/DevOps Engineer
- Bachelor's or master's degree in software engineering, computer science, or a related technical field
- Familiarity with Infrastructure as Code (e.g. Terraform & CloudFormation)
- Has a focus in...
-
Site Reliability Engineer
4 weeks ago
India Apex Systems Full time
DevOps Engineer, Bengaluru & Chennai, Remote. Looking for an immediate joiner.
- Overall 5+ years of experience as a Site Reliability Engineer/DevOps Engineer
- Bachelor's or master's degree in software engineering, computer science, or a related technical field
- Familiarity with Infrastructure as Code (e.g. Terraform & CloudFormation)
- Has...
-
Site Reliability Engineer
1 month ago
Anywhere in India/Multiple Locations Stealth Startup Full time
Key Responsibilities: At Stealth Startup, we're looking for a skilled Site Reliability Engineer to maintain and enhance the reliability, availability, and performance of our large-scale distributed systems. Your key responsibilities will include automating deployment, monitoring, and management of production systems, as well as implementing and managing CI/CD...
-
Senior Site Reliability Engineer
4 weeks ago
India HCLTech Full time
Urgent opening for a Cloud Senior Site Reliability Engineer role, pan-India location, with HCL Tech. Interested candidates, kindly share your updated resume at sagardo@hcltech.com with the subject line "Cloud Senior Site Reliability Engineer Role_ your name & preferred location". Job Description: Ability to learn SRE practices across Red Hat OpenShift, Google...
-
Site Reliability Engineer
4 weeks ago
India Tanla Platforms Limited Full time
About the Role: We are seeking a highly skilled Site Reliability Engineer to join our team at Tanla Platforms Limited. As a Site Reliability Engineer, you will be responsible for ensuring the high availability, scalability, and reliability of our platforms and applications. Key Responsibilities: Design, implement, and maintain scalable and highly available...