
Lead Infrastructure Reliability Engineer
4 days ago
Greetings,

We currently have an urgent opening for a Senior Reliability Engineer role with one of our projects, based out of Bangalore / Hyderabad. Please find the details below for your perusal.

Job Location: Hyderabad
Notice Period: Maximum 20 days

Job Description:
As a Lead Site Reliability Engineer (SRE), you will take a leadership role in ensuring the reliability, scalability, and performance of production systems. You will work closely with engineering, infrastructure, and leadership teams to build robust systems and processes. Your responsibilities will include designing scalable architectures, leading incident response, mentoring team members, and driving automation and observability strategies.

Key Responsibilities
- Lead the design and implementation of highly available, resilient, and scalable infrastructure.
- Define, monitor, and uphold SLIs, SLOs, and SLAs across critical services.
- Own and evolve incident management processes, including root cause analysis (RCA), blameless postmortems, and remediation planning.
- Automate infrastructure provisioning and application deployment using tools like Terraform, Ansible, and Helm.
- Architect, maintain, and enhance observability platforms, including metrics, tracing, and logging systems.
- Collaborate with development teams on performance tuning, capacity planning, and system reliability improvements.
- Design and implement disaster recovery (DR) and business continuity strategies.
- Mentor junior and mid-level SREs, fostering a culture of continuous improvement and knowledge sharing.
- Partner with security teams to enforce zero-trust principles, RBAC, IAM policies, and compliance requirements.

Required Skills & Experience
- SE III and IV level experience in Site Reliability Engineering, DevOps, or related Infrastructure Engineering roles.
- Expertise in Kubernetes and cloud platforms, especially AWS.
- Solid understanding of large-scale distributed systems.
- Proficient with Linux systems, networking, and storage internals.
- Hands-on experience with Infrastructure as Code (Terraform, Ansible).
- Familiarity with CI/CD pipelines and GitOps workflows.
- Strong knowledge of observability best practices (metrics, logs, tracing).
- Proven experience with incident response and system troubleshooting.
- Proficiency in scripting or programming, preferably Python.
- Strong understanding of modern monitoring tools and platforms.

About Sonata Software:
Sonata is a global software company, one of the fastest growing in India. It specializes in Platform Engineering and Sonata’s proprietary Platformation methodology, a framework that assists companies with their digital transformation journeys and helps them build their businesses on platforms that are Open, Connected, Intelligent, and Scalable. Sonata's platform engineering expertise is supported by its capabilities in Cloud and Data, IoT, AI and Machine Learning, Robotic Process Automation, and Cybersecurity. With its centres in India, the US, Europe, and Australia, Sonata brings Thought Leadership, Customer-centricity, and Execution Excellence towards catalysing the business transformation process for its customers around the world.

You can read more here: https://www.Sonata-software.Com/platformation

Regards,
Sathish
Talent Acquisition - Sonata Software Services
9840669681
-
Infrastructure Reliability Engineer
1 week ago
Hyderabad, India ANSR Full time
About T-Mobile: T-Mobile US, Inc. (NASDAQ: TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flagship brands, T-Mobile and Metro by T-Mobile. Customers benefit from an unmatched combination of value, quality, and exceptional service experience. About TMUS Global...
-
Lead Site Reliability Engineer
7 days ago
Hyderabad, Telangana, India EPAM Systems Full time ₹ 15,00,000 - ₹ 25,00,000 per year
We are seeking a skilled Lead Site Reliability Engineer to drive the stability, scalability, and reliability of our systems while improving efficiency through automation and best practices. This role calls for deep expertise in DevOps methodologies, Infrastructure as Code (IaC), and collaboration across teams to ensure optimal system...
-
Cloud Infrastructure Reliability Engineer
2 weeks ago
Hyderabad, India ANSR Full time
ANSR is hiring for one of its clients. About T-Mobile: T-Mobile US, Inc. (NASDAQ: TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flagship brands, T-Mobile and Metro by T-Mobile. Customers benefit from an unmatched combination of value, quality, and exceptional...
-
Lead Reliability Engineer
1 week ago
Hyderabad, India ANSR Full time
About T-Mobile: T-Mobile US, Inc. (NASDAQ: TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flagship brands, T-Mobile and Metro by T-Mobile. Customers benefit from an unmatched combination of value, quality, and exceptional service experience. About TMUS Global...
-
Lead - site reliability engineer
4 weeks ago
Hyderabad, India VXI Global Solutions Full time
We are looking for a Lead - Site Reliability Engineer with 8+ years of experience to design, implement, and manage robust observability solutions across our cloud infrastructure and applications. The ideal candidate will have hands-on experience with Prometheus, Grafana, Google Cloud Monitoring, and OpenTelemetry, along with exposure to ...
-
Lead - Site Reliability Engineer
4 weeks ago
Hyderabad, India VXI Global Solutions Full time
We are looking for a Lead - Site Reliability Engineer with 8+ years of experience to design, implement, and manage robust observability solutions across our cloud infrastructure and applications. The ideal candidate will have hands-on experience with Prometheus, Grafana, Google Cloud Monitoring, and OpenTelemetry, along with exposure to SolarWinds. You...
-
Principal Infrastructure Engineer
3 weeks ago
Hyderabad, India Arcesium Full time
Job Description: Arcesium is seeking a dynamic, multi-discipline, polyglot Software and Infrastructure Engineer and Leader with broad and deep knowledge across the spectrum of software and infrastructure to join our Infrastructure Engineering team in a lead capacity. Infrastructure's mission is to provide the foundation and supporting critical systems that...
-
Site Reliability Engineering- Lead/ Senior
2 weeks ago
Hyderabad, India Sonata Software Full time
Site Reliability Engineering - Lead/Senior
Full Time (Hybrid)
Hyderabad
***Immediate to 30 Days Joiners Only***
Required Skills & Experience:
- 7+ years of experience in Site Reliability Engineering, DevOps, or related Infrastructure Engineering roles.
- Expertise in Kubernetes and cloud platforms, especially AWS.
- Solid understanding of large-scale...
-
Site reliability engineering- lead/ senior
2 weeks ago
Hyderabad, India Sonata Software Full time
Site Reliability Engineering - Lead/Senior
Full Time (Hybrid)
Hyderabad
***Immediate to 30 Days Joiners Only***
Required Skills & Experience:
- 7+ years of experience in Site Reliability Engineering, DevOps, or related Infrastructure Engineering roles.
- Expertise in Kubernetes and cloud platforms, especially AWS.
- Solid understanding of large-scale...