Engineer, Site Reliability T500-20504

1 week ago

Hyderabad, Telangana, India ANSR Full time ₹ 10,00,000 - ₹ 25,00,000 per year

ANSR is hiring for one of its clients.

About T-Mobile:

T-Mobile US, Inc. (NASDAQ: TMUS), headquartered in Bellevue, Washington, is America's supercharged Un-carrier, connecting millions through its strong nationwide network and flagship brands, T-Mobile and Metro by T-Mobile. Customers benefit from an unmatched combination of value, quality, and exceptional service experience.

About TMUS Global Solutions:

TMUS Global Solutions is a world-class technology powerhouse accelerating the company's global digital transformation. With a culture built on growth, inclusivity, and global collaboration, the teams here drive innovation at scale, powered by bold thinking.

TMUS India Private Limited is a subsidiary of T-Mobile US, Inc. and operates as TMUS Global Solutions.

About the Role:

As a Site Reliability Engineer (SRE), you will be a key member of the CFL Platform Engineering and Operations team you will be responsible for building and maintaining large-scale, distributed systems that are observable, scalable, and resilient. This role sits at the intersection of software engineering and infrastructure operations, ensuring high availability and performance of production systems through automation, monitoring, and proactive engineering. You'll work closely with development, DevOps, and cloud platform teams to improve deployment strategies, incident response, and system health insights. This is a hands-on role for engineers who are passionate about operational excellence, reducing toil, and improving system reliability through code.

What You Will Do:

Ensure high availability and performance of production platforms through monitoring, alerting, and incident management
Design and implement resiliency patterns such as circuit breakers, failovers, retries, and health checks
Develop automation to reduce manual operational work and improve system efficiency
Support CI/CD workflows and infrastructure automation using tools like Terraform and Helm
Collaborate with developers to enhance service deployment and rollback mechanisms
Build and maintain observability tooling including dashboards, logs, and metrics
Analyze performance data and use it to guide optimizations and issue detection
Participate in on-call rotations, incident triage, and post-incident analysis
Write and maintain operational documentation, including runbooks and playbooks
Support development teams in achieving service-level objectives (SLOs) and operational readiness

What You Will Bring:

Bachelor's degree in Computer Science, Engineering, or a related technical field
2-5 years of experience in SRE, infrastructure, DevOps, or related engineering roles
Proficiency in scripting or programming (Python, Go, or Bash preferred)
Strong experience with Linux systems and cloud environments (Azure preferred; AWS/GCP also relevant)
Hands-on experience with Kubernetes and containerized services
Familiarity with observability tools such as Prometheus, Grafana, Splunk, or OpenTelemetry
Exposure to incident response frameworks, postmortems, and error budgets
Understanding of core SRE concepts: SLOs, SLIs, and service reliability metrics
Experience with CI/CD tools (e.g., GitLab CI/CD, Jenkins, Spinnaker)
Working knowledge of infrastructure tools such as HAProxy, RabbitMQ, or similar
Strong analytical and troubleshooting skills for distributed systems
Clear communication skills and ability to work cross-functionally
A continuous improvement mindset focused on reducing operational toil and enhancing developer experience

Must Have Skills:

Application & Microservice: Java, Spring boot, API & Service Design
Any CI/CD Tools : Gitlab Pipeline/Test Automation/GitHub Actions/ Jenkins /Circle CI
App Platform: Docker & Containers (Kubernetes)
Any Databases : SQL & NOSQL (Cassandra/Oracle/Snowflake/MongoDB)
Any Messaging: Kafka, Rabbit MQ
Any Observability/Monitoring: Splunk/ Grafana/ Open Telemetry /ELK Stack/ Datadog/ New Relic/ Prometheus)
Incident/Change/Problem Management

Nice To Have:

Define SLIs/SLOs

Engineer, Site Reliability T500-20503

2 days ago

Hyderabad, Telangana, India ANSR Full time ₹ 15,00,000 - ₹ 25,00,000 per year

ANSR is hiring for one of its clients.About T-Mobile:T-Mobile US, Inc. (NASDAQ: TMUS), headquartered in Bellevue, Washington, is America's supercharged Un-carrier, connecting millions through its strong nationwide network and flagship brands, T-Mobile and Metro by T-Mobile. Customers benefit from an unmatched combination of value, quality, and exceptional...
Site Reliability Engineer

2 days ago

Hyderabad, Telangana, India Talent Worx Full time ₹ 15,00,000 - ₹ 25,00,000 per year

Site Reliability Engineer (SRE)At Talent Worx, we are looking for a dedicated Site Reliability Engineer (SRE) to join our team. This role involves maintaining high availability and reliability of our services through the application of software engineering practices and systems administration skills. The ideal candidate will bridge the gap between...
Engineer, Site Reliability T500-20169

2 weeks ago

Hyderabad, Telangana, India ANSR Full time

ANSR is hiring for one of its client: About T-Mobile: T-Mobile US, Inc. (NASDAQ: TMUS), headquartered in Bellevue, Washington, is America's supercharged Un-carrier, connecting millions through its strong nationwide network and flagship brands, T-Mobile and Metro by T-Mobile. Customers benefit from an unmatched combination of value, quality, and exceptional...
Engineer, Site Reliability T500-20169

2 weeks ago

Hyderabad, Telangana, India ANSR Full time

ANSR is hiring for one of its client:About T-Mobile:T-Mobile US, Inc. (NASDAQ: TMUS), headquartered in Bellevue, Washington, is America's supercharged Un-carrier, connecting millions through its strong nationwide network and flagship brands, T-Mobile and Metro by T-Mobile. Customers benefit from an unmatched combination of value, quality, and exceptional...
Site Reliability Engineer

2 days ago

Hyderabad, Telangana, India Apple Full time ₹ 15,00,000 - ₹ 25,00,000 per year

Imagine what you could do here. Apple is a place where extraordinary people gather to do their best work. Together we craft products and experiences people once couldn't have imagined — and now can't imagine living without. If you're motivated by the idea of making a real impact, and joining a team where we pride ourselves in being one of the most diverse...
Site Reliability Engineer

3 weeks ago

Hyderabad, Telangana, India IntraEdge Full time

Site Reliability EngineerExperience: 7+ YearsLocation: HyderabadHybrid 4-day office and 1 Day remoteSkills for Principal:- Strong leadership and people management skills.- Exceptional technical proficiency in Pearson's technology stack.- Advanced project management capabilities.- Excellent communication and collaboration skills.- Adept at risk assessment and...
Engineer - Site Reliability - FPT T500-20211

2 weeks ago

Hyderabad, Telangana, India Talent500 Full time

Talent500 is hiring for one of its clients. About Talent500: Talent500 is the go-to premium destination for the best global job opportunities at Global Capability Centres or GCCs in India. We believe in opportunities favoring the bold and thus, we help the best tech and non-tech talent find their dream jobs at renowned companies that leads to a...
Site Reliability Engineer

3 weeks ago

Hyderabad, Telangana, India ServiceNow Full time

Site Reliability Engineer (SRE)Experience : 6+ YearsAbout the Role : We are seeking a seasoned SRE to ensure the reliability, availability, and performance of our critical services. You will combine software engineering with systems administration to create scalable and highly reliable software systems.Responsibilities : - Design, build, and maintain...
Site Reliability Engineer

3 weeks ago

Hyderabad, Telangana, India IntraEdge Full time

Site Reliability Engineer Experience: 7+ Years Location: Hyderabad Hybrid 4-day office and 1 Day remote Skills for Principal: Strong leadership and people management skills. Exceptional technical proficiency in Pearson's technology stack. Advanced project management capabilities. Excellent communication and collaboration skills. Adept at risk assessment...
Site Reliability Engineer

2 weeks ago

Hyderabad, Telangana, India IntraEdge Full time

Site Reliability EngineerExperience: 7+ YearsLocation: HyderabadHybrid 4-day office and 1 Day remoteSkills for Principal:Strong leadership and people management skills.Exceptional technical proficiency in Pearson's technology stack.Advanced project management capabilities.Excellent communication and collaboration skills.Adept at risk assessment and crisis...

Americas

Europe

Asia / Oceania

Africa

Engineer, Site Reliability T500-20504