Principal Engineer, Site Reliability T500-20232

15 hours ago


Bengaluru, Karnataka, India ANSR Full time

ANSR is hiring for one of its client:

About T-Mobile:

T-Mobile US, Inc. (NASDAQ: TMUS), headquartered in Bellevue, Washington, is America's supercharged Un-carrier, connecting millions through its strong nationwide network and flagship brands, T-Mobile and Metro by T-Mobile. Customers benefit from an unmatched combination of value, quality, and exceptional service experience.

About TMUS Global Solutions:

TMUS Global Solutions is a world-class technology powerhouse accelerating the company's global digital transformation. With a culture built on growth, inclusivity, and global collaboration, the teams here drive innovation at scale, powered by bold thinking.

About the Role:

The Principal Engineer, Site Reliability (SRE) will play a critical role in ensuring the stability, scalability, and operational excellence of Accounting and Finance platforms. This role is focused on leading the operational health of these platforms, ensuring the delivery of highly reliable financial applications and data services that meet the demanding requirements of accuracy, compliance, and availability to support business operations.

As a Principal SRE, you will build automation, implement monitoring, improve incident response, and champion DevOps practices that enable Finance and Accounting systems to operate with consistency and trustworthiness, while also coaching and mentoring junior SREs to ensure overall operational excellence.

What You'll Do

  • Operational Oversight: Own day-to-day operations for Accounting and Finance applications and data platforms, ensuring they run smoothly and meet business expectations.
  • Reliability & Availability: Ensure Accounting and Finance platforms meet defined SLAs, SLOs, and SLIs for performance, reliability, and uptime.
  • Automation & Efficiency: Build automation for deployments, monitoring, scaling, and self-healing capabilities to reduce manual effort and operational risk.
  • Observability & Monitoring: Implement and maintain comprehensive monitoring, alerting, and logging for accounting applications and data pipelines (e.g., Snowflake, dbt workflows, ERP integrations).
  • Incident Response: Lead and participate in on-call rotations, perform root cause analysis, and drive improvements to prevent recurrence of production issues.
  • Operational Excellence: Establish and enforce best practices for capacity planning, performance tuning, disaster recovery, and compliance controls in financial systems.
  • Collaboration with Engineering & Finance: Partner with software engineers, data engineers, and Finance/Accounting teams to ensure operational needs are met from development through production.
  • Team Coordination: Manage workload, priorities, and escalations for operations staff and partner teams, ensuring alignment with SLAs and compliance requirements.
  • Security & Compliance: Ensure financial applications and data pipelines meet audit, compliance, and security requirements.
  • Continuous Improvement: Drive post-incident reviews, implement lessons learned, and proactively identify opportunities to improve system resilience.
  • Audit & Compliance Support: Ensure operational practices meet internal controls, audit requirements, and financial compliance standards.

What You'll Bring

  • Bachelor's in computer science, Engineering, Information Technology, or related field (or equivalent experience).
  • 12-15 years of experience in Site Reliability Engineering, DevOps, or Production Engineering, ideally supporting financial or mission-critical applications.
  • Strong experience with monitoring/observability tools (Datadog, Prometheus, Grafana, Splunk, or equivalent).
  • Hands-on expertise with CI/CD pipelines, automation frameworks, and IaC tools (Terraform, Ansible, GitHub Actions, Azure DevOps, etc.).
  • Familiarity with Snowflake, dbt, and financial system integrations from an operational support perspective.
  • Strong scripting/programming experience (Python, Bash, Go, or similar) for automation and tooling.
  • Proven ability to manage incident response and conduct blameless postmortems.
  • Experience ensuring compliance, security, and audit-readiness in enterprise applications.

Must Have Skills:

  • SRE
  • SQL
  • Snowflake OR Databricks
  • DevOps OR CICD OR GitHub Actions
  • monitoring/observability tools (Datadog, Prometheus, Grafana, Splunk, or equivalent)
  • Automation

Nice To Have:

  • Experience supporting financial applications (ERP, revenue recognition systems, accounting platforms).
  • Exposure to FinOps practices for optimizing cloud spend in finance-related platforms.
  • Familiarity with containers and orchestration (Docker, Kubernetes).
  • Experience building resilience into data pipelines and ensuring auditability for accounting data.
  • Strong communication skills to articulate operational issues and risks to both technical and non-technical stakeholders.


  • Bengaluru, Karnataka, India Akamai Full time

    Job Category Site Reliability Would you like to lead modernization initiatives while building a public cloud platform from scratch Would you like to own critical services in a new public cloud platform Join our IaaS Site Reliability Engineering SRE team We design develop and operate infrastructure and services that power the backbone of our...


  • Bengaluru, Karnataka, India Collabera Full time

    Job Description As a Principal/Chief Site Reliability Engineer , you will play a critical role in designing, developing, and maintaining scalable and highly reliable systems. You'll work closely with development teams to improve system reliability, monitor critical applications, and design fail-proof infrastructure. Responsibilities Design and implement...


  • Bengaluru, Karnataka, India Sandisk Full time ₹ 15,00,000 - ₹ 20,00,000 per year

    In this role you will have the opportunity to Work as Principal Engineer, Quality Assurance accountable for overall product quality of the NPI releases. The Principal Engineer provides independent oversight of the design input process, design V&V activities, design transfer and product realization, and performance in the field to ensure that all design...


  • Bengaluru, Karnataka, India NIKE Full time ₹ 1,04,000 - ₹ 1,30,878 per year

    PRINCIPAL SITE RELIABILITY ENGINEERIndia Technology CenterWHO YOU WILL WORK WITHThe Principal Site Reliability Engineer will work alongside a talented team of Site Reliability Engineers focused on delivering reliabile and observable software used by millions of athletes* around the world. You will be a part of the Resilience Engineering organization which...


  • Bengaluru, Karnataka, India ANSR Full time

    ANSR is hiring for one of its client:About T-Mobile:T-Mobile US, Inc. (NASDAQ: TMUS), headquartered in Bellevue, Washington, is America's supercharged Un-carrier, connecting millions through its strong nationwide network and flagship brands, T-Mobile and Metro by T-Mobile. Customers benefit from an unmatched combination of value, quality, and exceptional...


  • Bengaluru, Karnataka, India Programming Full time ₹ 1,04,000 - ₹ 1,30,878 per year

    Role - Site Reliability Engineering.Location - BengaluruYears of Expereince - 4+ YearsProfessional & Technical Skills:Must To Have Skills: Proficiency in Site Reliability Engineering.Good To Have Skills: Experience with cloud service providers such as AWS, Azure, or Google Cloud.Strong understanding of CI/CD tools and practices.Experience with container...


  • Bengaluru, Karnataka, India Enterprise Minds, Inc Full time

    We're Hiring | Site Reliability Engineer | 8-10 years


  • Bengaluru, Karnataka, India FOSS United Full time ₹ 1,04,000 - ₹ 1,30,878 per year

    All JobsSite Reliability Engineer at ZEISS IndiaSite Reliability EngineerApplyPosted on September 11, 2025ZEISS IndiaKadubeesanahalli, BengaluruFull TImeJob DescriptionZEISS in IndiaZEISS in India is headquartered in Bengaluru and present in the fields of Industrial Quality Solutions, Research Microscopy Solutions, Medical Technology, Vision Care and Sports...


  • Bengaluru, Karnataka, India WhiteLotus Talent Partners Full time

    We are looking for a L0 and L1 Site Reliability Engineer (SRE) Support to join our Krutrim Cloud Site Reliability operations team and ensure the smooth functioning of our cloud infrastructure powered by OpenStack and Kubernetes . In this role, you will focus on monitoring , basic troubleshooting , and incident response , helping to maintain high...


  • Bengaluru, Karnataka, India Randstad Full time

    Role: Site Reliability Engineer SummaryThe Network Engineer 2 provides technical design, planning, operation, maintenance, and advanced troubleshooting of the Bread Financials' network infrastructure. This position ensures continuity and alignment of the network administration/engineering direction. This position supports Bread Financials' strategies and...