Principal Engineer, Site Reliability

3 weeks ago


Lal Bahadur Nagar, India ANSR Full time

ANSR is hiring for one of its clients. About T-Mobile: T-Mobile US, Inc. (NASDAQ: TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flagship brands, T-Mobile and Metro by T-Mobile. Customers benefit from an unmatched combination of value, quality, and exceptional service experience. About TMUS Global Solutions: TMUS Global Solutions is a world-class technology powerhouse accelerating the company’s global digital transformation. With a culture built on growth, inclusivity, and global collaboration, the teams here drive innovation at scale, powered by bold thinking. TMUS India Private Limited operates as TMUS Global Solutions. About the Role: The Principal Engineer, Site Reliability (SRE) will play a critical role in ensuring the stability, scalability, and operational excellence of Accounting and Finance platforms. This role is focused on leading the operational health of these platforms, ensuring the delivery of highly reliable financial applications and data services that meet the demanding requirements of accuracy, compliance, and availability to support business operations. As a Principal SRE, you will build automation, implement monitoring, improve incident response, and champion DevOps practices that enable Finance and Accounting systems to operate with consistency and trustworthiness, while also coaching and mentoring junior SREs to ensure overall operational excellence. What You’ll Do: - Operational Oversight: Own day-to-day operations for Accounting and Finance applications and data platforms, ensuring they run smoothly and meet business expectations. - Reliability & Availability: Ensure Accounting and Finance platforms meet defined SLAs, SLOs, and SLIs for performance, reliability, and uptime. - Automation & Efficiency: Build automation for deployments, monitoring, scaling, and self-healing capabilities to reduce manual effort and operational risk. - Observability & Monitoring: Implement and maintain comprehensive monitoring, alerting, and logging for accounting applications and data pipelines (e.G., Snowflake, dbt workflows, ERP integrations). - Incident Response: Lead and participate in on-call rotations, perform root cause analysis, and drive improvements to prevent recurrence of production issues. - Operational Excellence: Establish and enforce best practices for capacity planning, performance tuning, disaster recovery, and compliance controls in financial systems. - Collaboration with Engineering & Finance: Partner with software engineers, data engineers, and Finance/Accounting teams to ensure operational needs are met from development through production. - Team Coordination: Manage workload, priorities, and escalations for operations staff and partner teams, ensuring alignment with SLAs and compliance requirements. - Security & Compliance: Ensure financial applications and data pipelines meet audit, compliance, and security requirements. - Continuous Improvement: Drive post-incident reviews, implement lessons learned, and proactively identify opportunities to improve system resilience. - Audit & Compliance Support: Ensure operational practices meet internal controls, audit requirements, and financial compliance standards. What You’ll Bring: - Bachelor’s in computer science, Engineering, Information Technology, or related field (or equivalent experience). - 12-15 years of experience in Site Reliability Engineering, DevOps, or Production Engineering, ideally supporting financial or mission-critical applications. - Strong experience with monitoring/observability tools (Datadog, Prometheus, Grafana, Splunk, or equivalent). - Hands-on expertise with CI/CD pipelines, automation frameworks, and IaC tools (Terraform, Ansible, GitHub Actions, Azure DevOps, etc.). - Familiarity with Snowflake, dbt, and financial system integrations from an operational support perspective. - Strong scripting/programming experience (Python, Bash, Go, or similar) for automation and tooling. - Proven ability to manage incident response and conduct blameless postmortems. - Experience ensuring compliance, security, and audit-readiness in enterprise applications. Must Have Skills: - SRE - SQL - Snowflake OR Databricks - DevOps OR CICD OR GitHub Actions - monitoring/observability tools (Datadog, Prometheus, Grafana, Splunk, or equivalent) - Automation Nice To Have: - Experience supporting financial applications (ERP, revenue recognition systems, accounting platforms). - Exposure to FinOps practices for optimizing cloud spend in finance-related platforms. - Familiarity with containers and orchestration (Docker, Kubernetes). - Experience building resilience into data pipelines and ensuring auditability for accounting data. - Strong communication skills to articulate operational issues and risks to both technical and non-technical stakeholders.



  • Lal Bahadur Nagar, India SID Global Solutions Full time

    Job Role: Site Reliability Engineer (SRE) – GCP Experience: 3+ years Location: Hyderabad About SIDGS: SIDGS is a premium global systems integrator and global implementation partner of Google corporation, providing Digital Solutions & Services to Fortune 500 companies. Our Digital solutions go across following domains: User Experience, CMS, API Management,...


  • Lal Bahadur Nagar, India Atyeti Inc Full time

    Job Description : - We are seeking a highly skilled and motivated Site Reliability Engineer (SRE) to join our growing team. - Bachelor’s degree in computer science, Engineering, or equivalent practical experience. - 7+ years’ experience in Site Reliability deploying and managing large-scale distributed systems successfully. - Understanding of SRE...


  • Lal Bahadur Nagar, India GSPANN Technologies, Inc Full time

    About Company : GSPANN is a global IT services and consultancy provider headquartered in Milpitas, California (U.S.A.). With five global delivery centers across the globe, GSPANN provides digital solutions that support the customer buying journeys of B2B and B2C brands worldwide. With a strong focus on innovation and client satisfaction, GSPANN delivers...


  • Lal Bahadur Nagar, India ANSR Full time

    ANSR is hiring for one of its clients. About T-Mobile: T-Mobile US, Inc. (NASDAQ: TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flagship brands, T-Mobile and Metro by T-Mobile. Customers benefit from an unmatched combination of value, quality, and...


  • Lal Bahadur Nagar, India ValueMomentum Full time

    About the Role We are seeking an experienced Site Reliability / Azure DevOps Engineer with Dynatrace Experience to join our engineering team and contribute to scalable CI/CD practices, infrastructure automation, and cloud operations. The ideal candidate will have deep expertise in Azure DevOps, Infrastructure as Code (IaC), Azure services, and modern DevOps...


  • Lal Bahadur Nagar, India ANSR Full time

    ANSR is hiring for one of its clients. About T-Mobile: T-Mobile US, Inc. (NASDAQ: TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flagship brands, T-Mobile and Metro by T-Mobile. Customers benefit from an unmatched combination of value, quality, and...


  • Lal Bahadur Nagar, India ANSR Full time

    ANSR is hiring for one of its clients. About T-Mobile: T-Mobile US, Inc. (NASDAQ: TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flagship brands, T-Mobile and Metro by T-Mobile. Customers benefit from an unmatched combination of value, quality, and...


  • Lal Bahadur Nagar, India Mulya Technologies Full time

    About the job Principal / Staff /Senior Analog/Mixed-Signal IC Design Engineer www.Omnidesigntech.Com Bangalore / Hyderabad About Omni Design Technologies Omni Design Technologies is a leading provider of high-performance, ultra-low power IP cores, from 28nm down through advanced FinFET nodes, which enable differentiated system-on-chip (SoC), in applications...


  • Lal Bahadur Nagar, India Mulya Technologies Full time

    Principal IP/RTL Design Engineer for TPU / GPU Hyderabad / Bangalore Founded by highly respected Silicon Valley veterans - with its design centers established in Santa Clara, California. / Hyderabad/ Bangalore Our pay comprehensively beats "ALL" Semiconductor product players in the Indian market. Position Overview Seeking an IP/RTL Design Engineer with 5+...


  • Lal Bahadur Nagar, India Mancer Consulting Services Full time

    About the Position: Key Responsibilities: - Shared Responsibility Models: Define and implement clear shared responsibility models, ensuring accountability across teams for infrastructure, platforms and application security and reliability. - Compliance and Policy as Code: Create roadmaps to embed our SDLC into code together with the platform teams, Maintain...