Staff Reliability Engineer

2 weeks ago


Hyderabad, Telangana, India Capgemini Full time ₹ 8,00,000 - ₹ 12,00,000 per year

We are seeking an experienced and highly motivated StaffReliability Engineer.

The Staff Reliability Engineer will have end-to-end accountability for the reliability of IT services within a defined application portfolio. A prerequisite to the role will be a build-to-manage", problem-solving and innovative mindset applied to the design, build, test, deploy, change and maintenance of services drawing from deep engineering expertise.

Key measures of success will include service stability, effective delivery and environment instrumentation, deployment quality, technical debt reduction, asset resiliency, risk/security compliance, cost efficiency, proactive and preventative maintenance mechanisms, top quartile operating norms.

The StaffReliability Engineer will actively contribute to sustained advancement of the RE practice within and beyond a given area of responsibility.

Key Responsibilities

  • Guide the use of best-in-class software engineering standards and design practices for instrumenting code/application technology stack to enable the generation of relevant metrics on overall technology health - availability, performance, quality, currency and resiliency.
  • Serve as key liaison between the architecture and software engineering teams to influence the technical strategy for the organization, keeping in mind its cross-functional impacts, integration across the organization, and architecture rationalization.
  • Function as the go-to technical leader for the applications supported, requiring depth and breadth of knowledge in technologies, applications, integration, interfaces and business domain.
  • DevSecOps Solution Responsibilities:
  • Design, build, and maintain scalable and reliable systems for production environments.
  • Automate infrastructure provisioning, CI/CD pipelines, and incident response process.
  • Identify and mitigate risks to system reliability, security, and performance.
  • Develop effective tooling, alerts, and response mechanisms to identify and address reliability risks leveraging automation to support problem prevention, detection, mitigation, and resolution.
  • Enhance the delivery flow by engineering the appropriate solutions to increase delivery speed while adhering to technology standards for sustained reliability.
  • Progressively implement preventative controls and drive increased automation and self-healing capabilities. Continue to improve cost efficiency baselines
  • Promote and implement innovative solutions.
  • IT Ops Responsibilities:
  • Ensure operational excellence. Independently drive the triaging and service restoration of all high impact incidents in order to minimize the mean time to service restoration and impact to the business. Demonstrate end-to-end ownership.
  • Partner with infrastructure teams to design and implement intelligent incident routing, enhanced monitoring/alerting capabilities and automated service restoration processes. Take proactive measures to prevent high impactful incidents.
  • Achieve and maintain the continuity of Hartford and third-party assets that support a business function. Accountable for keeping the IT application and infrastructure metadata repositories current.

Required Skills & Experience– Expert for All

  • System Thinking end-to-end - Broad understanding of enterprise architectures and complex (backend) systems (understand more than the component itself)
  • Highly collaborative, partners with peers, stakeholders with a passion about delighting customers.
  • Expert experience with Performance and Observability tools such as DynaTrace, Splunk, TrueSight, CloudWatch, CloudTrail, and related tools.
  • Strong solution architecture orientation to enable expedient troubleshooting, issue-resolution and root-cause removal in a hybrid cloud environment.
  • Experience with continuous integration and DevOps methodologies, preferred tools such as GitHub, Jenkins, Nexus, Rally, SonarQube etc..
  • Experience with cloud platforms (AW, GCP, or Azure)
  • Deep understanding of Linux systems, containers (Docker), and orchestration tools (Kubernetes)
  • Expertise with Infrastructure as Code (Terraform, CloudFormation).
  • Knowledge of complex traditional and modern enterprise architectures and systems (understand more than the component itself).
  • Strong hybrid cloud experience (private and public) across various service delivery models – IaaS, PaaS, SaaS.
  • Strong communication (verbally and written) / collaboration / negotiation skill, working in a diverse team cross business units

Preferred Qualifications

  • Understanding FinOps or cost-optimization practices in the cloud.
  • Experience with API gateways, and network-level observability.
  • Experience in regulated environments (Insurance)
  • AWS Solutions Architect certification
  • Keeps abreast with new market technologies and adept at learning and adopting new models. Promotes and applies continuous learning.


  • Hyderabad, Telangana, India Cyient Full time ₹ 6,00,000 - ₹ 12,00,000 per year

    Field data collection and reliability analysis. Knowledge on PBI, python, SQL. Expertise in Aero Engines. Hands on experience in customer support. Knowledge on Power Plant Engineering is advantage.


  • Hyderabad, Telangana, India InvoiceCloud, Inc. Full time ₹ 20,00,000 - ₹ 25,00,000 per year

    About InvoiceCloudInvoiceCloud is a fast-growing fintech company with an award-winning culture and a leading disruptor in the electronic bill presentment and payment (EBPP) space. Serving more than 3,200 customers across the utility, government, and insurance industries, InvoiceCloud's secure and innovative SaaS platform enhances the customer experience,...


  • Hyderabad, Telangana, India Warner Bros. Discovery Full time ₹ 10,00,000 - ₹ 25,00,000 per year

    Welcome to Warner Bros. Discovery… the stuff dreams are made of.Who We Are…When we say, "the stuff dreams are made of," we're not just referring to the world of wizards, dragons and superheroes, or even to the wonders of Planet Earth. Behind WBD's vast portfolio of iconic content and beloved brands, are thestorytellersbringing our characters to life,...


  • Hyderabad, Telangana, India Medtronic Full time ₹ 15,00,000 - ₹ 25,00,000 per year

    At Medtronic you can begin a life-long career of exploration and innovation, while helping champion healthcare access and equity for all. You'll lead with purpose, breaking down barriers to innovation in a more connected, compassionate world.A Day in the LifeExperienced individual contributor in Reliability Engineering, working on complex projects....


  • Hyderabad, Telangana, India Fanatics Full time ₹ 12,00,000 - ₹ 24,00,000 per year

    Job DescriptionAbout the Role:We are seeking a Staff Quality Engineer to lead our quality engineering efforts across systems that support inventory lifecycle management, product planning, and performance analytics. This role sits at the intersection of quality engineering, test automation, and data integrity, playing a key role in ensuring our applications...


  • Hyderabad, Telangana, India Synchrony Full time ₹ 8,00,000 - ₹ 12,00,000 per year

    AVP, Reliability Engineer, EIS(L10) Job Description: Role Title: AVP, Reliability Engineer, EIS(L10) COMPANY OVERVIEW: Synchrony (NYSE: SYF) is a premier consumer financial services company delivering one of the industry's most complete digitally enabled product suites. Our experience, expertise and scale encompass a broad spectrum of industries...


  • Hyderabad, Telangana, India Apple Full time ₹ 15,00,000 - ₹ 25,00,000 per year

    Imagine what you could do here. Apple is a place where extraordinary people gather to do their best work. Together we craft products and experiences people once couldn't have imagined — and now can't imagine living without. If you're motivated by the idea of making a real impact, and joining a team where we pride ourselves in being one of the most diverse...


  • Hyderabad, Telangana, India Synectics APAC Full time ₹ 20,00,000 - ₹ 25,00,000 per year

    Our Site Reliability Engineers (SREs) play a crucial role in ensuring our systems are reliable, scalable, and efficient. We are looking for an experienced SRE to join our team and help us maintain and improve our infrastructure.ResponsibilitiesMonitor and Maintain Systems: Ensure the availability, performance, and reliability of our production environment by...


  • Hyderabad, Telangana, India Apple Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    Are you meticulously organized and highly observant? Join our Information Systems and Technology group and play a vital function on one of two Apple teams: Software and Services and Corporate Functions. From Apple ID to the Apple website to our data centers around the globe, our diverse collection of engineers, designers and creators manage the massive...


  • Hyderabad, Telangana, India TurboHire Full time ₹ 15,00,000 - ₹ 28,00,000 per year

    Site Reliability Engineer (SRE)Location: Hyderabad (Hybrid)Experience: 3–5 yearsAbout the RoleWe are looking for an SRE Engineer to own reliability, deployment, and monitoringof TurboHire's cloud infrastructure. You will ensure our platform is scalable, secure,and highly available. The role balances hands-on coding, automation, and infraoperations, freeing...