Site Reliability Engineering Analyst

2 months ago


Hyderabad, India FedEx ACC Full time
A Site Reliability Engineer (SRE) is an advanced DevOps role that combines software engineering and cloud capabilities to ensure the scalability, performance, and reliability of large-scale, cloud-based applications.
As applications and infrastructure have become more complex and cloud-based, a more proactive and software-centric approach is needed to ensure reliability at scale.
By combining software engineering and cloud principles, SREs bring a mindset of automation and reliability to operations. The preferred approach is to tackle operational challenges from a software engineering perspective, leveraging:
Coding
Automation
Engineering principles
By doing so, SREs build resilient, self-healing systems that can scale seamlessly.
So how do we do this? Here is what we expect SREs to help the IT and Engineering teams mature:
Detect issues.
Automatically handle failures.
Prepare disaster recovery plans.
Keep the system up and reliable.
Mitigate broken systems and prevent them from causing future disruptions.
Responsibilities:
An SRE bridges the gap between traditional software engineering and operations to create highly scalable and fault-tolerant systems. As a result, SREs ensure the reliable and efficient operation of an organization's systems and services.
Here’s an in-depth look into the core responsibilities of site reliability engineers:
Ensure system reliability and availability:
Efficient systems are the backbone of every secure and breach-free organization. Organizations continuously update their applications to provide advanced features to users.
But sometimes these updates make systems unreliable, which results in unavailability. This is where site reliability engineers help.
Here's how SREs ensure systems are reliable:
Monitor system issues.
Create strategies to detect issues.
Address those issues.
Design systems to troubleshoot automatically.
Write and review post-mortems.
Mitigate operational risks:
SREs identify, assess, and implement measures to eliminate potential risks that could impact the performance of systems and services.
Here is how SREs do it:
Collaborate with development teams and other stakeholders to identify potential risks.
Once risks are identified, analyze and evaluate their potential impact and likelihood of occurrence.
Based on the risk assessment, implement appropriate mitigation strategies.
Once done, continuously monitor and review the effectiveness of those strategies.
By doing so, SREs maintain system reliability and ensure a positive user experience.
Monitor system health:
Monitoring means measuring a system's health. An SRE uses alerts, tickets, logging mechanisms, and request times to monitor a system's health. This keeps the system stable and minimizes user disruption. If a bug occurs, the SRE responds immediately to resolve it.
However, doing all of this manually is expensive and time-consuming. So, SREs automate this process for systems that handle large amounts of data. Here is how they do it:
Study historical performance trends using visualizations such as charts and graphs.
Trace problems with system monitoring tools.
Monitor the log files to manage infrastructures at scale.
Doing so eliminates manual collection, storage, and visualization of the data.
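As a rough illustration of automated health monitoring based on request times, the sketch below flags the points where a rolling average of request latency crosses a threshold. The function name, the 500 ms threshold, the window size, and the sample data are all hypothetical choices made for this example; in practice this kind of check would usually be configured in a monitoring tool rather than hand-written.

```python
from collections import deque


def latency_alerts(request_times_ms, threshold_ms=500.0, window=3):
    """Return (index, rolling_average) pairs where the rolling average
    of the last `window` request times exceeds `threshold_ms`."""
    recent = deque(maxlen=window)  # keeps only the most recent samples
    alerts = []
    for i, t in enumerate(request_times_ms):
        recent.append(t)
        if len(recent) == window:
            avg = sum(recent) / window
            if avg > threshold_ms:
                alerts.append((i, avg))
    return alerts


# Hypothetical request times in milliseconds, for illustration only.
samples = [120, 130, 110, 900, 950, 1000, 140, 125]
alerts = latency_alerts(samples)
for index, avg in alerts:
    print(f"alert at sample {index}: rolling average {avg:.0f} ms")
```

Averaging over a window rather than alerting on single samples is one simple way to avoid paging on a lone slow request.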
Minimize emergency response:
Emergency response time is the time site reliability engineers take to respond to problems. Averaged across incidents, it is known as the Mean Time to Respond (MTTR): the average time an SRE takes to fix an incident after it happens.
Minimizing the MTTR is necessary to reduce downtime and keep systems reliable. As an SRE, you can improve this metric by resolving incidents quickly.
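To make the metric concrete, here is a minimal sketch that computes MTTR as the average time between when each incident happened and when it was fixed. The function name, the record layout (pairs of timestamps), and the sample incidents are assumptions made for this example, not data from any real system.

```python
from datetime import datetime, timedelta


def mean_time_to_respond(incidents):
    """Average duration between incident occurrence and fix.

    `incidents` is a list of (occurred_at, fixed_at) datetime pairs.
    """
    if not incidents:
        raise ValueError("need at least one incident")
    total = sum((fixed - occurred for occurred, fixed in incidents), timedelta())
    return total / len(incidents)


# Hypothetical incident log, for illustration only.
incidents = [
    (datetime(2024, 1, 5, 10, 0), datetime(2024, 1, 5, 10, 30)),   # 30 min
    (datetime(2024, 1, 9, 14, 0), datetime(2024, 1, 9, 15, 30)),   # 90 min
    (datetime(2024, 2, 2, 8, 15), datetime(2024, 2, 2, 8, 45)),    # 30 min
]

mttr = mean_time_to_respond(incidents)
print(f"MTTR: {mttr.total_seconds() / 60:.0f} minutes")  # MTTR: 50 minutes
```

Tracking this value over time shows whether incident handling is actually getting faster.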
Maintain internal tooling:
Site reliability engineers maintain internal tools to run complex operations smoothly. These tools help them track severe bugs, maintain CI/CD pipelines, and communicate with other teams.
Some of the most widely used internal tools are:
Communication platforms like MS Teams, ServiceNow – ePDSM.
Bug tracking platforms such as JIRA, Digital Agility, or HP ALM.
Deployment tools such as GitHub Actions.
Monitoring solutions like Splunk, Grafana.
Error logging services such as Kibana, ELK Stack.
Documentation tools such as MS SharePoint.
Continuous improvement:
Site reliability engineers aim to make systems better every day. To that end, they collaborate with QA, software engineering, and security teams to keep everyone on the same page.
Qualifications:
Bachelor’s degree in Computer Science, Engineering, or a related field.
3 to 5 years of experience as an SRE, DevOps Engineer, or Ops Engineer.