Cloud Reliability Engineering Specialist

5 days ago


Hyderabad, Telangana, India FedEx ACC Full time
About FedEx ACC India

We are a strategic technology division for FedEx, focusing on developing innovative solutions to enhance productivity and minimize expenses globally. Our mission is to provide outstanding customer experiences.

A Cloud Reliability Engineer (CRE) combines software engineering and cloud capabilities to ensure the scalability, performance, and reliability of large-scale, cloud-based applications. As applications and infrastructure become complex and cloud-based, a more proactive and software-centric approach is needed to ensure reliability at scale.

By integrating software engineering and cloud principles, CREs bring a mindset of automation and reliability to operations. The preferred approach to tackle operations challenges with a software engineering perspective involves:

  • Coding and automation
  • Engineering principles to build resilient systems

The key responsibilities of a CRE include:

System Reliability and Availability

An efficient system is the backbone of every secure organization. However, sometimes, their systems become unreliable, leading to unavailability. A CRE helps mitigate this issue by monitoring system issues, creating strategies to detect them, addressing those issues, designing systems to troubleshoot automatically, and writing and reviewing post-mortems.

  • Monitor system health using alerts, tickets, logging mechanisms, and request times
  • Automate system monitoring for large-scale data handling
Mitigating Operational Risks

A CRE identifies, assesses, and implements measures to eliminate potential risks that could impact system performance. This involves collaborating with development teams and other stakeholders to identify potential risks, analyzing and evaluating their impact and likelihood, implementing risk mitigation strategies, and continuously monitoring and reviewing their effectiveness.

  • Collaborate with development teams to identify potential risks
  • Analyze and evaluate potential impact and likelihood of occurrence
Monitoring System Health

Monitoring means measuring system health. A CRE uses metrics like charts and graphs to study historical trends in terms of performance, tracing problems with system monitoring tools, and managing infrastructures at scale. This eliminates manual collection, storage, and visualization of data.

Minimizing Emergency Response

A CRE minimizes emergency response time by resolving incidents quickly, reducing downtime, and improving the Mean Time to Respond (MTTR). This is achieved by maintaining internal tooling, such as communication platforms, bug tracking platforms, deployment strategies, monitoring solutions, error logging services, and documentation tools.

Qualifications and Experience

To be successful as a CRE, you should have a bachelor's degree in computer science or a related field and 3-5 years of experience as an SRE or DevOps engineer. Strong problem-solving skills, excellent communication, and collaboration abilities are essential for this role.



  • Hyderabad, Telangana, India Talent500 Full time

    About the RoleWe are seeking an experienced Cloud Reliability Engineering Specialist to join our team at FedEx ACC. As a Cloud Reliability Engineer, you will play a critical role in ensuring the scalability, performance, and reliability of our cloud-based applications.


  • Hyderabad, Telangana, India FedEx ACC Full time

    About FedEx ACC">We are a leading company in the logistics industry, known for our reliability and efficiency.">Salary Range">$120,000 - $180,000 per year">Job Description">A Cloud Systems Reliability Specialist is responsible for ensuring the scalability, performance, and reliability of large-scale cloud-based applications. They combine software engineering...


  • Hyderabad, Telangana, India WS Audiology Full time

    We are seeking an experienced Cloud Reliability Specialist to ensure the reliability, performance, and security of our operational backbone.The ideal candidate will have a strong background in at least one of the following fields: containers, public clouds, and cloud-native workloads.Key responsibilities include:Designing systems and applications with a...


  • Hyderabad, Telangana, India IT Full time

    Job DescriptionAs an Infrastructure Reliability Specialist at IT, you will play a critical role in ensuring the reliability, scalability, and performance of our systems. With 4-7 years of experience in Site Reliability Engineering (SRE), you will design, develop, and manage integration solutions using tools such as RabbitMQ, Postman, Apache, Kafka, or MS...


  • Hyderabad, Telangana, India Oracle Full time

    Job DescriptionWe are seeking an experienced Reliability Engineering Specialist to join our team at Oracle.About the RoleThis is a key position that will play a crucial role in defining and developing software for tasks associated with the development, design, and debugging of software applications or operating systems.You will be responsible for managing...


  • Hyderabad, Telangana, India Tech Mahindra Full time

    Job Title: Cloud Data Engineer SpecialistWe are seeking an experienced Cloud Data Engineer Specialist to join our team at Tech Mahindra. This role involves designing, building, and maintaining large-scale data processing systems on cloud platforms like Azure.About the Role:As a Cloud Data Engineer Specialist, you will be responsible for developing and...


  • Hyderabad, Telangana, India FedEx ACC Full time

    About FedEx ACC India:As a strategic technology division, we develop innovative solutions for customers and team members worldwide. Our goal is to enhance productivity, minimize expenses, and update our technology infrastructure to deliver exceptional customer experiences.A Site Reliability Engineer (SRE) combines software engineering and Cloud capabilities...


  • Hyderabad, Telangana, India Talent500 Full time

    About Talent500: Talent500 is a leading provider of innovative IT solutions, with a relentless drive for excellence and innovation. We have a unique opportunity for a skilled IT Reliability Engineer to join our team in Hyderabad, India.We are looking for a talented engineer who can design, develop, and implement large-scale solutions in production...


  • Hyderabad, Telangana, India WaferWire Cloud Technologies Full time

    Job Title: Data Engineering SpecialistAbout WCT:WaferWire Cloud Technologies (WCT) specializes in delivering comprehensive cloud-based solutions through Microsoft's technology stack. Our services include strategic consulting, data estate modernization, and cloud adoption strategy. We excel in solution design encompassing application, data, and AI...


  • Hyderabad, Telangana, India WaferWire Cloud Technologies Full time

    Job Title: Cloud Automation Specialist - Data InnovationAbout WaferWire Cloud Technologies:WaferWire Cloud Technologies is a leading provider of cutting-edge cloud, data, and AI solutions. Our team excels in delivering comprehensive services, including strategic consulting, data estate modernization, and cloud adoption strategy. We specialize in solution...


  • Hyderabad, Telangana, India CommScope Full time

    OverviewAt CommScope, we believe that delivering connectivity is not just about technology, but about people.We are seeking a skilled Cloud Infrastructure Engineer to join our team in the role of Server Technology Specialist. As a key member of our infrastructure team, you will be responsible for designing, implementing, and maintaining our cloud-based...


  • Hyderabad, Telangana, India Zyoin Full time

    **Job Overview**Zyoin is seeking a highly experienced Cloud-Native Reliability Engineer Leader to join our team in Hyderabad, India. The successful candidate will lead the development of scalable, secure, and efficient cloud-native applications.About ZyoinZyoin is a fast-growing technology company based in Denver, Colorado, founded in 2014. We specialize in...


  • Hyderabad, Telangana, India Value Creed Full time

    We are seeking a highly skilled Data Engineering Expert to join our team as a Cloud Solutions Specialist. The ideal candidate will have extensive experience in designing, implementing, and optimizing end-to-end data pipelines for cloud-based environments.As a Cloud Solutions Specialist, you will work closely with business stakeholders to understand their...


  • Hyderabad, Telangana, India WaferWire Cloud Technologies Full time

    About WaferWire Cloud TechnologiesWaferWire Cloud Technologies (WCT) specializes in delivering comprehensive Cloud, Data and AI solutions through Microsoft's technology stack.We excel in Solution Design encompassing Application, Data, and AI Modernization, as well as Infrastructure Planning and Migrations.Our Operational Readiness services ensure seamless...


  • Hyderabad, Telangana, India Vision Excel Career Solutions Full time

    Cloud Data Engineering SpecialistWe are seeking an experienced Cloud Data Engineer to join our team at Vision Excel Career Solutions in Hyderabad.About the RoleThe ideal candidate will possess a strong background in designing and developing efficient data pipelines using Python, ensuring reliable data ingestion, transformation, and storage with Airflow. They...


  • Hyderabad, Telangana, India WS Audiology Full time

    About the RoleWe are seeking an experienced Site Reliability Engineer to join our team, with a focus on monitoring, alerting, and infrastructure stability. This role primarily involves maintaining the reliability and performance of our systems hosted in Azure Cloud.As a core member of our Site Reliability Engineering (SRE) team, you will use tools such as...


  • Hyderabad, Telangana, India Pivotal Full time

    We are seeking a seasoned Cloud Software Engineering Specialist to join our esteemed team at Pivotal. As a key member of our VIPSCloud PMS team, you will play a pivotal role in shaping the definition, design, vision, roadmap, and product features from beginning to end.You will develop scalable, reliable, and extensible services using native cloud...

  • Cloud Architect

    5 days ago


    Hyderabad, Telangana, India Inspire Full time

    About the RoleWe are seeking a highly experienced Cloud Architect to lead our Site Reliability Engineering (SRE) team at Inspire. The successful candidate will be responsible for providing technical leadership and guidance in designing and implementing cloud-native architectures, ensuring the reliability and scalability of our cloud platforms.This is an...


  • Hyderabad, Telangana, India Talent500 Full time

    About Talent500: Talent500 is a leading provider of innovative technology solutions, with a relentless drive for excellence and innovation. Our team of experts shapes the future of travel by tackling complex challenges and pioneering cutting-edge technologies.As a Digital Reliability Engineer, you will play a vital role in shaping the future of travel by...


  • Hyderabad, Telangana, India Tata Consultancy Services Full time

    About the RoleWe are seeking an experienced Cloud Infrastructure Specialist to join our team at Tata Consultancy Services in Hyderabad. This is a fantastic opportunity to leverage your skills and expertise in GCP SRE Engineering to drive business growth and success.As a key member of our team, you will be responsible for designing, deploying, and managing...