Cloud Reliability Engineering Specialist
5 days ago
We are a strategic technology division for FedEx, focusing on developing innovative solutions to enhance productivity and minimize expenses globally. Our mission is to provide outstanding customer experiences.
A Cloud Reliability Engineer (CRE) combines software engineering and cloud capabilities to ensure the scalability, performance, and reliability of large-scale, cloud-based applications. As applications and infrastructure become more complex and increasingly cloud-based, a more proactive and software-centric approach is needed to ensure reliability at scale.
By integrating software engineering and cloud principles, CREs bring a mindset of automation and reliability to operations. Tackling operational challenges from a software engineering perspective involves:
- Coding and automation
- Engineering principles to build resilient systems
The key responsibilities of a CRE include:
System Reliability and Availability
An efficient system is the backbone of every secure organization. However, systems sometimes become unreliable, leading to unavailability. A CRE helps mitigate this by monitoring for system issues, creating strategies to detect them, resolving them, designing systems that troubleshoot automatically, and writing and reviewing post-mortems.
- Monitor system health using alerts, tickets, logging mechanisms, and request times
- Automate system monitoring for large-scale data handling
A CRE identifies, assesses, and implements measures to eliminate potential risks that could impact system performance. This involves collaborating with development teams and other stakeholders to identify potential risks, analyzing and evaluating their impact and likelihood, implementing risk mitigation strategies, and continuously monitoring and reviewing their effectiveness.
- Collaborate with development teams to identify potential risks
- Analyze and evaluate potential impact and likelihood of occurrence
Monitoring means measuring system health. A CRE uses metrics, visualized as charts and graphs, to study historical performance trends, traces problems with system monitoring tools, and manages infrastructure at scale. Automating this work eliminates the manual collection, storage, and visualization of data.
Minimizing Emergency Response
A CRE minimizes emergency response time by resolving incidents quickly, reducing downtime, and improving the Mean Time to Respond (MTTR). This is achieved by maintaining internal tooling, such as communication platforms, bug tracking platforms, deployment strategies, monitoring solutions, error logging services, and documentation tools.
Qualifications and Experience
To be successful as a CRE, you should have a bachelor's degree in computer science or a related field and 3-5 years of experience as an SRE or DevOps engineer. Strong problem-solving skills, excellent communication, and collaboration abilities are essential for this role.
-
Cloud Reliability Engineering Specialist
4 weeks ago
Hyderabad, Telangana, India Talent500 Full time
About the Role
We are seeking an experienced Cloud Reliability Engineering Specialist to join our team at FedEx ACC. As a Cloud Reliability Engineer, you will play a critical role in ensuring the scalability, performance, and reliability of our cloud-based applications.
-
Cloud Systems Reliability Specialist
1 month ago
Hyderabad, Telangana, India FedEx ACC Full time
About FedEx ACC
We are a leading company in the logistics industry, known for our reliability and efficiency.
Salary Range
$120,000 - $180,000 per year
Job Description
A Cloud Systems Reliability Specialist is responsible for ensuring the scalability, performance, and reliability of large-scale cloud-based applications. They combine software engineering...
-
Cloud Reliability Specialist
3 weeks ago
Hyderabad, Telangana, India WS Audiology Full time
We are seeking an experienced Cloud Reliability Specialist to ensure the reliability, performance, and security of our operational backbone. The ideal candidate will have a strong background in at least one of the following fields: containers, public clouds, and cloud-native workloads.
Key responsibilities include: Designing systems and applications with a...
-
Infrastructure Reliability Specialist
3 weeks ago
Hyderabad, Telangana, India IT Full time
Job Description
As an Infrastructure Reliability Specialist at IT, you will play a critical role in ensuring the reliability, scalability, and performance of our systems. With 4-7 years of experience in Site Reliability Engineering (SRE), you will design, develop, and manage integration solutions using tools such as RabbitMQ, Postman, Apache, Kafka, or MS...
-
Reliability Engineering Specialist
1 month ago
Hyderabad, Telangana, India Oracle Full time
Job Description
We are seeking an experienced Reliability Engineering Specialist to join our team at Oracle.
About the Role
This is a key position that will play a crucial role in defining and developing software for tasks associated with the development, design, and debugging of software applications or operating systems. You will be responsible for managing...
-
Cloud Data Engineer Specialist
4 weeks ago
Hyderabad, Telangana, India Tech Mahindra Full time
Job Title: Cloud Data Engineer Specialist
We are seeking an experienced Cloud Data Engineer Specialist to join our team at Tech Mahindra. This role involves designing, building, and maintaining large-scale data processing systems on cloud platforms like Azure.
About the Role:
As a Cloud Data Engineer Specialist, you will be responsible for developing and...
-
Reliability Engineering Specialist
4 weeks ago
Hyderabad, Telangana, India FedEx ACC Full time
About FedEx ACC India:
As a strategic technology division, we develop innovative solutions for customers and team members worldwide. Our goal is to enhance productivity, minimize expenses, and update our technology infrastructure to deliver exceptional customer experiences.
A Site Reliability Engineer (SRE) combines software engineering and Cloud capabilities...
-
IT Reliability Engineering Specialist
5 days ago
Hyderabad, Telangana, India Talent500 Full time
About Talent500: Talent500 is a leading provider of innovative IT solutions, with a relentless drive for excellence and innovation. We have a unique opportunity for a skilled IT Reliability Engineer to join our team in Hyderabad, India.
We are looking for a talented engineer who can design, develop, and implement large-scale solutions in production...
-
Data Engineering Specialist
4 weeks ago
Hyderabad, Telangana, India WaferWire Cloud Technologies Full time
Job Title: Data Engineering Specialist
About WCT:
WaferWire Cloud Technologies (WCT) specializes in delivering comprehensive cloud-based solutions through Microsoft's technology stack. Our services include strategic consulting, data estate modernization, and cloud adoption strategy. We excel in solution design encompassing application, data, and AI...
-
Cloud Automation Specialist
4 days ago
Hyderabad, Telangana, India WaferWire Cloud Technologies Full time
Job Title: Cloud Automation Specialist - Data Innovation
About WaferWire Cloud Technologies:
WaferWire Cloud Technologies is a leading provider of cutting-edge cloud, data, and AI solutions. Our team excels in delivering comprehensive services, including strategic consulting, data estate modernization, and cloud adoption strategy. We specialize in solution...
-
Cloud Infrastructure Engineer
1 month ago
Hyderabad, Telangana, India CommScope Full time
Overview
At CommScope, we believe that delivering connectivity is not just about technology, but about people.
We are seeking a skilled Cloud Infrastructure Engineer to join our team in the role of Server Technology Specialist. As a key member of our infrastructure team, you will be responsible for designing, implementing, and maintaining our cloud-based...
-
Cloud-Native Reliability Engineer Leader
5 days ago
Hyderabad, Telangana, India Zyoin Full time
Job Overview
Zyoin is seeking a highly experienced Cloud-Native Reliability Engineer Leader to join our team in Hyderabad, India. The successful candidate will lead the development of scalable, secure, and efficient cloud-native applications.
About Zyoin
Zyoin is a fast-growing technology company based in Denver, Colorado, founded in 2014. We specialize in...
-
Data Engineering Expert
4 weeks ago
Hyderabad, Telangana, India Value Creed Full time
We are seeking a highly skilled Data Engineering Expert to join our team as a Cloud Solutions Specialist. The ideal candidate will have extensive experience in designing, implementing, and optimizing end-to-end data pipelines for cloud-based environments.
As a Cloud Solutions Specialist, you will work closely with business stakeholders to understand their...
-
Cloud Engineering Test Automation Specialist
5 days ago
Hyderabad, Telangana, India WaferWire Cloud Technologies Full time
About WaferWire Cloud Technologies
WaferWire Cloud Technologies (WCT) specializes in delivering comprehensive Cloud, Data and AI solutions through Microsoft's technology stack. We excel in Solution Design encompassing Application, Data, and AI Modernization, as well as Infrastructure Planning and Migrations. Our Operational Readiness services ensure seamless...
-
Cloud Data Engineering Specialist
3 weeks ago
Hyderabad, Telangana, India Vision Excel Career Solutions Full time
Cloud Data Engineering Specialist
We are seeking an experienced Cloud Data Engineer to join our team at Vision Excel Career Solutions in Hyderabad.
About the Role
The ideal candidate will possess a strong background in designing and developing efficient data pipelines using Python, ensuring reliable data ingestion, transformation, and storage with Airflow. They...
-
Cloud Reliability Expert
3 weeks ago
Hyderabad, Telangana, India WS Audiology Full time
About the Role
We are seeking an experienced Site Reliability Engineer to join our team, with a focus on monitoring, alerting, and infrastructure stability. This role primarily involves maintaining the reliability and performance of our systems hosted in Azure Cloud.
As a core member of our Site Reliability Engineering (SRE) team, you will use tools such as...
-
Cloud Software Engineering Specialist
4 weeks ago
Hyderabad, Telangana, India Pivotal Full time
We are seeking a seasoned Cloud Software Engineering Specialist to join our esteemed team at Pivotal. As a key member of our VIPSCloud PMS team, you will play a pivotal role in shaping the definition, design, vision, roadmap, and product features from beginning to end.
You will develop scalable, reliable, and extensible services using native cloud...
-
Cloud Architect
5 days ago
Hyderabad, Telangana, India Inspire Full time
About the Role
We are seeking a highly experienced Cloud Architect to lead our Site Reliability Engineering (SRE) team at Inspire. The successful candidate will be responsible for providing technical leadership and guidance in designing and implementing cloud-native architectures, ensuring the reliability and scalability of our cloud platforms.
This is an...
-
Digital Reliability Engineer
5 days ago
Hyderabad, Telangana, India Talent500 Full time
About Talent500: Talent500 is a leading provider of innovative technology solutions, with a relentless drive for excellence and innovation. Our team of experts shapes the future of travel by tackling complex challenges and pioneering cutting-edge technologies.
As a Digital Reliability Engineer, you will play a vital role in shaping the future of travel by...
-
Cloud Infrastructure Specialist
4 weeks ago
Hyderabad, Telangana, India Tata Consultancy Services Full time
About the Role
We are seeking an experienced Cloud Infrastructure Specialist to join our team at Tata Consultancy Services in Hyderabad. This is a fantastic opportunity to leverage your skills and expertise in GCP SRE Engineering to drive business growth and success.
As a key member of our team, you will be responsible for designing, deploying, and managing...