Staff Cloud Reliability Engineer
6 days ago
About the Role
We’re seeking an experienced Cloud Site Reliability Engineer to join our team at ModMed. As a Staff Cloud Reliability Engineer, you will play a crucial role in ensuring the reliability, performance, and scalability of our cloud infrastructure and services.
Your Responsibilities
- Design and manage secure, scalable cloud infrastructure and services, focusing on automation, reliability, and proactive cost management.
- Implement and refine observability and monitoring solutions using DataDog, ensuring proactive issue identification and efficient resource utilization.
- Lead CI/CD pipeline development, maintenance, and optimization with Jenkins, integrating AWS services to enhance development workflows and infrastructure automation.
- Drive the containerization and orchestration of applications using Kubernetes, enhancing scalability, deployment efficiency, and cost-effectiveness.
- Monitor application and infrastructure performance in AWS, applying tuning and optimizations to ensure optimal resource utilization and user experience while managing costs.
- Design and manage disaster recovery and backup strategies on AWS, prioritizing data integrity, system availability, and cost efficiency.
- Provide expert troubleshooting and problem-solving across various platforms and applications within AWS, aiming for minimal disruption and quick resolution.
- Ensure strict adherence to AWS security standards and compliance with data protection regulations, with a keen eye on cost implications.
- Keep abreast of new cloud technologies and trends, recommending and implementing improvements for competitive advantage and cost savings.
- Mentor and support junior team members, fostering a culture of learning, collaboration, and cost-consciousness.
- Work closely with cross-functional teams to understand requirements and deliver AWS-based solutions that meet business objectives efficiently and cost-effectively.
About You
- Bachelor's degree in Computer Science, Information Technology, or related field, or equivalent experience.
- A minimum of 8-10 years of experience in Site Reliability Engineering, Cloud Engineering, or a similar role, with a demonstrated track record of problem-solving in complex, cloud-based environments.
- Strong expertise in managing cloud environments (preferably in AWS), with hands-on experience in observability platforms such as DataDog.
- Proficiency in automation and scripting languages (e.g., Python, Bash) and infrastructure as code (IaC) tools (e.g., Terraform, Ansible).
- Extensive experience with CI/CD tools, notably Jenkins, and familiarity with containerization and orchestration technologies like Kubernetes.
- Solid understanding of networking, cloud security best practices, performance optimization, and cost management strategies.
- Demonstrated commitment to implementing industry-standard site reliability principles and a proactive approach to cost management in daily operations.
- Proven leadership skills and the ability to mentor junior team members, guide teams through complex operational challenges, and foster a culture of continuous improvement.
- Excellent verbal and written communication skills, with the ability to work effectively in a team environment and communicate complex technical concepts to a non-technical audience.
What We Offer
At ModMed, we believe it’s essential to provide a competitive benefits package that meets the diverse needs of our growing workforce. Eligible Modernizers can enroll in a wide range of benefits, including:
- Comprehensive medical, dental, and vision benefits, including a company Health Savings Account contribution.
- 401(k): ModMed provides a matching contribution each payday of 50% of your contribution deferred on up to 6% of your compensation. After one year of employment with ModMed, 100% of any matching contribution you receive is yours to keep.
- Generous Paid Time Off and Paid Parental Leave programs.
- Company paid Life and Disability benefits, Flexible Spending Account, and Employee Assistance Programs.
- Company-sponsored Business Resource & Special Interest Groups that provide engaged and supportive communities within ModMed.
- Professional development opportunities, including tuition reimbursement programs and unlimited access to LinkedIn Learning.
- Global presence and in-person collaboration opportunities; dog-friendly HQ (US), Hybrid office-based roles and remote availability.
- Weekly catered breakfast and lunch, treadmill workstations, Zen, and wellness rooms within our BRIC headquarters.
About ModMed
ModMed is a growing cloud-based healthcare technology company dedicated to modernizing medical practices and improving patient outcomes. We're passionate about building innovative solutions that empower healthcare professionals and patients alike. Join our team and be part of our mission to revolutionize the healthcare industry.
-
Senior Cloud Reliability Engineer
3 weeks ago
Hyderabad, Telangana, India WaferWire Cloud Technologies Full timeJob Title: Senior Cloud Reliability Engineer (Azure Service Fabric)Job Location: RemoteWorksite: Onsite [100%]About WCT:WaferWire Cloud Technologies specializes in delivering comprehensive Cloud, Data and AI solutions through Microsoft's technology stack. Our services include Strategic Consulting, Data/AI Estate Modernization, and Cloud Adoption Strategy. We...
-
Cloud Reliability Engineer
2 weeks ago
Hyderabad, Telangana, India UnitedHealth Group Full timeCloud Reliability EngineerAt UnitedHealth Group, we're committed to delivering high-quality healthcare services to millions of people worldwide. As a Cloud Reliability Engineer, you'll play a crucial role in ensuring the performance, security, and reliability of our cloud infrastructure.Collaborate with cross-functional teams to design, implement, and...
-
Cloud Reliability Engineer
4 weeks ago
Hyderabad, Telangana, India ModMed Full timeAbout ModMedModMed is a leading healthcare technology company that is united in its mission to make a positive impact on healthcare. With a strong presence in South Florida and a global workforce, we are committed to delivering innovative solutions that improve patient outcomes and medical practice success.Job SummaryWe are seeking a highly skilled Staff...
-
Cloud Reliability Engineer
2 weeks ago
Hyderabad, Telangana, India Microsoft Full timeAbout the RoleWe are looking for a skilled Cloud Reliability Engineer to join our team at Microsoft Azure. As a Cloud Reliability Engineer, you will be responsible for designing, developing, and operating reliable services on the Azure platform.ResponsibilitiesDevelop and deploy reliable services on the Azure platform.Collaborate with cross-functional teams...
-
Cloud Reliability Engineer
2 weeks ago
Hyderabad, Telangana, India Microsoft Full timeAbout the RoleWe are seeking a skilled Cloud Reliability Engineer to join our Azure Customer Experience team. As a key member of our team, you will be responsible for improving customer experience on Azure, diagnosing and troubleshooting mission-critical customer applications built on the Microsoft Azure platform.Key ResponsibilitiesImprove customer...
-
Senior Cloud Reliability Engineer
2 weeks ago
Hyderabad, Telangana, India UnitedHealth Group Full timeAt UnitedHealth Group, we are committed to helping people live healthier lives and making the health system work better for everyone. As a Senior Cloud Reliability Engineer, you will play a critical role in ensuring the performance and reliability of our cloud-based systems. This is an exciting opportunity to join our team and contribute to the development...
-
Cloud Architect
4 weeks ago
Hyderabad, Telangana, India HuntingCube Recruitment Solutions Full timeSenior Staff EngineerJob Summary:HuntingCube Recruitment Solutions is seeking a highly skilled Senior Staff Engineer to lead the design, development, and maintenance of cloud-based platforms, data pipelines, and infrastructure. The ideal candidate will have a strong background in cloud computing, data engineering, and technical leadership.Key...
-
Staff Engineer
4 weeks ago
Hyderabad, Telangana, India Experian Full timeJob Title: Staff EngineerWe are seeking a highly skilled Staff Engineer to join our Platform Engineering team at Experian. As a Staff Engineer, you will play a key role in designing, building, and maintaining our cloud platform, ensuring it is scalable, secure, and reliable.Key Responsibilities:Design and implement cloud-based infrastructure and services to...
-
Cloud and Site Reliability Engineer
3 weeks ago
Hyderabad, Telangana, India Experian Full timeJob Title: Cloud and Site Reliability EngineerJob Summary:We are seeking a highly skilled Cloud and Site Reliability Engineer to join our team at Experian. As a Cloud and Site Reliability Engineer, you will be responsible for designing, implementing, and maintaining our cloud infrastructure, ensuring high availability, scalability, and security.Key...
-
Cloud Engineer
4 weeks ago
Hyderabad, Telangana, India ModMed Full timeAbout ModMedModMed is a leading healthcare technology company dedicated to modernizing medicine through innovative software solutions. Our mission is to empower healthcare professionals to deliver exceptional patient care by providing a comprehensive cloud-based platform that streamlines clinical workflows and improves outcomes.Job SummaryWe are seeking a...
-
Cloud Reliability Engineer
6 days ago
Hyderabad, Telangana, India Microsoft Full timeAzure OpportunitiesAt Microsoft, we are passionate about delivering exceptional customer experiences and advancing our cloud-first strategy. We are seeking a talented Software Engineer to join our Azure Customer Experience (CXP) and Customer Reliability Engineering (CRE) Team.About the RoleWe are looking for a skilled engineer with a customer-focused mindset...
-
Cloud-Native Reliability Engineer
1 week ago
Hyderabad, Telangana, India Splunk Inc Full timeReliability Engineer RoleSplunk Inc. is seeking a skilled Cloud-Native Reliability Engineer to join our team. As a key member of our infrastructure team, you will be responsible for designing, building, and operating large-scale cloud-native microservices platforms. Your expertise in cloud-native technologies, such as Kubernetes and serverless computing,...
-
Cloud Site Reliability Engineer
2 weeks ago
Hyderabad, Telangana, India Pythian Full timeJob SummaryPythian is seeking a highly skilled Cloud Site Reliability Engineer to join our team. As a Cloud Site Reliability Engineer, you will be responsible for designing, implementing, and maintaining scalable and highly available cloud infrastructure for our clients. Your primary focus will be on ensuring the reliability, security, and performance of our...
-
Senior Cloud Reliability Engineer
3 weeks ago
Hyderabad, Telangana, India Splunk Inc Full timeUnlock the Power of Machine DataSplunk Inc is seeking a highly skilled Senior Cloud Reliability Engineer to join our team. As a key member of our infrastructure software engineering team, you will play a critical role in designing, building, and operating our cloud-scale, big data, and microservices platforms.Key Responsibilities:Design and implement new...
-
Senior Cloud Engineer
4 weeks ago
Hyderabad, Telangana, India Swift Strategic Staff Solutions INC Full timeJob Title: Senior Cloud EngineerJob Summary:We are seeking a highly skilled Senior Cloud Engineer to design, implement, and manage cloud-native solutions on the Azure Kubernetes Service (AKS) platform. As a key member of our team, you will play a crucial role in monitoring and optimizing system performance using Grafana.Key Responsibilities:Design and...
-
Site Reliability Engineer
2 weeks ago
Hyderabad, Telangana, India Thomson Reuters Full timeAbout the RoleIn this opportunity as Site Reliability Engineer, you will be responsible for overseeing the operational aspects of cloud-based systems, ensuring their efficiency, reliability, and scalability. Key responsibilities include managing change and problem management, application and configuration management, and production support of strategic...
-
Senior Cloud Reliability Engineer
4 weeks ago
Hyderabad, Telangana, India Splunk Inc Full timeRole SummaryWe are seeking a highly skilled Senior Site Reliability Engineer to join our team at Splunk Inc. As a key member of our infrastructure team, you will be responsible for designing, implementing, and operating large-scale cloud-native microservices platforms.Key ResponsibilitiesDesign and implement new services, tools, and monitoring to ensure high...
-
Site Reliability Engineer
3 weeks ago
Hyderabad, Telangana, India Thomson Reuters Full timeAbout the RoleWe are seeking a highly skilled Site Reliability Engineer to join our team at Thomson Reuters. As a Site Reliability Engineer, you will be responsible for ensuring the reliability and scalability of our cloud infrastructure. This includes designing, implementing, and maintaining our cloud-based systems, as well as troubleshooting and resolving...
-
Site Reliability Engineer
3 weeks ago
Hyderabad, Telangana, India UnitedHealth Group Full timeAt UnitedHealth Group, we are committed to helping people live healthier lives and making the health system work better for everyone. As a Site Reliability Engineer - Cloud Expert, you will play a critical role in ensuring the reliability and performance of our cloud-based systems. This is an exciting opportunity to join our team and contribute to the...
-
Site Reliability Engineer
2 weeks ago
Hyderabad, Telangana, India Unison Consulting Pte Ltd Full timeJob Title: Site Reliability Engineer - Cloud ExpertAbout the Role:We are seeking a highly skilled Site Reliability Engineer to join our team at Unison Consulting Pte Ltd. As a Site Reliability Engineer, you will be responsible for ensuring the high availability and performance of our cloud-based applications.Key Responsibilities:Support Java (J2EE/Spring...