Staff Site Reliability Engineer

4 months ago


Hyderabad, India ModMed Full time

We are united in our mission to make a positive impact on healthcare. Join Us!

South Florida Business Journal, Best Places to Work 2024
Inc. 5000 Fastest-Growing Private Companies in America 2023
Company of the Year | 2023 BIG Innovation Awards
Fastest-Growing Company of the Year – Large (Bronze) | 2022 Best in Biz Awards

Who we are:

We Are Modernizing Medicine (WAMM). We’re a team of bright, passionate, and positive problem-solvers on a mission to place doctors and patients at the center of care through an intelligent, specialty-specific cloud platform. Our vision is a world where the software we build increases medical practice success and improves patient outcomes. Founded in 2010 by Daniel Cane and Dr. Michael Sherling, we have grown to over 3400 combined direct and contingent team members serving eleven specialties, and we are just getting started. ModMed is based in Boca Raton, FL, with office locations in Santiago, Chile; Berlin, Germany; and Hyderabad, India, and a robust remote workforce with team members across the US.

As a Staff Cloud Engineer at Modernizing Medicine, you will spearhead efforts to ensure the reliability, performance, and scalability of our AWS cloud infrastructure and services. Your role will be pivotal in automating processes, optimizing costs, and applying industry-standard site reliability principles. With your deep knowledge of AWS, DataDog, Jenkins, and Kubernetes, you will lead our evolution towards more efficient and cost-effective cloud-native technologies, ensuring our operations align with strategic goals and deliver exceptional value.

Your Role:

  • Architect and manage secure, scalable cloud infrastructure and services, focusing on automation, reliability, and proactive cost management to ensure efficient operations.
  • Implement and refine observability and monitoring solutions using DataDog, ensuring proactive issue identification and efficient resource utilization.
  • Lead CI/CD pipeline development, maintenance, and optimization with Jenkins, integrating AWS services to enhance development workflows and infrastructure automation.
  • Drive the containerization and orchestration of applications using Kubernetes, enhancing scalability, deployment efficiency, and cost-effectiveness.
  • Monitor application and infrastructure performance in AWS, applying tuning and optimizations to ensure optimal resource utilization and user experience while managing costs.
  • Design and manage disaster recovery and backup strategies on AWS, prioritizing data integrity, system availability, and cost efficiency.
  • Provide expert troubleshooting and problem-solving across various platforms and applications within AWS, aiming for minimal disruption and quick resolution.
  • Ensure strict adherence to AWS security standards and compliance with data protection regulations, with a keen eye on cost implications.
  • Keep abreast of new cloud technologies and trends, recommending and implementing improvements for competitive advantage and cost savings.
  • Mentor and support junior team members, fostering a culture of learning, collaboration, and cost-consciousness.
  • Work closely with cross-functional teams to understand requirements and deliver AWS-based solutions that meet business objectives efficiently and cost-effectively.

Skills & Requirements:

  • Bachelor’s degree in Computer Science, Information Technology, or related field, or equivalent experience.
  • A minimum of 8-10 years of experience in Site Reliability Engineering, Cloud Engineering, or a similar role, with a demonstrated track record of problem-solving in complex, cloud-based environments. This should include extensive experience with designing, implementing, and managing scalable, highly available, and fault-tolerant systems.
  • Strong expertise in managing cloud environments (preferably in AWS), with hands-on experience in observability platforms such as DataDog.
  • Proficiency in automation and scripting languages (e.g., Python, Bash) and infrastructure as code (IaC) tools (e.g., Terraform, Ansible).
  • Extensive experience with CI/CD tools, notably Jenkins, and familiarity with containerization and orchestration technologies like Kubernetes.
  • Solid understanding of networking, cloud security best practices, performance optimization, and cost management strategies.
  • Demonstrated commitment to implementing industry-standard site reliability principles and a proactive approach to cost management in daily operations.
  • Proven leadership skills and the ability to mentor junior team members, guide teams through complex operational challenges, and foster a culture of continuous improvement.
  • Excellent verbal and written communication skills, with the ability to work effectively in a team environment and communicate complex technical concepts to a non-technical audience.

ModMed Benefits Highlight:

At ModMed, we believe it’s important to offer a competitive benefits package designed to meet the diverse needs of our growing workforce. Eligible Modernizers can enroll in a wide range of benefits, including:

  • Comprehensive medical, dental, and vision benefits, including a company Health Savings Account contribution
  • 401(k): ModMed provides a matching contribution each payday of 50% of your contribution deferred on up to 6% of your compensation. After one year of employment with ModMed, 100% of any matching contribution you receive is yours to keep.
  • Generous Paid Time Off and Paid Parental Leave programs
  • Company paid Life and Disability benefits, Flexible Spending Account, and Employee Assistance Programs
  • Company-sponsored Business Resource & Special Interest Groups that provide engaged and supportive communities within ModMed
  • Professional development opportunities, including tuition reimbursement programs and unlimited access to LinkedIn Learning
  • Global presence and in-person collaboration opportunities; dog-friendly HQ (US)
  • Hybrid office-based roles and remote availability
  • Weekly catered breakfast and lunch, treadmill workstations, Zen, and wellness rooms within our BRIC headquarters
