Reliability Engineer for Cloud Operations

1 day ago


Pune, Maharashtra, India Quorum Software Full time

About the Role:

">

Quorum Software is seeking a highly skilled Reliability Engineer to join our team in Pune, India. As a key member of our cloud operations team, you will play a critical role in designing and implementing scalable observability solutions, ensuring the reliability and efficiency of our cloud-based systems.

About Us:

Quorum Software connects people and information across the energy value chain, providing innovative software solutions to optimize business workflows. With a strong commitment to diversity, equity, and inclusion, we foster a culture of collaboration and knowledge sharing. Our team members are passionate about delivering exceptional results, and we're looking for like-minded professionals to join us.

Your Responsibilities:

  • Design and Implement Observability Solutions: You will design and implement scalable observability solutions, including monitoring, logging, tracing, and alerting, with a strong focus on minimizing downtime. Experience with monitoring/alerting solutions is essential.
  • Instrumentation and Integration: Work closely with development teams to ensure proper instrumentation of applications, services, and infrastructure components for comprehensive observability.
  • Cloud Platforms: Strong understanding of public cloud services and architecture, familiarity with VMware private cloud solutions, and hands-on experience with hybrid cloud environments.
  • Incident Response and Resolution: Play a key role in incident response by utilizing observability tools to quickly identify and resolve issues, minimizing downtime and impact on end-users.
  • Performance Optimization: Collaborate with development teams to analyze performance metrics and implement optimizations to enhance system reliability and efficiency.
  • Automation and Scripting: Develop automation scripts and tools to streamline observability processes, ensuring timely data collection and analysis.
  • Capacity Planning: Contribute to capacity planning efforts by analyzing observability data to forecast future resource requirements, proactively address potential bottlenecks, and identify under- or unutilized resources.
  • Collaboration, Knowledge Sharing, Documentation: Foster a culture of collaboration by sharing insights and best practices with cross-functional teams, provide training and guidance to promote observability best practices, assist with creation of solution documentation, and partner with internal and vendor resources to evangelize solutions.
  • Continuous Improvement: Stay abreast of industry trends and emerging technologies in observability and contribute to the continuous improvement of our observability strategy and tools.

What We Offer:

  • An estimated salary of ₹1,200,000 - ₹1,800,000 per annum, depending on experience and qualifications.
  • A dynamic and inclusive work environment that values diversity, equity, and inclusion.
  • Opportunities for professional growth and development, including training and mentorship programs.
  • A collaborative and supportive team environment that fosters knowledge sharing and innovation.
  • A comprehensive benefits package, including health insurance, retirement plans, and paid time off.

Requirements:

  • 5-8 years of relevant experience in cloud operations, IT service management, or a related field.
  • Experience with public and private cloud platforms, including AWS, Azure, and VMware.
  • Background in IT operations, including managing datacenter/public cloud infrastructure and IT services.
  • A track record of actively seeking and implementing improvements in cloud operations processes and technologies.
  • Experience with ensuring adherence to SLOs for response and resolution of alerts and incidents.
  • Excellent written and verbal communication skills and customer empathy.
  • Familiarity with ITIL (Information Technology Infrastructure Library) framework, particularly incident management, problem management, and change management.
  • Understanding of cloud security best practices and compliance standards.
  • Strong problem-solving skills to analyze complex issues, identify root causes, and implement effective solutions.
  • Strong documentation skills to maintain records of operational procedures, incident reports, and other relevant documentation.

Nice to Have:

  • Certifications related to AWS, Azure, and ITIL frameworks are a plus.


  • Pune, Maharashtra, India Red Hat India Private Limited Full time

    Red Hat India Private Limited is seeking a highly skilled Cloud Reliability Engineer to join its team. As a Cloud Reliability Engineer, you will be responsible for developing, scaling, and operating our OpenShift managed cloud services.">Company OverviewFounded in 1993, Red Hat is the world's leading provider of enterprise software solutions. Our...


  • Pune, Maharashtra, India Virtusa Full time

    Job Title: Cloud Reliability EngineerVirtusa is seeking a skilled Cloud Reliability Engineer to join our team. The ideal candidate will have a minimum of 5 years of experience in SRE, focusing on integration platforms and cloud-based deployments. Strong programming skills, particularly in integration tier and middleware, are essential. Experience with...


  • Pune, Maharashtra, India Red Hat India Private Limited Full time

    About the Role:We are seeking a skilled Site Reliability Engineer to join our Cloud Operations team in India. As a key member of our team, you will contribute to the development, scaling, and operation of our Red Hat OpenShift Managed Cloud platform. Your expertise in cloud providers and technologies, including kubernetes, will be crucial in enabling...


  • Pune, Maharashtra, India Oracle Full time

    As a Principal Cloud Reliability Engineer at Oracle, you will be part of a dynamic team responsible for ensuring the reliability and performance of our cloud services. **Key Responsibilities:**• Collaborate with cross-functional teams to design, develop, and operate cloud infrastructure and services.• Work with customers and partners to ensure successful...


  • Pune, Maharashtra, India Virtusa Full time

    Virtusa is seeking a highly skilled Cloud Infrastructure Reliability Engineer to join our team. This is an exciting opportunity to work with cutting-edge technology and be part of a dynamic team.About the RoleAs a Cloud Infrastructure Reliability Engineer, you will be responsible for designing, implementing, and maintaining highly available and scalable...


  • Pune, Maharashtra, India People First Consultants Full time

    At People First Consultants, we're seeking a Cloud Reliability Engineer to join our team and play a key role in ensuring the reliability, efficiency, and performance of our applications meet our customers' needs.Responsibilities:Collaborate with development teams and other partner teams to ensure application reliability, efficiency, and performance meet...


  • Pune, Maharashtra, India Practicology Full time

    Job OverviewWe are seeking a highly skilled Cloud Reliability Engineer to join our team. As a Cloud Reliability Engineer, you will be responsible for designing, building, and maintaining scalable and reliable infrastructure in AWS (Postgres, Redis, Docker, Queues, Kinesis Streams, S3, etc.).Key ResponsibilitiesInfrastructure and AutomationDesign and build...


  • Pune, Maharashtra, India Fulcrum Digital Full time

    About the RoleFulcrum Digital is a digital transformation and technology services company that accelerates business growth through innovative solutions. We're seeking a skilled Cloud Reliability Engineer to join our team.Key ResponsibilitiesPlan, manage, and oversee all aspects of a Production EnvironmentDefine strategies for Application Performance...


  • Pune, Maharashtra, India Virtusa Consulting Services Private Limited Full time

    Job Title: Site Reliability EngineerAbout the Role:We are seeking an experienced Site Reliability Engineer to join our team at Virtusa Consulting Services Private Limited. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud-based infrastructure.Key Responsibilities:Minimum 8 years of...


  • Pune, Maharashtra, India Quorum Software Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our team at Quorum Software. As a Site Reliability Engineer, you will play a key role in ensuring the reliability and performance of our cloud-based infrastructure.Key ResponsibilitiesDesign and implement scalable observability solutions, including monitoring, logging, tracing,...


  • Pune, Maharashtra, India Red Hat India Private Limited Full time

    Red Hat is a leading provider of enterprise software solutions, leveraging a community-powered approach to deliver high-performing Linux, cloud, container, and Kubernetes technologies.As a Principal Site Reliability Engineer (SRE), you will play a critical role in developing, scaling, and operating our OpenShift managed cloud services. With over 10+ years of...


  • Pune, Maharashtra, India ZS Full time

    Cloud Infrastructure Reliability EngineerZS is a place where passion changes lives. As a management consulting and technology firm focused on transforming global healthcare and beyond, our most valuable asset is our people. Here you'll work side-by-side with a powerful collective of thinkers and experts shaping solutions from start to finish. At ZS, we...


  • Pune, Maharashtra, India Jobs for Humanity Full time

    About the RoleWe are seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the scalability, high availability, and performance of our software applications.Key ResponsibilitiesDesign and implement scalable and highly available software systemsCollaborate with cross-functional...


  • Pune, Maharashtra, India Thinkproject Full time

    A career at Thinkproject could be your perfect fit if you value a culture of mutual respect and the freedom to create a work-life balance.What do we do?As a European market leader in digitalisation tools for construction companies, Thinkproject has a complex yet exciting mission: to deliver digitalisation and make the AECO industry safer, healthier, and more...


  • Pune, Maharashtra, India Procore Technologies Full time

    **Job Title:** Cloud Infrastructure Engineer - Reliability Expert**Estimated Salary:** $140,000 - $170,000 per yearWe are seeking a highly skilled Senior Reliability Engineer with strong backend software engineering skills to join our team at Procore Technologies. This is an exciting opportunity to design, implement, and maintain cloud infrastructure that...


  • Pune, Maharashtra, India Red Hat India Private Limited Full time

    Red Hat is seeking an experienced Principal Cloud Reliability Engineer to develop, scale, and operate our OpenShift managed cloud services. As a key member of our SRE team, you will contribute to running OpenShift at scale by enabling customer self-service, making our monitoring system more sustainable, and eliminating work through automation.On the SRE...


  • Pune, Maharashtra, India Jobs for Humanity Full time

    About the Role:We are seeking a highly skilled Site Reliability Engineer to join our team at FIS. The successful candidate will be responsible for ensuring the scalability, high availability, and performance of our software applications.Key Responsibilities:Design and implement automation scripts to simplify operations and improve system...


  • Pune, Maharashtra, India ZS Full time

    Cloud Transformation ExpertZS is a global consulting and technology firm that transforms healthcare and beyond. As a cloud transformation expert, you'll work with a powerful collective of thinkers and experts to shape solutions from start to finish. Our most valuable asset is our people, and we honor the diversity that makes us unique.Our Cloud Center of...


  • Pune, Maharashtra, India Roche Full time

    About the PositionJob SummaryAt Roche, we are seeking a skilled Site Reliability Engineer - Cloud Expert to join our team. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability and performance of our production systems.Key ResponsibilitiesDesign, implement, and maintain site reliability engineering practices that ensure...


  • Pune, Maharashtra, India Siemens Industry Software (India) Private Limited Full time

    Solution Engineer for Cloud ReliabilityAs a cloud reliability engineer in our team, you will be responsible for designing, deploying, and automating solutions to drive new capabilities, visibility, and efficiency in our cloud-based platforms. You will collaborate with other technical platforms and partners to engineer automated and integrated solutions...