Cloud Reliability Engineer

2 days ago


Hyderabad Secunderabad Telangana Pune Chennai, India beBeeDevops Full time US$ 90,000 - US$ 1,20,000
Job Overview

This role involves overseeing the reliability and performance of systems and applications in a high-availability, customer-facing business environment where uptime is critical.

You will collaborate with a dynamic team to provision cloud resources, drive DevOps automation activities, and work closely with engineering teams to ensure efficient issue resolution.

This position is based in Bangalore, India.

  • Deploy and maintain infrastructure & solutions hosted in private and public clouds.
  • Manage application releases, configurations, upgrades & support of Java microservices, open-source tools and third-party services in a SaaS environment.
  • Identify, diagnose, and resolve complex technology issues efficiently in live production environments.
  • Escalate issues for triage and resolution with the Engineering and Cloud Infrastructure team.
  • Lead initiatives to avoid recurrence of issues and trigger automated actions to improve system availability.
  • Implement proactive monitoring of all systems/services/networks to detect and resolve problems.
  • Collaborate with the Security team on implementing DevSecOps practices.
  • Work with Architects and engineers to prepare scalable network & deployment architecture.

Key Skills:

  • Strong understanding of software development life cycles using CI/CD tools like Jenkins/GitLab, Argo CD, Helm Charts.
  • Good knowledge of Systems (Unix/Linux, open source, JVM) and networking concepts (TCP/IP, SNMP, SMTP, DNS, HTTP, SSL/TLS, VPN, routing tables).
  • Experience in container orchestration using Docker and Kubernetes, as well as VMware.
  • Proficiency in configuring and managing IIS (Internet Information Services), Apache web servers, and load balancers.
  • Knowledge of scripting languages (Ansible Script, Terraform, Chef, Puppet) and monitoring tools (ELK).
  • Experience with different queuing systems like RabbitMQ, Kafka, Microsoft SQL server.
  • Web Server/Application Server deployments and administration.
  • Familiarity with Windows Servers and Operating Systems.
  • Understanding of multi-tier architecture, Web-based development, and Service-Oriented Architecture.
  • Excellent communication and interpersonal skills.
  • Ability to prioritize tasks between strategic projects and immediate production requirements.

Benefits:

  • Prioritize long-term strategic projects and immediate production needs.
  • Take on-call rotations and coordinate work under production-critical situations.

Requirements:

  • At least 3 years of experience in system setup, configuration, diagnosis, and monitoring of Enterprise-grade SaaS services.
  • Bachelor's degree in Computer Science, Networking, or a related field.

Preferred Skills:

  • Passion for learning and mastering information technology.
  • Experience with DevOps tools and automation.
  • Basic understanding of Database (Postgres), IAM (Key Cloak) & Java Programming language.


  • Hyderabad / Secunderabad, Telangana, India beBeeCloud Full time US$ 1,04,000 - US$ 1,30,878

    Job DescriptionWe are seeking a highly skilled Azure Cloud Site Reliability Engineer (SRE) to join our organization. The ideal candidate will have a strong background in cloud infrastructure, automation, and operational excellence, with a focus on ensuring the reliability, scalability, and performance of our Azure cloud environments.The successful candidate...


  • Hyderabad / Secunderabad, Telangana, India beBeeReliability Full time US$ 1,04,000 - US$ 1,30,878

    System Reliability Engineer OpportunityWe are seeking an experienced System Reliability Engineer to join our organization in India. The ideal candidate will have a strong background in ensuring the reliability, scalability, and performance of our services.This role requires a mix of technical expertise, leadership skills, and a passion for operational...


  • Hyderabad, Pune, India searce Full time ₹ 15,00,000 - ₹ 20,00,000 per year

    Overview about the role.As a Site Reliability Engineer (SRE) in the Cloud Managed Services team at Searce, you play a pivotal role in ensuring the reliability, scalability, and performance of our cloud-based infrastructure. You'll be at the forefront of managing and optimizing cloud services to deliver high-quality and resilient solutions.Roles and...


  • Hyderabad / Secunderabad, Telangana, Chennai, India beBeeReliability Full time ₹ 18,00,000 - ₹ 25,00,000

    SRE Architect RoleWe are seeking a highly skilled SRE Architect to join our team.The ideal candidate will have experience designing and implementing reliable systems at scale, with a strong understanding of software engineering, system architecture, and operations.Key Responsibilities:System Design and Architecture: Lead the design and architecture of...


  • Pune, Chennai, Hyderabad / Secunderabad, Telangana, India beBeeCloudEngineeringManager Full time ₹ 1,04,000 - ₹ 1,30,878

    Job Title: Cloud Engineering Manager**About the Role:**We are seeking an experienced Cloud Engineering Manager to lead our team of engineers in designing, implementing and maintaining large-scale cloud-based systems. The ideal candidate will have a deep understanding of cloud computing, strong leadership skills and experience managing cross-functional...


  • Chennai, Hyderabad / Secunderabad, Telangana, India beBeeAutomation Full time US$ 90,000 - US$ 1,20,000

    Unlock new opportunities in a dynamic environmentTransform traditional IT Ops into SRE ops with expertise in SLI/SLOs/Toil/error budget etc.Achieve reliability, performance, and availability of IT Infrastructure and network through automationCreate scalable and resilient IT Infrastructure and network to minimize downtime/ incidents and ensure availability of...


  • Hyderabad, Telangana, India Careernet Full time ₹ 1,04,000 - ₹ 1,30,878 per year

    Key Skills: Cloud, Kubernetes, Python, Jenkins, OpenTelemetry, AppDynamics, Site Reliability Engineer.Roles & Responsibilities:Design, implement, and manage cloud infrastructure to ensure high availability and reliability.Utilize Kubernetes for container orchestration and management.Develop and maintain monitoring solutions using OpenTelemetry and...


  • Hyderabad / Secunderabad, Telangana, India beBeeCloudEngineer Full time

    Job Description:We are seeking a highly skilled and experienced Site Reliability Engineer to join our team. The successful candidate will be responsible for ensuring the smooth operation of our systems, identifying and resolving technical issues, and implementing process improvements.Key Responsibilities:Design, implement, and maintain scalable and efficient...


  • Chennai, Tamil Nadu, India Ford Global Career Site Full time ₹ 1,04,000 - ₹ 1,30,878 per year

    Be at the Forefront of Mobility's Future: Join Ford as a Site Reliability EngineerEnterprise Technology is the engine driving the future of transportation, and we're looking for a talented Site Reliability Engineer (SRE) to help us redefine mobility. In this role, you'll leverage cutting-edge technology to enhance customer experiences, improve lives, and...


  • Hyderabad, Telangana, India beBeeAzureSre Full time ₹ 15,00,000 - ₹ 25,00,000

    Reliable Cloud Engineer RoleThis is a key role that ensures the reliability, scalability, and security of cloud services.Responsibilities:Monitor and troubleshoot cloud infrastructure and applicationsCollaborate with cross-functional teams to resolve issues and implement improvementsDevelop and maintain cloud resources and automation scriptsPerform capacity...