
Cloud Reliability Engineer
2 days ago
This role involves overseeing the reliability and performance of systems and applications in a high-availability, customer-facing business environment where uptime is critical.
You will collaborate with a dynamic team to provision cloud resources, drive DevOps automation activities, and work closely with engineering teams to ensure efficient issue resolution.
This position is based in Bangalore, India.
- Deploy and maintain infrastructure & solutions hosted in private and public clouds.
- Manage application releases, configurations, upgrades & support of Java microservices, open-source tools and third-party services in a SaaS environment.
- Identify, diagnose, and resolve complex technology issues efficiently in live production environments.
- Escalate issues for triage and resolution with the Engineering and Cloud Infrastructure team.
- Lead initiatives to avoid recurrence of issues and trigger automated actions to improve system availability.
- Implement proactive monitoring of all systems/services/networks to detect and resolve problems.
- Collaborate with the Security team on implementing DevSecOps practices.
- Work with Architects and engineers to prepare scalable network & deployment architecture.
Key Skills:
- Strong understanding of software development life cycles using CI/CD tools like Jenkins/GitLab, Argo CD, Helm Charts.
- Good knowledge of Systems (Unix/Linux, open source, JVM) and networking concepts (TCP/IP, SNMP, SMTP, DNS, HTTP, SSL/TLS, VPN, routing tables).
- Experience in container orchestration using Docker and Kubernetes, as well as VMware.
- Proficiency in configuring and managing IIS (Internet Information Services), Apache web servers, and load balancers.
- Knowledge of scripting languages (Ansible Script, Terraform, Chef, Puppet) and monitoring tools (ELK).
- Experience with different queuing systems like RabbitMQ, Kafka, Microsoft SQL server.
- Web Server/Application Server deployments and administration.
- Familiarity with Windows Servers and Operating Systems.
- Understanding of multi-tier architecture, Web-based development, and Service-Oriented Architecture.
- Excellent communication and interpersonal skills.
- Ability to prioritize tasks between strategic projects and immediate production requirements.
Benefits:
- Prioritize long-term strategic projects and immediate production needs.
- Take on-call rotations and coordinate work under production-critical situations.
Requirements:
- At least 3 years of experience in system setup, configuration, diagnosis, and monitoring of Enterprise-grade SaaS services.
- Bachelor's degree in Computer Science, Networking, or a related field.
Preferred Skills:
- Passion for learning and mastering information technology.
- Experience with DevOps tools and automation.
- Basic understanding of Database (Postgres), IAM (Key Cloak) & Java Programming language.
-
Azure Cloud Site Reliability Engineer
20 hours ago
Hyderabad / Secunderabad, Telangana, India beBeeCloud Full time US$ 1,04,000 - US$ 1,30,878Job DescriptionWe are seeking a highly skilled Azure Cloud Site Reliability Engineer (SRE) to join our organization. The ideal candidate will have a strong background in cloud infrastructure, automation, and operational excellence, with a focus on ensuring the reliability, scalability, and performance of our Azure cloud environments.The successful candidate...
-
Senior System Reliability Engineer
2 days ago
Hyderabad / Secunderabad, Telangana, India beBeeReliability Full time US$ 1,04,000 - US$ 1,30,878System Reliability Engineer OpportunityWe are seeking an experienced System Reliability Engineer to join our organization in India. The ideal candidate will have a strong background in ensuring the reliability, scalability, and performance of our services.This role requires a mix of technical expertise, leadership skills, and a passion for operational...
-
Senior Cloud Reliability Engineer
4 hours ago
Hyderabad, Pune, India searce Full time ₹ 15,00,000 - ₹ 20,00,000 per yearOverview about the role.As a Site Reliability Engineer (SRE) in the Cloud Managed Services team at Searce, you play a pivotal role in ensuring the reliability, scalability, and performance of our cloud-based infrastructure. You'll be at the forefront of managing and optimizing cloud services to deliver high-quality and resilient solutions.Roles and...
-
Senior System Reliability Engineer
6 days ago
Hyderabad / Secunderabad, Telangana, Chennai, India beBeeReliability Full time ₹ 18,00,000 - ₹ 25,00,000SRE Architect RoleWe are seeking a highly skilled SRE Architect to join our team.The ideal candidate will have experience designing and implementing reliable systems at scale, with a strong understanding of software engineering, system architecture, and operations.Key Responsibilities:System Design and Architecture: Lead the design and architecture of...
-
Cloud Engineering Manager
3 days ago
Pune, Chennai, Hyderabad / Secunderabad, Telangana, India beBeeCloudEngineeringManager Full time ₹ 1,04,000 - ₹ 1,30,878Job Title: Cloud Engineering Manager**About the Role:**We are seeking an experienced Cloud Engineering Manager to lead our team of engineers in designing, implementing and maintaining large-scale cloud-based systems. The ideal candidate will have a deep understanding of cloud computing, strong leadership skills and experience managing cross-functional...
-
Reliable IT Engineer
4 days ago
Chennai, Hyderabad / Secunderabad, Telangana, India beBeeAutomation Full time US$ 90,000 - US$ 1,20,000Unlock new opportunities in a dynamic environmentTransform traditional IT Ops into SRE ops with expertise in SLI/SLOs/Toil/error budget etc.Achieve reliability, performance, and availability of IT Infrastructure and network through automationCreate scalable and resilient IT Infrastructure and network to minimize downtime/ incidents and ensure availability of...
-
Cloud Site Reliability Engineer
2 hours ago
Hyderabad, Telangana, India Careernet Full time ₹ 1,04,000 - ₹ 1,30,878 per yearKey Skills: Cloud, Kubernetes, Python, Jenkins, OpenTelemetry, AppDynamics, Site Reliability Engineer.Roles & Responsibilities:Design, implement, and manage cloud infrastructure to ensure high availability and reliability.Utilize Kubernetes for container orchestration and management.Develop and maintain monitoring solutions using OpenTelemetry and...
-
Reliable Cloud Infrastructure Specialist
6 days ago
Hyderabad / Secunderabad, Telangana, India beBeeCloudEngineer Full timeJob Description:We are seeking a highly skilled and experienced Site Reliability Engineer to join our team. The successful candidate will be responsible for ensuring the smooth operation of our systems, identifying and resolving technical issues, and implementing process improvements.Key Responsibilities:Design, implement, and maintain scalable and efficient...
-
Cloud Site Reliability Engineer
16 hours ago
Chennai, Tamil Nadu, India Ford Global Career Site Full time ₹ 1,04,000 - ₹ 1,30,878 per yearBe at the Forefront of Mobility's Future: Join Ford as a Site Reliability EngineerEnterprise Technology is the engine driving the future of transportation, and we're looking for a talented Site Reliability Engineer (SRE) to help us redefine mobility. In this role, you'll leverage cutting-edge technology to enhance customer experiences, improve lives, and...
-
Cloud Reliability Specialist
3 days ago
Hyderabad, Telangana, India beBeeAzureSre Full time ₹ 15,00,000 - ₹ 25,00,000Reliable Cloud Engineer RoleThis is a key role that ensures the reliability, scalability, and security of cloud services.Responsibilities:Monitor and troubleshoot cloud infrastructure and applicationsCollaborate with cross-functional teams to resolve issues and implement improvementsDevelop and maintain cloud resources and automation scriptsPerform capacity...