Site Reliability Engineer
3 weeks ago
About the Team
We are growing our WSA Cloud Center of Excellence team within R&D, dedicated to ensuring the continuous reliability and performance of our cloud-based infrastructure. Our team leverages cutting-edge technology to build and maintain resilient systems that meet the demands of modern, cloud-native applications.
Job Description:
We are seeking an experienced Site Reliability Engineer to join our team, with a focus on monitoring, alerting, and infrastructure stability. This role primarily involves maintaining the reliability and performance of our systems hosted in Azure Cloud.
Main Responsibilities:
- Ensure System Uptime and Reliability: Monitor and maintain cloud-based applications and infrastructure, ensuring minimal downtime and efficient incident response.
- Build and Optimize Monitoring and Alerting Systems: Set up and continuously improve comprehensive monitoring and alerting frameworks to detect and address issues proactively.
- Cloud Infrastructure Management: Manage, optimize, and scale systems on Azure cloud platforms, ensuring high performance and cost-effectiveness.
- Incident Management and Response: Act as the first line of defense in identifying, diagnosing, and resolving technical issues in real time, escalating them to the appropriate teams when needed.
- Automation and Infrastructure as Code (IaC): Utilize IaC tools to automate infrastructure provisioning and management, promoting reproducibility and reducing manual interventions.
- Tooling and Observability: Leverage technologies such as Grafana for observability and Argo for CI/CD automation, enhancing our ability to respond swiftly and effectively to infrastructure needs.
- Collaboration: Work closely with cross-functional teams to align on SRE best practices, share insights, and support development and operational goals.
Requirements:
- Experience with Cloud Platforms: 5+ years of experience in cloud environments, with a primary focus on Azure.
- Monitoring and Alerting Skills: Strong experience with monitoring tools (e.g., Grafana, Prometheus) and a background in setting up alerts and dashboards.
- Incident Management: Proven track record in diagnosing and troubleshooting complex system issues, with a focus on fast incident response and resolution.
- Collaboration and Communication: Excellent communication skills, with an ability to work collaboratively with various technical teams and stakeholders.
- Kubernetes Expertise: Proficiency with Kubernetes (K8s) for orchestrating and managing containerized applications.
- Automation and IaC: Hands-on experience with a scripting language (e.g., Python, shell script, PowerShell) and Infrastructure as Code tools (e.g., Terraform, Ansible) for automating cloud infrastructure management.
Estimated Salary: $120,000 - $180,000 per annum, depending on experience.
Location: Flexible work arrangements available, with remote work options and flexible working hours.
-
Site Reliability Engineering Manager
3 weeks ago
Hyderabad, Telangana, India Truetech Full time. Senior Site Reliability Engineer: We are seeking an experienced Senior Site Reliability Engineer to lead and manage a team of engineers, providing guidance and support to ensure the team's success. The ideal candidate will have a strong background in site reliability engineering, cloud computing, and DevSecOps principles. The successful candidate will be...
-
Site Reliability Engineering Lead
1 month ago
Hyderabad, Telangana, India Live Connections Full time. We are looking for a highly skilled Site Reliability Engineering Lead to join our team at Live Connections in Hyderabad. As a key member of our organization, you will be responsible for leading and managing a team of engineers to ensure the reliability, scalability, and performance of our systems. Estimated Salary: ₹25,00,000 - ₹35,00,000 per...
-
Site Reliability Engineering Leader
1 week ago
Hyderabad, Telangana, India Truetech Full time. About the Role: We are seeking a highly experienced Site Reliability Engineering (SRE) leader to manage our team of SREs. The successful candidate will be responsible for providing mentorship, guidance, and support to ensure the team's success. Key Responsibilities: Develop and implement strategies for improving system reliability, scalability, and...
-
Site Reliability Expert
3 weeks ago
Hyderabad, Telangana, India IT Full time. About the Role: This is an exciting opportunity for a skilled Site Reliability Engineer to join our team in Chennai and Hyderabad. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, scalability, and performance of our systems. You will be responsible for designing, developing, and managing integration solutions using...
-
Site Reliability Engineering Team Lead
1 month ago
Hyderabad, Telangana, India Live Connections Full time. We are seeking an experienced Site Reliability Engineering Team Lead to join our team at Live Connections in Hyderabad. About the Role: This is a leadership position that requires a strong technical background in site reliability engineering and experience in managing teams. The ideal candidate will have a proven track record of driving projects to successful...
-
Site Reliability Engineering Expert
4 weeks ago
Hyderabad, Telangana, India GeekBull Consulting Full time. We are seeking a highly skilled Senior Site Reliability Engineer to join our team at GeekBull Consulting. As a key member of our engineering team, you will play a critical role in ensuring the reliability and efficiency of our clients' applications and systems. About the Role: This is a 6-month contract-to-hire position with the possibility of extension or...
-
Chief Site Reliability Engineering Lead
1 month ago
Hyderabad, Telangana, India Live Connections Full time. About Live Connections: We're a cutting-edge technology firm dedicated to delivering innovative solutions. Our team is passionate about crafting exceptional products that drive business success. Job Description: System Reliability Engineer Manager. This role offers an exciting opportunity to lead our site reliability engineering team, driving strategies for...
-
Site Reliability Engineer
3 weeks ago
Hyderabad, Telangana, India AutoRABIT Software Pvt Ltd Full time. Job Summary: We are seeking an experienced Site Reliability Engineer to join our team at AutoRABIT Software Pvt Ltd. As a key member of our DevOps team, you will be responsible for contributing to and maintaining monitoring, logging, and alerting systems for comprehensive visibility into infrastructure health. The ideal candidate will have a strong...
-
Site Reliability Engineer
4 weeks ago
Hyderabad, Telangana, India Talent500 Full time. About the Role: We are seeking a highly skilled Site Reliability Engineer to join our team at Talent500. As a key member of our IT department, you will play a vital role in shaping the future of travel and ensuring that our systems operate with reliability and efficiency. Job Description: The successful candidate will have a strong background in designing,...
-
Site Reliability Engineer
4 weeks ago
Hyderabad, Telangana, India Tanla Platforms Limited Full time. Job Title: Site Reliability Engineer - Platform Optimizer. Estimated Salary: $120,000 - $180,000 per annum. Company Overview: Tanla Platforms Limited is a rapidly growing company in the telecom and CPaaS space. We are committed to creating an inclusive environment for all employees and champion diversity. About the Role: We are seeking a skilled Site...
-
Hyderabad, Telangana, India Tanla Platforms Limited Full time. Job Overview: We are seeking an experienced Site Reliability Engineer to join our team at Tanla Platforms Limited. As a key member of our infrastructure team, you will play a pivotal role in ensuring the availability, scalability, and reliability of our platforms.
-
Site Reliability Engineer
2 weeks ago
Hyderabad, Telangana, India Tanla Platforms Limited Full time. About Tanla Platforms Limited: Tanla is a leading provider of cloud-based software solutions for the telecom and CPaaS space. Job Summary: We are seeking an experienced Site Reliability Engineer to join our platform operations team. This role will be responsible for ensuring the high availability, scalability, and reliability of our platforms. Key...
-
Reliability Engineer Position
3 weeks ago
Hyderabad, Telangana, India IT Full time. Job Description: We are seeking a skilled Site Reliability Engineer to join our team in Chennai and Hyderabad. The ideal candidate will have 4-7 years of experience in implementing and maintaining Site Reliability Engineering practices. The role involves designing, developing, and managing integration solutions using tools such as RabbitMQ, Postman, Apache,...
-
Hyderabad, Telangana, India Lifelancer Full time. Job Title: Site Reliability Engineer: Optimizing System Performance. Location: Hyderabad, India. Job Type: Full-time. About Us: Lifelancer is a talent-hiring platform in Life Sciences, Pharma, and IT. We connect talented professionals with opportunities in pharma, biotech, health sciences, healthtech, and IT domains. About the Role: We are seeking an experienced...
-
Site Reliability Engineering Lead
3 weeks ago
Hyderabad, Telangana, India AutoRABIT Software Pvt Ltd Full time. Job Title: Site Reliability Engineering Lead - AWS CI/CD. Location: Remote / Hybrid Work Mode. Estimated Salary: $130,000 per annum. About the Role: We are seeking an experienced Site Reliability Engineering Lead to join our team at AutoRABIT Software Pvt Ltd. As a Site Reliability Engineering Lead, you will be responsible for designing and implementing scalable,...
-
Senior Reliability Engineering Team Lead
3 weeks ago
Hyderabad, Telangana, India Live Connections Full time. We are seeking a seasoned Senior Reliability Engineering Team Lead at Live Connections in Hyderabad. As a key member of our engineering team, you will be responsible for leading and managing a team of Site Reliability Engineers to ensure the success of our systems and applications. With over 5 years of experience in site reliability engineering or a related...
-
Site Reliability Engineer
5 days ago
Hyderabad, Telangana, India ValueLabs Full time. ValueLabs is seeking a highly skilled Site Reliability Engineer to join our team in the development and operation of cloud-native applications on Azure. We are looking for someone with strong experience in Azure cloud environments, container orchestration tools like Kubernetes, Helm, and Docker, as well as scripting languages such as PowerShell, Python, Go,...
-
Hyderabad, Telangana, India Truetech Full time. About Truetech: Truetech is a dynamic company that pushes the boundaries of innovation in site reliability engineering. Our team is dedicated to delivering exceptional services and products, and we are looking for a highly skilled individual to join our ranks as a Senior Reliability Engineering Manager for Scalable Systems. About the Role: This is a leadership...
-
Site Reliability Engineer Lead
4 weeks ago
Hyderabad, Telangana, India AutoRABIT Full time. About AutoRABIT: We are a leading DevOps and CI/CD platform for SaaS platforms. Our unique metadata-aware capability makes Release Management, Version Control, and Backup & Recovery complete, reliable, and effective. Job Summary: We are seeking an experienced Site Reliability Engineer Lead to join our team in Hyderabad. The ideal candidate will have a strong...
-
Reliability Engineering Professional
1 week ago
Hyderabad, Telangana, India PURVIEW Full time. Purview is hiring a Site Reliability Engineering professional for one of our valued clients. This permanent role is based in Hyderabad, and we offer a competitive salary. Job Overview: We are seeking an experienced Site Reliability Engineer to join our team. The successful candidate will have a strong background in application support, with a focus on...