Current jobs related to Senior Infrastructure Reliability Specialist - Rajahmundry, Andhra Pradesh - beBeeSite
-
Infrastructure Reliability Specialist
1 week ago
Rajahmundry, Andhra Pradesh, India beBeeInfrastructure Full time ₹ 1,50,00,000 - ₹ 2,50,00,000Job OverviewWe are seeking a highly skilled Infrastructure Reliability Specialist to join our team. This role plays a critical part in ensuring the stability and performance of complex systems.Key Responsibilities:Implement DevOps practices to improve deployment efficiency, monitoring, and automation.Collaborate with cross-functional teams to identify and...
-
Senior Infrastructure Specialist
1 week ago
Rajahmundry, Andhra Pradesh, India beBeeInfrastructure Full time ₹ 2,00,00,000 - ₹ 2,50,00,000Job Title: Senior Infrastructure SpecialistWe are seeking a skilled and experienced Principal Engineer to join our team in the Site Reliability Engineering space.SRE Operations: Lead day-to-day operations for Accounting and Finance applications, ensuring they run smoothly and meet business expectations.Platform Management: Ensure Accounting and Finance...
-
IT Infrastructure Reliability Specialist
2 weeks ago
Rajahmundry, Andhra Pradesh, India beBeeReliabilityEngineer Full time ₹ 2,00,00,000 - ₹ 2,50,00,000About People Prime Worldwide:Our mission is to unleash human energy through technology for an inclusive and sustainable future, helping organisations accelerate their transition to a digital and sustainable world.We provide a variety of services, including consulting, technology, professional, and outsourcing services.Job Details:Good hands on experience in...
-
Senior Cloud Infrastructure Specialist
2 weeks ago
Rajahmundry, Andhra Pradesh, India beBeeCloud Full time ₹ 18,00,000 - ₹ 24,00,000Site Reliability Engineering LeaderThe position of Technical Manager for Site Reliability Engineering (SRE) entails leading a team of skilled engineers in achieving operational excellence and fostering a high-performing work environment.This role oversees daily operations, provides technical guidance, and aligns with the company's objectives. The Technical...
-
Cloud Infrastructure Specialist Position
1 week ago
Rajahmundry, Andhra Pradesh, India beBeeInfrastructure Full time ₹ 10,00,000 - ₹ 25,00,000Cloud Infrastructure Specialist\We are seeking a highly skilled Cloud Infrastructure Specialist to join our team. In this role, you will be responsible for designing and implementing cloud infrastructure that powers our nationwide platforms.\As a Cloud Infrastructure Specialist, you will play a critical role in ensuring the scalability, reliability, and...
-
Highly Skilled Infrastructure Architect
2 weeks ago
Rajahmundry, Andhra Pradesh, India beBeeReliability Full time ₹ 19,44,000 - ₹ 25,12,000Site Reliability EngineerWe are seeking a highly skilled Reliability Engineering Specialist to fill this critical role. The ideal candidate will have extensive experience in designing and implementing robust infrastructure systems, with expertise in DevOps and SRE principles.Key Responsibilities:Troubleshoot complex issues: Our reliability engineer must be...
-
Cloud Infrastructure Specialist
2 weeks ago
Rajahmundry, Andhra Pradesh, India beBeeCloud Full time US$ 2,00,000 - US$ 2,50,000We are seeking a highly skilled Cloud Infrastructure Specialist to join our team.Job SummaryThe ideal candidate will be responsible for designing and implementing scalable cloud infrastructure using AWS services. This includes ensuring high availability, performance tuning, and cost optimization of cloud and on-premise infrastructure. Additionally, the...
-
Cloud Infrastructure Specialist
2 weeks ago
Rajahmundry, Andhra Pradesh, India beBeeCloudSpecialist Full time ₹ 1,80,00,000 - ₹ 2,50,00,000Job Title: Cloud Infrastructure SpecialistWe are seeking a skilled Cloud Infrastructure Specialist to design and implement scalable and secure cloud infrastructure solutions.About the Role:Design and implement hybrid cloud architectures that meet the needs of our clients.Develop and maintain infrastructure as code (IaC) using Ansible and Terraform.Manage...
-
Cloud Gaming Infrastructure Specialist
2 weeks ago
Rajahmundry, Andhra Pradesh, India beBeeInfrastructure Full time ₹ 1,50,00,000 - ₹ 2,50,00,000About Our Role:We are looking for a skilled Cloud Gaming Infrastructure Specialist to join our team.As a key member of our Service Reliability Engineering team, you will play a significant role in delivering a great cloud gaming experience to our customers.You will influence design and operational decisions towards the overall stability of the gaming...
-
Chief Platform Reliability Engineer
2 weeks ago
Rajahmundry, Andhra Pradesh, India beBeeAutomation Full time ₹ 18,00,000 - ₹ 25,00,000Infrastructure Architect Specialist The RoleWe treat Infrastructure and operations as Software Engineering problems. Our mission is to build and progress software platforms which enables the provisioning and managing of all services in safe, reliable and scalable ways. This role is responsible for designing & architecting new solutions, finding creative...

Senior Infrastructure Reliability Specialist
3 weeks ago
We are seeking an experienced and dynamic Site Reliability Engineering (SRE) professional to oversee the reliability, scalability, and performance of our critical systems. As an SRE leader, you will play a pivotal role in establishing and implementing SRE practices, leading a team of engineers, and driving automation, monitoring, and incident response strategies.
This position combines software engineering and systems engineering expertise to build and maintain high-performing, reliable systems.
Key Responsibilities:
- Maintain high availability and reliability of critical services.
- Define and monitor SLIs, SLOs, and SLAs to ensure business requirements are met.
- Proactively identify and resolve performance bottlenecks and system inefficiencies.
Incident Management & Response:
- Establish and improve incident management processes and on-call rotations.
- Lead incident response and root cause analysis for high-priority outages.
- Drive post-incident reviews and ensure actionable insights are implemented.
Automation & Tooling:
- Develop and implement automated solutions to reduce manual operational tasks.
- Enhance system observability through metrics, logging, and distributed tracing tools.
- Optimize CI/CD pipelines for seamless deployments.
Collaboration:
- Partner with software engineering teams to improve the reliability of applications and infrastructure.
- Work closely with product/engineering teams to design scalable and robust systems.
- Ensure seamless integration of monitoring and alerting systems across teams.
Leadership & Team Building:
- Manage, mentor, and grow a team of SREs.
- Promote SRE best practices and foster a culture of reliability and performance across the organization.
- Drive performance reviews, skills development, and career progression for team members.
Capacity Planning & Cost Optimization:
- Perform capacity planning and implement autoscaling solutions to handle traffic spikes.
- Optimize infrastructure and cloud costs while maintaining reliability and performance.
Required Skills & Qualifications:
- Technical Expertise:
- Experience with cloud platforms (AWS / Azure / GCP) and Kubernetes.
- Hands-on knowledge of infrastructure-as-code tools like Terraform / Helm / Ansible.
- Proficiency in Java.
- Expertise in distributed systems, databases, and load balancing.
- Monitoring & Observability:
- Proficient with tools like Prometheus, Grafana, Elastic APM, or New Relic.
- Understanding of metrics-driven approaches for system monitoring and alerting.
- Automation & CI/CD:
- Hands-on experience with CI/CD pipelines (e.g., Jenkins, Azure Pipelines etc).
- Skilled in automation frameworks and tools for infrastructure and application deployments.
- Incident Management:
- Proven track record in handling incidents, post-mortems, and implementing solutions to prevent recurrence.
- Leadership & Communication Skills:
- Strong people management and leadership skills with the ability to inspire and motivate teams.
- Excellent problem-solving and decision-making skills.
- Clear and concise communication, with the ability to translate technical concepts for non-technical stakeholders.
Benefits:
- Be a key driver in building and scaling reliable systems in a fast-paced environment.
- Work with cutting-edge technologies and influence the evolution of the infrastructure.
- Lead a high-impact team and foster a culture of reliability and innovation.