Infrastructure Reliability Specialist

24 hours ago


ghaziabad, India beBeeExpert Full time

Reliability Engineering ExpertWe're revolutionizing the home services industry through AI-driven technology, connecting customers with trusted professionals instantly.Our goal is to scale infrastructure to serve millions of users reliably.Lead incident response, conduct root cause analysis, and ensure permanent preventive measures.Design and optimize CI/CD pipelines, automate deployments, and enforce release stability.Build scalable infrastructure on AWS, GCP, or Azure using Terraform, Ansible, and Kubernetes.Continuously monitor system health with Prometheus, Grafana, ELK, and CloudWatch.Conduct load and performance testing and optimize systems for high-traffic events.Improve observability, reduce alert noise, and enhance signal clarity for faster debugging.Collaborate with developers and architects to ensure systems meet SLOs, SLIs, and SLAs.Develop automation scripts and tools in Python, Go, Node.js, or Shell to streamline operations.Manage distributed systems and message queues like Kafka or RabbitMQ.Drive a culture of reliability, automation, and scalability across teams.



  • ghaziabad, India beBeeEngineer Full time

    Job OverviewAn innovative AI infrastructure specialist is required to join a leading company in the field of sustainable energy.This role involves designing and implementing scalable AI solutions that are tailored to meet the diverse needs of clients. With a focus on environmental responsibility, the company utilizes state-of-the-art technology to support AI...


  • ghaziabad, India beBeeCloudEngineer Full time

    Our team thrives on leveraging cutting-edge technology to drive innovation and customer success.We're seeking a Cloud Infrastructure Specialist to contribute to the management, operation, and optimization of our cloud infrastructure across GCP, AWS, and Azure.


  • ghaziabad, India beBeeInfrastructure Full time

    Cloud Infrastructure SpecialistThe role of a Cloud Platform Engineer is to contribute to the management, operation, and optimization of our cloud infrastructure. This involves implementing and managing infrastructure using Terraform, ensuring it is scalable, reliable, and cost-effective.Key responsibilities include defining and managing infrastructure as...


  • ghaziabad, India beBeeDiscovery Full time

    As a seasoned IT Professional, you will lead the implementation of Discovery processes to create an accurate and reliable Configuration Management Database (CMDB) across on-premises and cloud environments.You will oversee the deployment, configuration, and maintenance of ServiceNow Discovery across hybrid infrastructure landscapes.Configure and maintain...


  • ghaziabad, India beBeeMachineLearning Full time

    About the RoleWe are seeking a seasoned MLOps Engineer to spearhead our platform's infrastructure development. The successful candidate will work closely with cross-functional teams to drive innovation, design scalable systems, and ensure the reliability and performance of all deployed systems.


  • ghaziabad, India beBeeInfrastructure Full time

    Our company is seeking an experienced professional to fill a key role in designing and implementing large-scale cloud infrastructure on Google Cloud Platform.Key Responsibilities:Design, implement, and manage large-scale cloud infrastructure on Google Cloud Platform.Ensure cloud environments follow best practices for security, operations, and...


  • ghaziabad, India beBeeDatabase Full time

    Key ResponsibilitiesWe are seeking a Senior DevOps & Database Reliability Engineer to strengthen our infrastructure reliability, database performance and disaster recovery capabilities.Design automate and test backup and disaster recovery strategies for large-scale databases and production systems.Manage optimize MySQL MongoDB PostgreSQL and Redis databases...


  • ghaziabad, India beBeeReliability Full time

    Site Reliability Engineer: Scaling InfrastructureTransform your career in a collaborative community of colleagues around the world. Your Role:Deploy and manage scalable infrastructure using Kubernetes clusters.Implement automation solutions for operational workflows using Python or Go.Apply reliability and performance principles to ensure high-quality...


  • ghaziabad, India beBeeSRE Full time

    Improve infrastructure scalability, reliability, and automation.Job DescriptionAs a Site Reliability Engineer, you will drive operational excellence across our platforms. You will design, implement, and manage AWS infrastructure at scale (EC2, S3, ELB, Lambda, ECS, Route 53, SQS, CloudWatch).You will develop and maintain Infrastructure as Code (IaC) using...


  • ghaziabad, India beBeeLinux Full time

    **Job Title:** Senior Linux Systems EngineerWe are seeking an experienced professional to spearhead the implementation and management of on-premises Linux servers optimized for AI/ML workloads.The ideal candidate will have in-depth expertise in Linux system administration, Kubernetes cluster management, and a strong understanding of data center...