Site Reliability Engineer
1 week ago
Job Description

About Infinova
Infinova is an emerging player in intelligent business transformation, dedicated to helping organizations scale smarter and achieve sustainable success. We are building a foundation that combines strategic consultancy, financial expertise, and technology-driven solutions to deliver measurable growth and operational efficiency. Our services include AI-powered business consultancy, talent solutions, and advanced technology development, enabling businesses to convert data into actionable intelligence, optimize performance, and embrace innovation. With a commitment to transparency, quality, and future-ready strategies, Infinova ensures every partnership delivers lasting impact.

About The Role
The Site Reliability Engineer will play a key role in designing, deploying, and supporting highly available AI platform environments across Azure, AWS, and Google Cloud. This role is focused on ensuring secure, scalable, and reliable operations for cloud-native and AI-driven workloads. The ideal candidate will collaborate directly with cloud engineers, product teams, and customer stakeholders to optimize infrastructure, automate CI/CD, and deploy cloud resources that support AI products and enterprise platforms. The successful candidate will have experience in cloud deployment operations, orchestration, automation, and performance monitoring for production systems.

Key Responsibilities
Lead deployment design, planning, and configuration for cloud platforms, including Azure, AWS, GCP, and Kubernetes environments.
Optimize cloud architectures for availability, scalability, cost efficiency, and performance while aligning with cloud security and operational best practices.
Craft and maintain automation using IaC frameworks such as Terraform, Ansible, and CloudFormation.
Deploy CI/CD pipelines to automate build, test, and release processes for cloud-based AI platforms and services.
Ensure platform availability and reliability by configuring alerting, monitoring, and response systems.
Provide ongoing support and maintenance for cloud infrastructure, identifying and resolving incidents to ensure high system uptime.
Perform detailed analysis of failures, conduct root cause analysis, and implement corrective and preventive actions.
Establish end-to-end monitoring frameworks, dashboards, and observability for AI workloads and cloud deployments.
Conduct regular security reviews, threat mitigation, and compliance validation for multi-cloud environments.
Work closely with development, QA, and product teams to enhance service delivery and operational workflows.
Share best practices, documentation, and knowledge to elevate team capability and platform reliability.

Skills and Experience
Bachelor's degree in Computer Science, Engineering, or a related field, or equivalent experience.
Expertise in Azure, AWS, and Google Cloud platform services and operational models.
Strong knowledge of Linux, virtualization, and container runtimes including Docker and Kubernetes.
Deep understanding of networking, security, access control, and compliance frameworks within multi-cloud environments.
Proficiency in IaC tools (Terraform, CloudFormation), configuration tools (Puppet, Chef, Helm), and scripting (Python, Bash, PowerShell).
Experience with CI/CD tools such as GitHub Actions or Jenkins, and monitoring tools such as Prometheus, ELK, or Splunk.
Strong diagnostic and troubleshooting skills, with the ability to support mission-critical deployments.
Excellent communication skills and the ability to collaborate across engineering, product, and customer teams.
Fluent in English and comfortable working in cloud-based distributed environments.
-
Site Reliability Engineer
2 weeks ago
Cochin, Kerala, India NOV Full time ₹ 15,00,000 - ₹ 25,00,000 per year
Description: We are seeking a highly motivated and experienced Site Reliability Engineer to join our team. The SRE will work closely with our software engineering and operations teams to ensure the reliability, performance, and scalability of our mission-critical systems. The ideal candidate is a creative problem-solver who can design and implement innovative...
-
Site Reliability Engineer
4 weeks ago
Kochi, India NOV Inc Full time
We are seeking a highly motivated and experienced Site Reliability Engineer to join our team. The SRE will work closely with our software engineering and operations teams to ensure the reliability, performance, and scalability of our mission-critical systems. The ideal candidate is a creative problem-solver who can design and implement innovative solutions...
-
Site Reliability Engineer
2 weeks ago
India Pagos Consultants Full time
We are looking for experienced site reliability engineers to join a founding team of startup-minded individuals that will lay the groundwork for our new fintech offering. This team will play a pivotal role in spearheading innovation. As such, you will have the opportunity to shape the early architecture and design of the system and set the trajectory for its...
-
Site Reliability Engineer
19 hours ago
Cochin, Kerala, India NOV Full time
Job Description: We are seeking a highly motivated and experienced Site Reliability Engineer to join our team. The SRE will work closely with our software engineering and operations teams to ensure the reliability, performance, and scalability of our mission-critical systems. The ideal candidate is a creative problem-solver who can design and implement...
-
Site Engineer
4 weeks ago
India, Cochin / Kochi / Ernakulam B-RAM NIRMAN PVT. LTD. Full time
Job Description
Company Description: B-RAM Nirman Pvt. Ltd., established in 2004 and headquartered in Ernakulam, Kerala, is a leading construction company specializing in commercial and industrial infrastructure. Known for its focus on innovation, safety, sustainability, and quality, the company manages every aspect of projects, from design to execution, with...
-
Site Reliability Engineer
3 weeks ago
India Datum Technologies Group Full time
Job Title: Site Reliability Engineer (SRE) – AWS
Experience: 8+ years
Location: Chennai / Mumbai
Work Mode: Hybrid
Key Skills: AWS, Terraform, Kubernetes, Docker, Grafana, Prometheus, Datadog
Job Summary: We are looking for a skilled Site Reliability Engineer (SRE) with strong AWS experience and a solid background in DevOps, automation, observability, and...
-
Site Reliability Engineer
17 hours ago
Kochi, India NOV Full time
Job Description: We are seeking a highly motivated and experienced Site Reliability Engineer to join our team. The SRE will work closely with our software engineering and operations teams to ensure the reliability, performance, and scalability of our mission-critical systems. The ideal candidate is a creative problem-solver who can design and implement...
-
Site Reliability Engineer
1 week ago
India Insight Global Full time
Company: Insight Global
Duration: Approved for 1 year
📍 Location: Remote (India)
💼 Type: Contract with Insight Global Client
💰 Compensation: 14 LPA – 20 LPA
🕒 Working Hours: Normal IST hours
🚀 Start Date: Immediate (No notice period)
About the Role: Join our Site Reliability Engineering (SRE) team as a Fullstack Developer, focused on building...