SRE Devops Manager
2 days ago
We are looking for a Site Reliability Engineering (SRE) DevOps Manager.

Location: Bangalore / Hyderabad / Chennai / Noida / Pune / Visakhapatnam / Gurgaon
Shift timing: Regular
Can join: Immediate - 30 days

Interested candidates, please share your profile and the details below by email.
Email ID: Shanmukh.Varma@infinite.com
Total experience:
Relevant experience:
Current CTC:
Expected CTC:
Notice period:
If serving notice period, last working day:

Job Summary
We are seeking an experienced Site Reliability Engineering (SRE) Manager to lead and evolve our cloud infrastructure, reliability practices, and automation strategy. This role blends hands-on technical leadership with strategic oversight to ensure scalable, secure, and reliable systems across AWS-based environments.
As an SRE Manager, you will guide a team of DevOps and SRE engineers to design, build, and operate cloud-native platforms leveraging Kubernetes (EKS), Terraform, and AWS DevOps tools. You will drive operational excellence through observability, automation, and AIOps, enhancing reliability, performance, and cost efficiency.
You will collaborate closely with development, product, and security teams to define SLOs, manage error budgets, and continuously improve infrastructure resilience and developer productivity.

Key Responsibilities

Leadership & Strategy
- Lead, mentor, and grow a global team of Site Reliability and DevOps Engineers.
- Define and drive the reliability roadmap, SLOs, and error budgets across services.
- Establish best practices for infrastructure automation, observability, and incident response.
- Partner with engineering leadership to shape long-term cloud, Kubernetes, and AIOps strategies.

Infrastructure & Automation
- Design, implement, and manage AWS cloud infrastructure using Terraform (advanced modules, remote state management, custom providers).
- Build and optimize CI/CD pipelines using AWS CodePipeline, CodeBuild, CodeDeploy, and CodeCommit.
- Manage EKS clusters with a focus on scalability, reliability, and cost efficiency, leveraging Helm, ingress controllers, and service mesh (e.g., Istio).
- Implement robust security and compliance practices (IAM policies, network segmentation, secrets management).
- Automate environment provisioning for dev, staging, and production using Infrastructure as Code (IaC).

Monitoring, Observability & Reliability
- Lead observability initiatives using Prometheus, Grafana, CloudWatch, and OpenSearch/ELK.
- Improve system visibility and response times by enhancing monitoring, tracing, and alerting mechanisms.
- Drive proactive incident management and root cause analysis (RCA) to prevent recurring issues.
- Apply chaos engineering principles and reliability testing to ensure resilience under load.

AIOps & Advanced Operations
- Integrate AIOps tools to proactively detect, diagnose, and remediate operational issues.
- Design and manage scalable deployment strategies for AI/LLM workloads (e.g., Llama, Claude, Cohere).
- Monitor model performance and reliability across hybrid Kubernetes and managed AI environments.
- Stay current with MLOps and Generative AI infrastructure trends, applying them to production workloads.
- Manage 24/7 operations using appropriate alerting tools and a follow-the-sun model.

Cost Optimization & Governance
- Analyze and optimize cloud costs through instance right-sizing, auto-scaling, and spot usage.
- Implement cost-aware architecture decisions and monitor monthly spend for alignment with budgets.
- Establish cloud governance frameworks to enhance cost visibility and accountability across teams.

Collaboration & Process
- Partner with developers to streamline deployment workflows and improve developer experience.
- Maintain high-quality documentation, runbooks, and postmortem reviews.
- Foster a culture of reliability, automation, and continuous improvement across teams.
-
DevOps / SRE with Python
1 day ago
Bengaluru, India Bahwan Cybertek Group Full time
We are looking for a talented DevOps / SRE Engineer with strong Python skills to join our team at Bahwan Cybertek Group. As a DevOps / SRE Engineer, you will be responsible for maintaining and improving our software development and deployment processes, as well as ensuring the reliability and scalability of our infrastructure. Responsibilities: - Develop and...
-
DevOps / SRE with Python
1 week ago
Bengaluru, Karnataka, India Bahwan Cybertek Group Full time ₹ 20,00,000 - ₹ 25,00,000 per year
We are looking for a talented DevOps / SRE Engineer with strong Python skills to join our team at Bahwan Cybertek Group. As a DevOps / SRE Engineer, you will be responsible for maintaining and improving our software development and deployment processes, as well as ensuring the reliability and scalability of our infrastructure. Responsibilities: - Develop and...
-
DEVOPS SRE
1 week ago
Bengaluru, India RARR Technologies Full time
Job Description Key Responsibilities: SRE & DevOps Strategy: - Design and develop a robust SRE ecosystem following industry best practices. - Formulate SRE strategies based on emerging trends and organizational needs. - Implement best practices into local functional teams for consistent adoption. Platform & Automation: - Develop scaffolding libraries for...
-
DevOps / SRE - Python
2 weeks ago
Bengaluru, Karnataka, India Bahwan CyberTek Full time ₹ 15,00,000 - ₹ 25,00,000 per year
We are looking for a talented DevOps / SRE Engineer with strong Python skills to join our team at Bahwan Cybertek Group. As a DevOps / SRE Engineer, you will be responsible for maintaining and improving our software development and deployment processes, as well as ensuring the reliability and scalability of our infrastructure. Responsibilities: Develop and...
-
DevOps Sre
3 days ago
Bengaluru, Karnataka, India Hutech Solutions Full time
Looking for a DevOps SRE. The expected experience is 5+ years. - LOCATION: Bengaluru/Pune - JOB TYPE: Full Time - NOTICE PERIOD: Immediate/Less Than 2 Weeks **Qualification**: Bachelor's/Master's in Computer Science **Experience**: 5+ Years **Roles and Responsibilities**: As a DevOps Site Reliability Engineer, your primary responsibilities will include: -...
-
DevOps Engineer
4 weeks ago
Bengaluru, India 8byte Full time
Job Description: DevOps Engineer, 8byte. What is the role? As a DevOps Engineer, you will be responsible for designing, implementing, and maintaining robust, scalable, and secure infrastructure while optimizing development and deployment processes. This role goes beyond traditional DevOps and aligns closely with Site Reliability Engineering (SRE) principles,...
-
SRE Devops Manager
2 days ago
Bengaluru, India Infinite Computer Solutions Full time
We are looking for Site Reliability Engineering (SRE) Devops Manager. Location: Bangalore / Hyderabad / Chennai / Noida / Pune / Visakhapatnam / Gurgaon. Shift timing: regular. Can join: Immediate - 30 days. Interested candidates, please share your profiles and below details to Email ID: Shanmukh.Varma@infinite.com Total experience: Relevant Experience: Current...
-
SRE Devops Manager
2 days ago
Bengaluru, India Infinite Computer Solutions Full time
We are looking for Site Reliability Engineering (SRE) Devops Manager. Location: Bangalore / Hyderabad / Chennai / Noida / Pune / Visakhapatnam / Gurgaon. Shift timing: regular. Can join: Immediate - 30 days. Interested candidates, please share your profiles and below details to Email ID: Shanmukh.Varma@infinite.com Total experience: Relevant Experience: Current...
-
SRE Devops Manager
7 hours ago
Bengaluru, India Infinite Computer Solutions Full time
We are looking for Site Reliability Engineering (SRE) Devops Manager. Location: Bangalore / Hyderabad / Chennai / Noida / Pune / Visakhapatnam / Gurgaon. Shift timing: regular. Can join: Immediate - 30 days. Interested candidates, please share your profiles and below details to Email ID: Total experience: Relevant Experience: Current CTC: Expected CTC: Notice...