SRE DevOps Manager
5 days ago
We are looking for a Site Reliability Engineering (SRE) DevOps Manager.

Location: Bangalore / Hyderabad / Chennai / Noida / Pune / Visakhapatnam / Gurgaon
Shift timing: Regular
Can join: Immediate to 30 days

Interested candidates, please share your profile and the details below by email to shanmukh.varma@infinite.com:
- Total experience:
- Relevant experience:
- Current CTC:
- Expected CTC:
- Notice period:
- If serving notice period, last working day:

Job Summary
We are seeking an experienced Site Reliability Engineering (SRE) Manager to lead and evolve our cloud infrastructure, reliability practices, and automation strategy. This role blends hands-on technical leadership with strategic oversight to ensure scalable, secure, and reliable systems across AWS-based environments.

As an SRE Manager, you will guide a team of DevOps and SRE engineers to design, build, and operate cloud-native platforms leveraging Kubernetes (EKS), Terraform, and AWS DevOps tools. You will drive operational excellence through observability, automation, and AIOps, enhancing reliability, performance, and cost efficiency. You will collaborate closely with development, product, and security teams to define SLOs, manage error budgets, and continuously improve infrastructure resilience and developer productivity.

Key Responsibilities

Leadership & Strategy
- Lead, mentor, and grow a global team of Site Reliability and DevOps Engineers.
- Define and drive the reliability roadmap, SLOs, and error budgets across services.
- Establish best practices for infrastructure automation, observability, and incident response.
- Partner with engineering leadership to shape long-term cloud, Kubernetes, and AIOps strategies.

Infrastructure & Automation
- Design, implement, and manage AWS cloud infrastructure using Terraform (advanced modules, remote state management, custom providers).
- Build and optimize CI/CD pipelines using AWS CodePipeline, CodeBuild, CodeDeploy, and CodeCommit.
- Manage EKS clusters with a focus on scalability, reliability, and cost efficiency, leveraging Helm, ingress controllers, and a service mesh (e.g., Istio).
- Implement robust security and compliance practices (IAM policies, network segmentation, secrets management).
- Automate environment provisioning for dev, staging, and production using Infrastructure as Code (IaC).

Monitoring, Observability & Reliability
- Lead observability initiatives using Prometheus, Grafana, CloudWatch, and OpenSearch/ELK.
- Improve system visibility and response times by enhancing monitoring, tracing, and alerting mechanisms.
- Drive proactive incident management and root cause analysis (RCA) to prevent recurring issues.
- Apply chaos engineering principles and reliability testing to ensure resilience under load.

AIOps & Advanced Operations
- Integrate AIOps tools to proactively detect, diagnose, and remediate operational issues.
- Design and manage scalable deployment strategies for AI/LLM workloads (e.g., Llama, Claude, Cohere).
- Monitor model performance and reliability across hybrid Kubernetes and managed AI environments.
- Stay current with MLOps and Generative AI infrastructure trends, applying them to production workloads.
- Manage 24/7 operations using appropriate alerting tools and a follow-the-sun model.

Cost Optimization & Governance
- Analyze and optimize cloud costs through instance right-sizing, auto-scaling, and spot usage.
- Implement cost-aware architecture decisions and monitor monthly spend for alignment with budgets.
- Establish cloud governance frameworks to enhance cost visibility and accountability across teams.

Collaboration & Process
- Partner with developers to streamline deployment workflows and improve developer experience.
- Maintain high-quality documentation, runbooks, and postmortem reviews.
- Foster a culture of reliability, automation, and continuous improvement across teams.
-
Senior DevOps Platform Engineer
4 days ago
Yelahanka, India apna Full time Job Title: Senior Engineer (SDE-2) – Platform Engineering Location: Bengaluru Employment Type: Full-time Team: Platform Engineering About the Role: We are looking for a passionate and hands-on DevOps Engineer to join our Platform Engineering team and accelerate our platform modernization journey. This role is ideal for engineers who thrive in...
-
DevOps - Kubernetes Architect
5 days ago
Yelahanka, India TRDFIN Support Services Pvt Ltd Full time Position: DevOps – Kubernetes Architect Location: Bangalore/Noida/Gurgaon/Pune/Hyderabad/Indore Experience: 12-17 Years Budget: 45-50 LPA Work Mode: Hybrid Notice Period: Immediate / 15 Days Max Key Responsibilities: - Design and implement DevOps solutions that automate software delivery pipelines and infrastructure provisioning. - Architect and...
-
Engineer II – Frontend T500-21429
5 days ago
Yelahanka, India lululemon Full time About lululemon: lululemon is an innovative performance apparel company for yoga, running, training, and other athletic pursuits. Setting the bar in technical fabrics and functional design, we create transformational products and experiences that support people in moving, growing, connecting, and being well. We owe our success to our innovative products,...
-
Engineer II T500-20909
4 days ago
Yelahanka, India lululemon Full time Who we are: lululemon is an innovative performance apparel company for yoga, running, training, and other athletic pursuits. Setting the bar in technical fabrics and functional design, we create transformational products and experiences that support people in moving, growing, connecting, and being well. We owe our success to our innovative product, emphasis...
-
Manager - IT
2 weeks ago
Yelahanka, Karnataka, India Wesco Full time ₹ 12,00,000 - ₹ 24,00,000 per year Description: We are seeking a highly skilled and strategic Technical Data Platform Manager to lead and support a dynamic team of data professionals, including Databricks Architects, Platform Administrators, and specialists in Azure SQL, Azure Data Factory (ADF), and Control-M. This role is pivotal in driving the design, implementation, and optimization of our...
-
SDE IV
5 days ago
Yelahanka, India Talentoj Full time Role Purpose: As a Software Development Engineer IV (SDE IV), you will play a critical role in designing and building scalable backend systems. As a senior individual contributor, you will take ownership of complex features, contribute to architectural decisions, and mentor other engineers. Your focus will be on delivering high-quality, production-ready...
-
Oracle Database Administration
6 days ago
Yelahanka, Karnataka, India Intelex Technologies ULC Full time ₹ 12,00,000 - ₹ 36,00,000 per year Position Overview Position: Oracle Database Administrator Shift: 2:00 PM to 11:00 PM IST Company Overview With more than 1,300 clients and 1.6 million users, Intelex Technologies, ULC is a global leader in environmental, health, safety and quality (EHSQ) management software. Since 1992 its scalable, web-based platform and applications have helped clients across...
-
DevOps + OpenShift Engineer
7 days ago
Yelahanka, India ValueLabs Full time **We are hiring for CI/CD/OpenShift for Dubai Location.** **Note** **30 days Max or Serving Notice period** Bachelor of Computer Science or Equivalent Red Hat Certified Architect or Equivalent, Azure Cloud Architect, AWS Cloud Architect, OCI Cloud Implementation At least 8 years of experience at a relevant technical position in large organizations, - Must...
-
Delivery Manager – Modernization
5 days ago
Yelahanka, India Programmers.io Full time Job Title: Delivery Manager – Modernization & Microsoft Stack Shift Timings: General Timings till 10 PM IST Location: Remote, India, Jaipur, Hyderabad, Work from Home Experience required: 8 to 15 years We are looking for a dynamic Delivery Manager with a strong technical foundation in Microsoft technologies, especially .NET and C#, and proven...
-
Automation Test Lead
7 days ago
Yelahanka, India HSO Full time Key Responsibilities: - Lead and manage automation and performance testing for MD365 F&O projects. - Design, develop, and maintain test automation frameworks using Selenium WebDriver, Java, TestNG, and Maven. - Implement and scale Leapwork automation for Dynamics 365 F&O and related enterprise systems. - Conduct performance testing using tools such as...