Site Reliability/DevOps Engineer
5 days ago
Job Title : Director - SRE, DevOps, Monitoring, and Database Operations
Key Responsibilities :
Leadership & Strategy :
- Provide technical and people leadership to SRE, DevOps, Monitoring, and Database Operations teams.
- Collaborate with leadership on budgeting, planning, hiring, and managing third-party contracts.
- Oversee project status, assemble project teams, and define assignments with schedules and milestones.
Platform Reliability & Performance :
- Drive continuous improvement of reliability, stability, and performance of digital platforms.
- Oversee implementation of automated telemetry, observability, and applied intelligence systems.
- Lead efforts to develop automated alerting, self-healing mechanisms, and intelligent response systems.
Incident & Escalation Management :
- Ensure 24/7 uptime of sites and services, with minimal unplanned downtime.
- Serve as Escalation Manager/Critical Incident Manager during major incidents, leading teams in rapid service restoration.
- Provide on-call escalation support based on 24/7/365 schedules.
- Communicate timely updates and incident reports to senior leadership.
Collaboration & Integration :
- Partner with administrators, platform engineers, and other stakeholders to achieve highly reliable infrastructure, systems, and integrations.
- Collaborate with product, application development, QA, and technology teams to enhance service reliability and performance.
Incident Management & Automation :
- Provide advanced Incident and Problem Management support to effectively diagnose, remediate, and resolve platform issues.
- Automate critical workflows across the platform to minimize manual errors and reduce human intervention.
- Implement ITIL processes like Incident, Problem, and Change Management.
Monitoring & Scalability :
- Design and implement effective monitoring systems with proper alerting and escalation mechanisms for critical events.
- Ensure timely capacity planning and infrastructure upgrades for optimal reliability.
- Develop and refine processes to minimize Mean Time to Recover (MTTR) and extend Mean Time to Failure (MTTF).
Documentation & Compliance :
- Create and maintain detailed documentation, including run books, incident response guides, post-mortem reports, RCAs, and mitigation plans.
- Ensure all changes adhere to established procedures and documentation standards.
Business Alignment :
- Understand business workflows and map technology solutions to address problems effectively.
- Lead conversations and provide technical support to both internal and external customers.
-
Site Reliability Engineer
5 months ago
Bengaluru, India Squareroot Consulting Pvt Ltd. Full timeSite Reliability EngineerLocation : Bangalore, IndiaDomain : CybersecurityBudget : 30 to 50 Lacks - We are looking for a hands-on devops engineer leading the design, implementation of devops/SRE practice for our infrastructure for data privacy.- The successful candidate will have experience implementing advanced DevOps & SRE techniques such as Auto...
-
Site Reliability Engineer
4 weeks ago
Bengaluru, India ValueLabs Full timeWe are seeking a highly skilled Site Reliability Engineer (SRE) with expertise in Splunk, AWS CloudWatch, New Relic, Prometheus, and DevOps tools to join our team. The ideal candidate will have a strong background in cloud computing, containerization, and monitoring tools. The SRE will be responsible for ensuring the reliability, scalability, and performance...
-
Site Reliability Engineer
1 month ago
Bengaluru, India ValueLabs Full timeWe are seeking a highly skilled Site Reliability Engineer (SRE) with expertise in Splunk, AWS CloudWatch, New Relic, Prometheus, and DevOps tools to join our team. The ideal candidate will have a strong background in cloud computing, containerization, and monitoring tools. The SRE will be responsible for ensuring the reliability, scalability, and performance...
-
Site Reliability Engineer
1 month ago
Bengaluru, India ValueLabs Full timeWe are seeking a highly skilled Site Reliability Engineer (SRE) with expertise in Splunk, AWS CloudWatch, New Relic, Prometheus, and DevOps tools to join our team. The ideal candidate will have a strong background in cloud computing, containerization, and monitoring tools. The SRE will be responsible for ensuring the reliability, scalability, and performance...
-
Site Reliability Engineer
1 month ago
Bengaluru, India ValueLabs Full timeWe are seeking a highly skilled Site Reliability Engineer (SRE) with expertise in Splunk, AWS CloudWatch, New Relic, Prometheus, and DevOps tools to join our team. The ideal candidate will have a strong background in cloud computing, containerization, and monitoring tools. The SRE will be responsible for ensuring the reliability, scalability, and performance...
-
Site Reliability Engineer
3 weeks ago
Bengaluru, Karnataka, India Natobotics Technologies Pvt Limited Full timeJob Title: Site Reliability EngineerWe are seeking an experienced Site Reliability Engineer to join our team at Natobotics Technologies Pvt Limited. The ideal candidate will have a strong background in system administration and DevOps, with expertise in configuration management tools, Linux/Unix systems, and scripting languages.About the Role:This is a...
-
Site Reliability Engineer
1 month ago
Bengaluru, India BCE Global Tech Full timeAt BCE Global Tech, immerse yourself in exciting projects that are shaping the future of both consumer and enterprise telecommunications. This involves building innovative mobile apps to enhance user experiences and enable seamless connectivity on-the-go.If you are passionate about technology and eager to make a difference, we want to hear from you! Apply...
-
DevOps Engineer
3 days ago
Bengaluru, India Coders Brain Technology Private Limited Full timeExperience :Required : 5-12 YearsLocation : BangaloreKey Responsibilities :Infrastructure Management :- Design, deploy, and manage scalable infrastructure in AWS using services such as ECS, EC2, EBS, EKS, S3, RDS, ELB, IAM, and Lambda.- Administer Linux-based systems, ensuring optimal performance and security.- Handle Kubernetes cluster management and...
-
Site Reliability Engineer Specialist
3 weeks ago
Bengaluru, Karnataka, India Synechron Full timeJob Opportunity: We are seeking a skilled Site Reliability Engineer to join our team at Synechron. As a trusted partner in digital optimization and modernization, we provide innovative technology solutions for businesses.About the Role: In this customer-facing role, you will be responsible for understanding SRE requirements, assisting in building the team,...
-
Site Reliability Engineer
3 weeks ago
Bengaluru, India Karix Full timeRole:Site Reliability EngineerLocation:Bangalore (WFO)About the role:We are seeking an experienced professional Site Reliability Engineer who acts as a bridge between development and IT operations, taking operational tasks to ensure the efficient functioning of Service platforms. They are responsible for monitoring, automating, and improving the reliability,...
-
Senior Site Reliability Engineer
2 months ago
Bengaluru, India Tech Mahindra (formerly Mahindra Satyam) Full timeWe are looking for Senior SRE who can join with us in Pan India Location.Skill - SRE with GCP platformExp - 6- 13 YrsLocation - Pan IndiaHybrid work locationAs an Off-Shore Site Reliability Engineer, you will have the following preferred skillsets:- Experience with cloud platforms such as Azure, or GCP- Proficiency in scripting languages such as Python,...
-
Site Reliability Engineer
1 month ago
Bengaluru, India BCE Global Tech Full timeAt BCE Global Tech, immerse yourself in exciting projects that are shaping the future of both consumer and enterprise telecommunications. This involves building innovative mobile apps to enhance user experiences and enable seamless connectivity on-the-go. If you are passionate about technology and eager to make a difference, we want to hear from you! Apply...
-
Site Reliability Engineer
1 month ago
Bengaluru, India BCE Global Tech Full timeAt BCE Global Tech, immerse yourself in exciting projects that are shaping the future of both consumer and enterprise telecommunications. This involves building innovative mobile apps to enhance user experiences and enable seamless connectivity on-the-go. If you are passionate about technology and eager to make a difference, we want to hear from you! Apply...
-
Site Reliability Engineer
1 month ago
Bengaluru, India BCE Global Tech Full timeAt BCE Global Tech, immerse yourself in exciting projects that are shaping the future of both consumer and enterprise telecommunications. This involves building innovative mobile apps to enhance user experiences and enable seamless connectivity on-the-go. If you are passionate about technology and eager to make a difference, we want to hear from you! Apply...
-
Site Reliability Engineer
3 weeks ago
Greater Bengaluru Area, India Intuition IT – Intuitive Technology Recruitment Full timeJob Description:Prior experience in establishing Site Reliability Engineering function with 24/7 supportCoding experience and demonstrate how to build, test, scan and deploy a .NET and JavaScript application.Hands-on experience of Azure cloud, IaC , JSON, Azure Bicep, Azure policies, Azure DevOps, Open telemetry, Azure Monitoring, Azure Sentinel, Azure...
-
Site Reliability Engineer
3 weeks ago
Greater Bengaluru Area, India Intuition IT – Intuitive Technology Recruitment Full timeJob Description: Prior experience in establishing Site Reliability Engineering function with 24/7 support Coding experience and demonstrate how to build, test, scan and deploy a .NET and JavaScript application. Hands-on experience of Azure cloud, IaC , JSON, Azure Bicep, Azure policies, Azure DevOps, Open telemetry, Azure Monitoring, Azure Sentinel, Azure...
-
Site reliability engineer
3 weeks ago
Bengaluru, India Karix Full timeRole: Site Reliability EngineerLocation: Bangalore (WFO)About the role: We are seeking an experienced professional Site Reliability Engineer who acts as a bridge between development and IT operations, taking operational tasks to ensure the efficient functioning of Service platforms. They are responsible for monitoring, automating, and improving the...
-
Senior Site Reliability Engineer
2 months ago
Bengaluru, India Tech Mahindra (formerly Mahindra Satyam) Full timeWe are looking for Senior SRE who can join with us in Pan India Location.Skill - SRE with GCP platformExp - 6- 13 YrsLocation - Pan IndiaHybrid work locationAs an Off-Shore Site Reliability Engineer, you will have the following preferred skillsets: - Experience with cloud platforms such as Azure, or GCP - Proficiency in scripting languages such as Python,...
-
Senior Site Reliability Engineer
2 months ago
Bengaluru, India Tech Mahindra (formerly Mahindra Satyam) Full timeWe are looking for Senior SRE who can join with us in Pan India Location.Skill - SRE with GCP platformExp - 6- 13 YrsLocation - Pan IndiaHybrid work locationAs an Off-Shore Site Reliability Engineer, you will have the following preferred skillsets: - Experience with cloud platforms such as Azure, or GCP - Proficiency in scripting languages such as Python,...
-
Site Reliability Engineer
3 months ago
Greater Bengaluru Area, India BCE Global Tech Full timeAbout the role We are seeking a talented Site Reliability Engineer (SRE) to join our team. The ideal candidate will have a strong background in software engineering and systems administration, with a passion for building scalable and reliable systems. As an SRE, you will collaborate with development and operations teams to ensure our services are reliable,...