Senior Site Reliability Engineer

4 weeks ago

New Delhi, India Allegion Full time

Allegion India is seeking a highly motivated Senior Site Reliability Engineer who will play a critical role in ensuring the reliability, scalability, and performance of our organization's systems and infrastructure, who will work with a team of cross-functional product development engineers to design, implement, and maintain highly available and resilient systems and whose expertise in automation, monitoring, and incident response will contribute to the overall stability and efficiency of our technology stack throughout the Allegion product portfolio.Job Description:Design, implement, and maintain highly available and scalable infrastructure systems, ensuring maximum uptime and performance. Collaborate with software engineering teams to build and deploy applications using best practices in reliability, scalability, and security. Develop and implement automation tools and frameworks to streamline operational processes, reduce manual intervention, and improve efficiency. Monitor and analyse system performance, identifying bottlenecks, and implementing solutions to optimize performance and scalability. Implement and maintain effective monitoring, alerting, and logging systems to proactively identify and resolve issues before they impact users. HandsOn Experience in building CI/CD automated pipelines using GitHUB Actions/Jenkins/GitLab or equivalent platform Excellent in Automating workflows or solutions using Python/Go/Shell Lead incident response and root cause analysis efforts, driving continuous improvement and preventing future incidents. Collaborate with cross-functional teams to define and enforce best practices, standards, and guidelines for system reliability and performance. Participate in on-call rotations and respond to incidents, ensuring timely resolution and minimal impact to users and thereby meeting SLAs. Plan and devise Disaster Recovery (DR) strategies and implement DR Plans. Mentor and provide guidance to junior team members, fostering a culture of learning and growth. Run the production environment by monitoring availability and taking a holistic view of system health. Build software and systems to manage platform infrastructure and applications. Improve reliability, quality, and time-to-market of our suite of software solutions. Measure and optimize system performance, with an eye toward pushing our capabilities forward, getting ahead of customer needs, and innovating for continual improvement. Provide primary operational support and engineering for multiple large-scale distributed software applications.Required Knowledge, Skills and Abilities:Proven experience as a Site Reliability Engineer or similar role, with a focus on designing and maintaining highly available and scalable systems. Strong programming and scripting skills (Python, Bash, etc.) to automate operational tasks and develop tooling. Experience with cloud platforms (AWS) and containerization technologies (Docker, EKS). Proficient in configuration management tools like Ansible and infrastructure-as-code frameworks such as Terraform and CloudFormation. Experience with monitoring and logging tools (Prometheus, Grafana, Loki, Sentry.io, CloudWatch, etc.) for proactive system monitoring and troubleshooting. Ability to program (Structured and OOP) using one or more high-level languages, such as Java and JavaScript Solid understanding of networking principles, protocols, and security best practices. Strong problem-solving skills and the ability to work effectively in a fast-paced, dynamic environment. Excellent communication and collaboration skills, with the ability to work effectively with cross-functional teams. Experience with distributed storage technologies such as NFS, Amazon S3, as well as dynamic resource management frameworks (Apache Mesos, Kubernetes, Yarn) Proactive approach to identifying problems, performance bottlenecks, and areas for improvement. Experience in Agile methodologies Strong skills in software design, design patterns Experience in different architecture patterns like client-server/server less computing. Effective written, verbal and presentation skills with the ability to clearly articulate ideas and concepts. Self-directed and able to direct others.Desired Skills & Abilities:Experience with setting up performance/load test environments. Familiarity with SOC2 audit processesRequired Education and/or Experience:BE/B Tech/M Tech/MCA/MSc in Computer Science Engineering 7 to 11 Years of experience in Software Application Development/CloudOps/SREAllegion is a diverse and inclusive environment. We are an equal opportunity employer and are dedicated to hiring qualified protected veterans and individuals with disabilities. If for any reason you cannot apply through the job center, please contact HR, Allegion India for special accommodation. We are an equal opportunity employer and all qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, disability status, protected veteran status, or any other characteristic protected by law.

Site Reliability Engineer

4 weeks ago

New Delhi, India WhiteLotus Talent Partners Full time

We are looking for a L0 and L1 Site Reliability Engineer (SRE) Support to join our Krutrim Cloud Site Reliability operations team and ensure the smooth functioning of our cloud infrastructure powered by OpenStack and Kubernetes. In this role, you will focus on monitoring, basic troubleshooting, and incident response, helping to maintain high system...
Senior Site Reliability Engineer

1 day ago

New Delhi, India Synechron Full time

We have immediate opportunity forSRE (Senior Site Reliability Engineer) 5+ years. Synechron– MumbaiJob Role: -SRE (Senior Site Reliability Engineer) Job Location: -MumbaiAbout Synechron We began life in 2001 as a small, self-funded team of technology specialists. Since then, we’ve grown our organization to 14,500+ people, across 58 offices, in 21...
Site Reliability Engineer

4 weeks ago

New Delhi, India SID Global Solutions Full time

Job Role: Site Reliability Engineer (SRE) – GCPExperience: 3+ yearsLocation: HyderabadAbout SIDGS:SIDGS is a premium global systems integrator and global implementation partner of Google corporation, providing Digital Solutions & Services to Fortune 500 companies. Our Digital solutions go across following domains: User Experience, CMS, API Management,...
Senior Staff Site Reliability Engineer

4 weeks ago

New Delhi, India Movius Full time

Senior Staff Site Reliability EngineerLocation: Bengaluru, KA, 560076 Job Description: We are seeking a highly skilled Senior Staff Site Reliability Engineer with extensive experience in DevOps/SRE roles and large-scale distributed systems. The ideal candidate will have a proven background in cloud operations, automation, and CI/CD, with a preference for...
Senior Staff Site Reliability Engineer

4 weeks ago

New Delhi, India Movius Full time

Senior Staff Site Reliability EngineerLocation: Bengaluru, KA, 560076Job Description:We are seeking a highly skilled Senior Staff Site Reliability Engineer with extensive experience in DevOps/SRE roles and large-scale distributed systems. The ideal candidate will have a proven background in cloud operations, automation, and CI/CD, with a preference for...
Senior Staff Site Reliability Engineer

3 weeks ago

New Delhi, India Movius Full time

Senior Staff Site Reliability EngineerLocation: Bengaluru, KA, 560076Job Description:We are seeking a highly skilled Senior Staff Site Reliability Engineer with extensive experience in DevOps/SRE roles and large-scale distributed systems. The ideal candidate will have a proven background in cloud operations, automation, and CI/CD, with a preference for...
Senior Site Reliability Engineer

3 weeks ago

New Delhi, India Tata Consultancy Services Full time

Role**: Senior Site Reliability Engineer (SRE)Required Technical Skill Set: Senior Site Reliability Engineer (SRE)Desired Experience Range: 7 - 10 yrsNotice Period: Immediate to 90Days onlyLocation of Requirement: BangaloreWe are currently planning to do a Virtual InterviewJob Description:Key ResponsibilitiesInfrastructure & Application Support- Design,...
Senior Site Reliability Engineer/ Senior Cloud Engineer

2 weeks ago

New Delhi, India CloudHire Full time

Job SummaryThe Technical Manager for Site Reliability Engineering (SRE) will lead a remote team of Site Reliability Engineers, ensuring operational excellence and fostering a high-performing team culture. Reporting to the US-based Director of Systems and Security, this role is responsible for overseeing day-to-day operations, technical mentorship, and...
Senior Site Reliability Engineer

1 day ago

Delhi, India Synechron Full time

We have immediate opportunity forSRE (Senior Site Reliability Engineer) 5+ years.Synechron– MumbaiJob Role: -SRE (Senior Site Reliability Engineer)Job Location: -MumbaiAbout SynechronWe began life in 2001 as a small, self-funded team of technology specialists. Since then, we’ve grown our organization to 14,500+ people, across 58 offices, in 21 countries,...
Senior Site Reliability Engineer – Grafana

3 days ago

New Delhi, India Aptimized Full time

Job Description – Senior Site Reliability Engineer (SRE) – Grafana & ObservabilityPosition: Senior Site Reliability Engineer – Grafana & ObservabilityLocation: [Hyderabad /Hybrid]Experience: 10–20+ yearsOperating globally, Aptimized is a premium ERP, HCM, and Technology Optimization Consulting agency. Our team at Aptimized focuses on helping our...

Americas

Europe

Asia / Oceania

Africa

Senior Site Reliability Engineer