Urgent Search: Sr. Site Reliability Engineer- Azure

3 days ago


Chennai, India XenonStack Full time

Job Description

- Gathering Project Requirements from Stakeholders along with Business Analysts and Project Managers
- Break down complex problems and projects into manageable goals
- Handle High severity incident and situation.
- Designing high level Schematics of the infrastructure, tools and process needed
- Performing and in depth analysis of the possible risk and countermeasures for them
- Create a bridge between development and operations by applying software engineering mindset to system administration topics
- Configuration management platform understanding and experience (Chef/Puppet/Ansible)
- Release engineering, which involves defining best practices to ensure software releases are consistent and repeatable.
- Alerting, being on-call, and troubleshooting, along with emergency and incident response and postmortems.
- Know how best to monitor systems and react when things go wrong, constantly writing and rewriting response playbooks to reduce the time to fix any breakdown which may occur
- Involves documenting an incident, understanding all contributing root causes, and implementing future preventive actions.
- Highly developed skills in managing 24x7 production support comprising of Incident, Problem, Change management
- Troubleshooting Support Escalation
- On-Call Process Optimization
- Documenting Knowledge
- Optimizing SDLC (Software Development Life Cycle)

Technical Requirement -

- Strong understanding of cloud-based architecture and cloud operations. Hands-on experience with Azure
- Experience in administration/build/management of Linux systems
- Foundational understanding of Infrastructure and Platform Technology stacks
- Strong understanding of Networking concepts and theories, such as different protocols (TCP/IP, UDP, ICMP, etc), MAC addresses, IP packets, DNS, OSI layers, and load balancing
- Working knowledge of Infrastructure and Application monitoring platforms
- Understanding of the core DevOps practices (CI/CD pipeline, release management etc)
- Ability to write code using any one modern programming language (Python, JavaScript, Ruby etc). Additional scripting skills are preferred
- Prior experience in Cloud management automation tools (Terraform/CloudFormation etc) is preferred
- Experience with source code management software and API automation is preferred.
- Deep Understanding of architecture and operations of Container Orchestration tools eg Kubernetes
- Deep understanding of Know Applications ie JAVA, Nodejs, Golang
- Deep understanding of Databases and SQL
- Strong understanding of BigData Infrastructure.
- Understanding of Incident management and Event Register Management
- Knowledge of SDLC methodologies and best practices including Waterfall Process, Agile methodologies, deployment automation, code reviews, and test-driven development

Professional Attributes -

- Excellent communication skills
- Attention to detail
- Analytical mind and Problem Solving Aptitude
- Strong Organizational skills
- Visual Thinking



  • Chennai, Tamil Nadu, India Concord Full time

    SRE Sr. Engineers (Individual Contributors)Key Attributes:Strong SRE (Site Reliability Engineering) experienceDevOps skills – CI/CD, monitoring, automation, infrastructure as code, etc.Excellent troubleshooting and debugging skills (infrastructure + application level)Perseverance – must push through complex/challenging issues without giving upAble to...


  • Chennai, Tamil Nadu, India Ford Global Career Site Full time ₹ 1,04,000 - ₹ 1,30,878 per year

    Be at the Forefront of Mobility's Future: Join Ford as a Site Reliability EngineerEnterprise Technology is the engine driving the future of transportation, and we're looking for a talented Site Reliability Engineer (SRE) to help us redefine mobility. In this role, you'll leverage cutting-edge technology to enhance customer experiences, improve lives, and...


  • Chennai, Tamil Nadu, India Grootan Technologies Full time US$ 90,000 - US$ 1,20,000 per year

    About the RoleWe are seeking a skilled Site Reliability Engineer (SRE) with 4 to 5 years of hands-on experience to join our engineering team. In this role, you will be responsible for building and maintaining reliable, scalable, and secure infrastructure to support our applications. You will leverage your expertise in automation, cloud platforms, and...


  • Chennai, India Grootan Technologies Full time

    About the Role We are seeking a skilled Site Reliability Engineer (SRE) with 4 to 5 years of hands-on experience to join our engineering team. In this role, you will be responsible for building and maintaining reliable, scalable, and secure infrastructure to support our applications. You will leverage your expertise in automation, cloud platforms, and...


  • Chennai, India Zyoin Group Full time

    Position: Site Reliability Engineer (SRE) Experience: 4 – 10 Years Location: Chennai (Hybrid – 2 days in office) Role Overview: We are seeking a Site Reliability Engineer (SRE) responsible for leading reliability practices, ensuring scalable systems, and collaborating with development teams to maintain highly available services. Key...

  • Sr Cloud Engineer

    2 weeks ago


    Chennai, Tamil Nadu, India 1CloudHub Full time ₹ 15,00,000 - ₹ 20,00,000 per year

    Company Description1CloudHub is a global cloud transformation and engineering company, delivering cutting-edge cloud solutions for multi-billion dollar businesses worldwide. We specialize in cloud advisory, IaaS and engineering, cloud applications, and 24x7 managed services. Our expert-led Nimbus framework ensures rapid and reliable cloud migration and...


  • Chennai, Tamil Nadu, India Trimble Inc. Full time

    Job DescriptionJob SummaryWe are seeking a motivated Site Reliability Engineer (SRE) Level 1 to enhance the infrastructure and operational reliability of our ERP product, specifically within Azure and Windows environments. The ideal candidate will utilize SRE principles to ensure high system availability, stability, and performance while collaborating...


  • Chennai, Tamil Nadu, India Trimble Inc. Full time US$ 1,04,000 - US$ 1,30,878 per year

    Job SummaryWe are seeking a motivated Site Reliability Engineer (SRE) Level 1 to enhance the infrastructure and operational reliability of our ERP product, specifically within Azure and Windows environments. The ideal candidate will utilize SRE principles to ensure high system availability, stability, and performance while collaborating closely with...


  • Chennai, Tamil Nadu, India Parkar Digital Full time ₹ 9,00,000 - ₹ 12,00,000 per year

    About Parkar:We love building software products. With a decade of experience and a global presence across four countries, we've established ourselves as a trusted partner for over 100 organizations, helping them leverage technology to drive transformative growth. Staying at the forefront of technological advancements, we actively explore and integrate the...


  • Chennai, India Parkar Digital Full time

    About Parkar: We love building software products. With a decade of experience and a global presence across four countries, we've established ourselves as a trusted partner for over 100 organizations, helping them leverage technology to drive transformative growth. Staying at the forefront of technological advancements, we actively explore and integrate the...