Site Reliability Engineer

4 weeks ago


Bengaluru, Karnataka, India Xebia Full time
Performance & Reliability Engineer ( Senior, Lead , Principal & Manager) Hybrid Location: Pune, Chennai, Bangalore & Gurgaon Need immediate joiners only Job description Role: Performance & Reliability Engineer Job Location: Gurgaon, Chennai, Pune, Bangalore Hybrid Job Overview: We are seeking a highly skilled and motivated Performance & Reliability Engineer to join our team.
In this role, you will be responsible for ensuring the reliability, scalability, and performance of our systems and applications.
You will leverage tools such as Dynatrace , Cloud Watch , and Python to monitor and optimize system performance, troubleshoot issues, and enhance the overall reliability of our infrastructure with SRE Best Practices .
Key Responsibilities: Performance Monitoring & Optimization: Use Dynatrace and Cloud Watch to monitor system performance and availability.
Implement performance tuning techniques to ensure high availability and optimal system performance.
Identify performance bottlenecks and optimize applications and infrastructure for scalability.
System Observability App Dynamics and monitoring dashboards.
Collaborate with development and operations teams to troubleshoot incidents and provide recommendations for performance improvements.
Proactively identify areas of risk and implement preventive measures.
Automation & Scripting: Develop automation scripts in Python to enhance monitoring, incident response, and reporting processes.
Write and maintain Python-based tools for proactive monitoring, alerting, and issue resolution.
Cloud Monitoring & Alerts: Configure Cloud Watch for real-time monitoring and alerting of cloud infrastructure, Develop and manage dashboards to visualize system health and performance metrics.
Prepare and present performance reports, incident post-mortems, and improvement recommendations to senior leadership.
Chaos Engineering, Fault management Vulnerability identification, Failure simulation, Stress Management Required Skills and Experience: Strong experience with Dynatrace for application performance monitoring and root cause analysis.
Proficiency in Cloud Watch for monitoring AWS cloud infrastructure, configuring alerts, and visualizing metrics.
Solid understanding of Python for automating tasks, building performance tools, and writing scripts to enhance operations.
Experience in analyzing system logs, troubleshooting performance issues, and providing technical recommendations.
Hands-on experience with cloud environments (AWS preferred), including development knowledge Experience with load testing and performance benchmarking.
About Xebia:

  • Bengaluru, Karnataka, India Enterprise Minds, Inc Full time

    We're Hiring | Site Reliability Engineer | 8-10 years


  • Bengaluru, Karnataka, India WhiteLotus Talent Partners Full time

    We are looking for a L0 and L1 Site Reliability Engineer (SRE) Support to join our Krutrim Cloud Site Reliability operations team and ensure the smooth functioning of our cloud infrastructure powered by OpenStack and Kubernetes . In this role, you will focus on monitoring , basic troubleshooting , and incident response , helping to maintain high...


  • Bengaluru, Karnataka, India WhiteLotus Talent Partners Full time

    We are looking for a L0 and L1 Site Reliability Engineer (SRE) Support to join our Krutrim Cloud Site Reliability operations team and ensure the smooth functioning of our cloud infrastructure powered by OpenStack and Kubernetes. In this role, you will focus on monitoring, basic troubleshooting, and incident response, helping to maintain high system...


  • Bengaluru, Karnataka, India beBeeReliability Full time ₹ 2,00,00,000 - ₹ 2,50,00,000

    Role OverviewAs a Site Reliability Engineer, you will play a pivotal role in driving innovation and modernizing complex systems by leveraging cutting-edge technologies and collaboration with cross-functional teams.


  • Bengaluru, Karnataka, India Coforge Full time

    Job Description- Design, implement, and maintain scalable infrastructure to ensure high availability and performance of software applications.- Collaborate with development teams to identify and resolve issues affecting application performance, stability, and reliability.- Develop automated monitoring scripts using tools like Prometheus, Grafana, etc. to...


  • Bengaluru, Karnataka, India Infrasoft Technologies Limited Full time

    Job DescriptionJob Title: DeveloperWork Location: Bangalore, KarnatakaExperience Range: 68 YearsJob Description:We are looking for a skilled Developer with strong hands-on experience in Site Reliability Engineering (SRE), Java, JavaScript, and Production Support. The ideal candidate should have a solid background in application monitoring and troubleshooting...


  • Bengaluru, Karnataka, India Collabera Full time

    Job Description As a Principal/Chief Site Reliability Engineer , you will play a critical role in designing, developing, and maintaining scalable and highly reliable systems. You'll work closely with development teams to improve system reliability, monitor critical applications, and design fail-proof infrastructure. Responsibilities Design and implement...


  • Bengaluru, Karnataka, India Xebia Full time

    We are seeking an experienced AWS DevOps Engineer with strong expertise in Observability and Site Reliability Engineering (SRE) to design, build, and manage scalable, reliable, and secure cloud environments. The role requires hands-on experience with AWS services, Infrastructure as Code (IaC), CI/CD, monitoring & observability frameworks, and incident...


  • Bengaluru, Karnataka, India NatWest Group Full time

    Join us as a Site Reliability Engineer In this key role you ll support the improvement of non-functional and operational characteristics such as availability performance efficiency change management monitoring security incident response and capacity planning of our products and services You ll enjoy significant stakeholder interaction working in...

  • Site Reliability Engineer

    14 minutes ago


    Bengaluru, Karnataka, India Xebia Full time

    We are seeking an experienced AWS DevOps Engineer with strong expertise in Observability and Site Reliability Engineering (SRE) to design, build, and manage scalable, reliable, and secure cloud environments. The role requires hands-on experience with AWS services, Infrastructure as Code (IaC), CI/CD, monitoring & observability frameworks, and incident...