High Salary Site Reliability Engineer

3 weeks ago


Bengaluru, Karnataka, India Xebia Full time
Performance & Reliability Engineer ( Senior, Lead , Principal & Manager)

Hybrid

Location: Pune, Chennai, Bangalore & Gurgaon

Need immediate joiners only

Job description

Role: Performance & Reliability Engineer

Job Location: Gurgaon, Chennai, Pune, Bangalore

Hybrid

Job Overview:

We are seeking a highly skilled and motivated Performance & Reliability Engineer to join our team. In this role, you will be responsible for ensuring the reliability, scalability, and performance of our systems and applications. You will leverage tools such as Dynatrace, CloudWatch, and Python to monitor and optimize system performance, troubleshoot issues, and enhance the overall reliability of our infrastructure with SRE Best Practices.

Key Responsibilities:

- Performance Monitoring & Optimization:
- Use Dynatrace and CloudWatch to monitor system performance and availability.
- Implement performance tuning techniques to ensure high availability and optimal system performance.
- Identify performance bottlenecks and optimize applications and infrastructure for scalability.
- System Observability
- AppDynamics and monitoring dashboards.
- Collaborate with development and operations teams to troubleshoot incidents and provide recommendations for performance improvements.
- Proactively identify areas of risk and implement preventive measures.
- Automation & Scripting:
- Develop automation scripts in Python to enhance monitoring, incident response, and reporting processes.
- Write and maintain Python-based tools for proactive monitoring, alerting, and issue resolution.
- Cloud Monitoring & Alerts:
- Configure CloudWatch for real-time monitoring and alerting of cloud infrastructure,
- Develop and manage dashboards to visualize system health and performance metrics.
- Prepare and present performance reports, incident post-mortems, and improvement recommendations to senior leadership.
- Chaos Engineering, Fault management
- Vulnerability identification, Failure simulation, Stress Management

Required Skills and Experience:

- Strong experience with Dynatrace for application performance monitoring and root cause analysis.
- Proficiency in CloudWatch for monitoring AWS cloud infrastructure, configuring alerts, and visualizing metrics.
- Solid understanding of Python for automating tasks, building performance tools, and writing scripts to enhance operations.
- Experience in analyzing system logs, troubleshooting performance issues, and providing technical recommendations.
- Hands-on experience with cloud environments (AWS preferred), including development knowledge
- Experience with load testing and performance benchmarking.

About Xebia: https://www.linkedin.com/company/xebia/about/

  • Bengaluru, Karnataka, India Landmark Group Full time

    COMPANY- LANDMARK GROUPJob Title: SRE Lead (Engineering & Reliability)Experience: 8-12 yearsJob Summary:We are seeking an experienced and dynamic Site Reliability Engineering (SRE) Lead tooversee the reliability, scalability, and performance of our critical systems. As an SRE Lead,you will play a pivotal role in establishing and implementing SRE practices,...


  • Bengaluru, Karnataka, India WhiteLotus Talent Partners Full time

    We are looking for a L0 and L1 Site Reliability Engineer (SRE) Support to join our Krutrim Cloud Site Reliability operations team and ensure the smooth functioning of our cloud infrastructure powered by OpenStack and Kubernetes. In this role, you will focus on monitoring, basic troubleshooting, and incident response, helping to maintain high system...


  • Bengaluru, Karnataka, India WhiteLotus Talent Partners Full time

    We are looking for a L0 and L1 Site Reliability Engineer (SRE) Support to join our Krutrim Cloud Site Reliability operations team and ensure the smooth functioning of our cloud infrastructure powered by OpenStack and Kubernetes. In this role, you will focus on monitoring, basic troubleshooting, and incident response, helping to maintain high system...


  • Bengaluru, Karnataka, India FIS Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    About the Role :Site Reliability Engineer (SRE)with deep expertise inMainframe technologies like COBOL, JCL, etc. to support and enhance ourCard Management & Payment processing functions. This role will be responsible for ensuring reliability, high availability, scalability, stability and performance of mission-critical mainframe software applications and...


  • Bengaluru, Karnataka, India AppHelix Full time ₹ 9,00,000 - ₹ 12,00,000 per year

    Role DescriptionThis is a full-time on-site role located in Bengaluru for a Site Reliability Engineer. The Site Reliability Engineer will be responsible for maintaining and improving the reliability of AppHelix's systems. Daily tasks include monitoring system performance, troubleshooting issues, managing infrastructure, and supporting software development....


  • Bengaluru, Karnataka, India WhiteLotus Talent Partners Full time ₹ 9,00,000 - ₹ 12,00,000 per year

    We are looking for aL0 and L1 Site Reliability Engineer (SRE) Supportto join our Krutrim Cloud Site Reliability operations team and ensure the smooth functioning of our cloud infrastructure powered byOpenStackandKubernetes. In this role, you will focus onmonitoring,basic troubleshooting, andincident response, helping to maintain high system availability,...


  • Bengaluru, Karnataka, India Success Pact Consulting Pvt Ltd Full time

    Position : Site Reliability EngineerExperience : 5 - 9 YearsLocation : Bangalore, IndiaJob Summary : We are seeking an experienced Site Reliability Engineer (SRE) with 5-9 years of experience to join our Platform Engineering team. This role is crucial for ensuring the high availability, performance, and scalability of our AI-powered code review platform....


  • Bengaluru, Karnataka, India Coforge Full time

    Job Description- Design, implement, and maintain scalable infrastructure to ensure high availability and performance of software applications.- Collaborate with development teams to identify and resolve issues affecting application performance, stability, and reliability.- Develop automated monitoring scripts using tools like Prometheus, Grafana, etc. to...


  • Bengaluru, Karnataka, India Tavant Full time

    About Tavant:With 25+ years of experience building innovative digital products and solutions, Tavant provides impactful results to its customers. It has been the frontrunner in driving digital innovation and tech-enabled transformation across a wide range of industries such as Consumer Lending, Manufacturing, Agtech, Media & Entertainment, and Retail in...


  • Bengaluru, Karnataka, India Tavant Full time

    About Tavant: With 25+ years of experience building innovative digital products and solutions, Tavant provides impactful results to its customers. It has been the frontrunner in driving digital innovation and tech-enabled transformation across a wide range of industries such as Consumer Lending, Manufacturing, Agtech, Media & Entertainment, and Retail in...