Senior SRE Engineer

2 weeks ago


bangalore, India Spot Your Leaders & Consulting Full time

Responsibilities :


Design and implement automation solutions for infrastructure provisioning, configuration

management,promoting consistency and reliability across environments.


● Maintenance of CI/CD pipelines using Jenkins, ensuring efficient deployment processes and

integrating quality checks.

● Manage the applications using Docker and Kubernetes, focusing on scalability, efficiency, and

security.

● Solutioning and Maintaining the secure, scalable, and resilient cloud infrastructure on AWS,

including performance tuning and cost optimization.

● Conduct comprehensive Linux system administration, including performance tuning, security

hardening, and troubleshooting.

● Develop and maintain Java,Python to automate tasks and integrate systems, enhancing

operational efficiency

● Collaborate with development and operations teams to implement SRE principles, fostering a

culture of reliability and performance.

● Monitor system performance, identify bottlenecks, and implement solutions to ensure high

availability and optimal user experience.

● Lead incident response efforts, minimizing impact and conducting post-mortem analyses to

prevent future occurrences.

● Mentor junior team members and contribute to the development of best practices and

standards within the SRE team.

Must Have Skills :

● Minimum 8+ years of hands-on experience in Site Reliability Engineering (SRE) or a similar

role, with a proven track record of managing large-scale, highly available AWS infrastructure.

● Extensive experience with AWS services, including but not limited to EC2, EKS, RDS, S3,

Lambda, IAM, VPC, and Cloud Formation. Proficiency in designing, deploying, and

maintaining complex, cloud-native architectures on AWS.

● Deep understanding of observability principles and best practices. Hands-on experience with

monitoring, logging, tracing, and alerting tools such as New Relic, Grafana, Prometheus, and

ELK stack.

● Proficiency in Java, Python and Shell scripting for automation tasks and infrastructure

management. Experience with configuration management tools like Ansible and Terraform

for infrastructure as code (IaC) deployment.

● Hands-on experience with CI/CD pipelines using tools like Jenkins for automated build, test,

and deployment processes.

● Should have experience in handling containerisation applications like Kubernetes.

● Proven ability to monitor system performance, identify bottlenecks, and optimize resource

utilization. Experience in capacity planning, performance tuning, and scaling AWS resources

as needed.

● Strong troubleshooting skills and experience in incident management and response. Ability to

diagnose and resolve complex issues in a timely manner, ensuring minimal impact on service

availability and performance.

● Excellent interpersonal and communication skills, with the ability to collaborate effectively

across cross-functional teams. Experience in fostering a culture of collaboration, knowledge

sharing, and continuous improvement.

● Ability to lead and mentor junior team members, guiding them in best practices,

methodologies, and tools related to SRE and observability.


Good to Have :

AIOps Knowledge : Familiarity with AIOps (Artificial Intelligence for IT Operations)

concepts and tools such as machine learning, anomaly detection, and predictive analytics

applied to infrastructure monitoring and management. Experience in leveraging AIOps

solutions to enhance observability, automate remediation, and optimize performance in cloud

environments.

Telecom Domain Experience : Exposure to the telecommunications industry, including

knowledge of networking protocols, telecommunications infrastructure, and service delivery


platforms. Experience with telecom-specific technologies such as VoIP, LTE, 5G, IMS, and

SDN/NFV.

OTT (Over-the-Top) Domain Experience : Understanding of Over-the-Top services and

platforms, including streaming media, content delivery networks (CDNs), and

video-on-demand (VOD) services. Experience in managing high-volume, high-availability

OTT platforms and addressing unique challenges related to content delivery, user experience,

and scalability.

SRE - Observability, Microservices,Distributing Architect

2. Experience in Application and infra monitoring tools like,NewRelic, Grafana,

Prometheus, and ELK stack.

3. Programming - Python / Shell Scripting and Terraform/Ansible.

4. Troubleshooting on Kubernetes, Networking and Security.

5. Knowledge in distributed systems and load balancing.

6. Hands on in VPC that should have experience in migration from on-prem to cloud.

7. Devops - CI/CD or Argo-CD.

Good to have:

1. Database - AWS RDS, Mongo, Oracle

2. Domain- AIOps Knowledge and Telecom Domain Experience

3. OTT (Over-the-Top) Domain Experience

4. Knowledge in middleware skills like Kafka, RabbitMq.

5. Good to have experience in ETCD and Redis



Qualification:

● Bachelor's degree in Computer Science, Information Technology, or related field preferred.


  • Devops Engineer

    1 month ago


    bangalore, India Sonata Software Full time

    Job Title: Senior Site Reliability Engineer (SRE)Department: Cloud EngineeringJob Type: Full-time Job Description:We are seeking a highly skilled Senior Site Reliability Engineer (SRE) with extensive experience in Cloud Engineering, particularly in AWS. The ideal candidate should have hands-on expertise in developing Cloud solutions using Terraform or...

  • Devops Engineer

    3 weeks ago


    bangalore, India Sonata Software Full time

    Job Title: Senior Site Reliability Engineer (SRE)Department: Cloud EngineeringJob Type: Full-time Job Description:We are seeking a highly skilled Senior Site Reliability Engineer (SRE) with extensive experience in Cloud Engineering, particularly in AWS. The ideal candidate should have hands-on expertise in developing Cloud solutions using Terraform or...

  • SRE - Bengaluru

    1 month ago


    bangalore, India Virtusa Full time

    SRE - CREQ189656 Description We are looking for senior SRE (Software Reliability Engineer) profiles for our squad with the capacity to become Tech lead.Strong hands-on skills required on: distributed architecture and high availability automation and scripting network and system performance analysis CICD toolchains Infrastructure services, esp. on Kubernetes...

  • SRE - Bengaluru

    3 weeks ago


    bangalore, India Virtusa Full time

    SRE - CREQ189656 Description We are looking for senior SRE (Software Reliability Engineer) profiles for our squad with the capacity to become Tech lead.Strong hands-on skills required on: distributed architecture and high availability automation and scripting network and system performance analysis CICD toolchains Infrastructure services, esp. on Kubernetes...


  • bangalore, India Couchbase Full time

    Every day we tackle new and exciting challenges to empower developers to build modern cloud, mobile, and edge applications that deliver a premium user experience. Couchbase delivers unmatched performance, scalability, flexibility and financial value across cloud, on premises, hybrid, mobile and edge deployments. The database market is undergoing a...


  • bangalore, India Infogain Full time

    SRE / Reliability Engineer (Lead) with skills ITSM Principles, AWS - EKS, AWS - CloudFormation, SRE Architecture, AWS-Apps, GCP-Apps, AWS-Infra, SRE Engineering, AWS DBA for location Any Infogain Base Location (Noida, Gurugram, Bangalore, Mumbai, Pune) Posted on: May 31, Share on Linkedin Share on Twitter Share on Facebook ROLES &...


  • bangalore, India Abha Engineer Full time

    We are looking for a Senior Mechanical EngineerRoles are described below.1. Manpower Planning.2. Preparing of Project Cost.3. Schedule wise work execution.4. As Drawing & quality work execution.5. Client & Third Party Manage.6. Working Team Manage & Review.7. Reporting to Management.8. ROB & FOB Fabrication & Erection Work Knowledge.


  • bangalore, India Abha Engineer Full time

    We are looking for a Senior Mechanical EngineerRoles are described below.1. Manpower Planning.2. Preparing of Project Cost.3. Schedule wise work execution.4. As Drawing & quality work execution.5. Client & Third Party Manage.6. Working Team Manage & Review.7. Reporting to Management.8. ROB & FOB Fabrication & Erection Work Knowledge.

  • Platform SRE Engineer

    2 months ago


    bangalore, India DigiCert Full time

    ABOUT DIGICERT We're a leading, global security authority that's disrupting our own category. Our encryption is trusted by the major ecommerce brands, the world's largest companies, the major cloud providers, entire country financial systems, entire internets of things and even down to the little things like surgically embedded pacemakers. We help...


  • bangalore, India Virtusa Full time

    SRE with AIOP and Dynatrace - CREQ181002 Description Knowledge & Experience:Minimum of 6 years of relevant work experience in critical production environmentsExperience in enabling observability within applications to extract appropriate telemetry into suitable back ends like DynatraceHands-on experience of curating Service Level Objectives, defining Error...


  • bangalore, India Virtusa Full time

    SRE with AIOP and Dynatrace - CREQ181002 Description Knowledge & Experience:Minimum of 6 years of relevant work experience in critical production environmentsExperience in enabling observability within applications to extract appropriate telemetry into suitable back ends like DynatraceHands-on experience of curating Service Level Objectives, defining Error...


  • bangalore, India Spot Your Leaders & Consulting Full time

    Roles and Responsibilities:● Own the application, APM and work with Developers and Systems engineers to Build, Release, Monitor and run the services reliability exceeding the agreed SLAs..● Write software to automate to create custom dashboards for APM and infra monitoring tools like New Relic,datadog,grafana, etc.● Writeautomation to reduce toil and...

  • Cloud SRE Engineer

    3 weeks ago


    bangalore, India Australia and New Zealand Banking Group Limited (ANZ) Full time

    Cloud SRE Engineer Cloud SRE Engineer Req ID: Department: Tech CDIS Cloud, Data and Analytics Division: Technology Location: Bengaluru About Us At ANZ, we're applying new ways technology and data can be harnessed as we work towards a common goal: to improve the financial wellbeing and sustainability of our millions of customers. Our...


  • bangalore, India American Express Full time

    You Lead the Way. We’ve Got Your Back. With the right backing, people and businesses have the power to progress in incredible ways. When you join Team Amex, you become part of a global and diverse community of colleagues with an unwavering commitment to back our customers, communities and each other. Here, you’ll learn and grow as we help you...


  • bangalore, India AMEX Full time

    You Lead the Way. Weve Got Your Back. With the right backing, people and businesses have the power to progress in incredible ways. When you join Team Amex, you become part of a global and diverse community of colleagues with an unwavering commitment to back our customers, communities and each other. Here, youll learn and grow as we help you create a...


  • bangalore, India Spot Your Leaders & Consulting Full time

    Roles and Responsibilities: ● Own the application, APM and work with Developers and Systems engineers to Build, Release, Monitor and run the services reliability exceeding the agreed SLAs.. ● Write software to automate to create custom dashboards for APM and infra monitoring tools like New Relic,datadog,grafana, etc. ● Writeautomation to reduce...

  • Linux SRE Engineer

    7 days ago


    bangalore, India Central Business Solutions, Inc Full time

    The Enterprise Computing (EC) Core Infrastructure Services organization is looking for a Site Reliability Engineering to manage the operations, reliability and services for Morgan Stanley's suite of Software Distribution product ecosystem products that are part of Artifact Curation and Distribution Control squad. This squad is responsible for providing...

  • Vice President- SRE

    3 weeks ago


    bangalore, India Angel One Full time

    Key ResponsibilitiesRun Engineering functions, including managing people and a team across multiple locationBuilding high-performing teams by developing and nurturing Engineering teams through cultural change,Supporting, challenging and building consensus on design directions/decisions to ensure they are viable from a Cloud perspective.Ability to work in a...


  • bangalore, India Arting Digital Private Limited Full time

    Posting title:      Site Reliability Engineer (SRE)Experience:        7+ YearsLocation:             BangaloreWork mode:       HybridPrimary skills:    Terraform, Ansible, AWS, Kubernetes, Openshift, CI/CDQualification:      Any Engineering/ Computers degreeResponsibilities :• Design, build, and maintain highly available,...

  • Staff IT SRE Engineer

    3 months ago


    bangalore, India NVIDIA Full time

    NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It’s a unique legacy of innovation that’s fueled by great technology—and amazing people. Today, we’re tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers,...