Lead SRE

4 weeks ago


Bengaluru, Karnataka, India Delta Air Lines Full time

Responsibilities:

  • Engage in and improve the whole lifecycle of servicesfrom inception and design through deployment, operation, and refinement
  • Support capacity planning, availability, scalability, security and latency considerations for new infrastructure and service provisioning as appropriate
  • Responsible for improvements to end-to-end availability and performance of mission critical services and build automation to prevent problem recurrence.
  • Partner with other SREs to bring best practices or learnings from across the organization to them
  • Scale and optimize existing infrastructure and services sustainably through mechanisms, including automation, and evolve them by improving reliability and efficiency
  • Manage end-to-end availability and performance of mission-critical services and build automation to prevent problem recurrence
  • Maintain infrastructure (infrastructure as code) and services by measuring, and monitoring system metrics to proactively identify operational efficiencies, potential outages and security threats in Development, UAT, Staging and Production environments
  • Practice sustainable incident responseand blameless postmortems
  • Build infrastructure and drive projects that break things with the aim to improve the robustness of production systems
  • Use the core Site Reliability Engineering principles of change management, monitoring, emergency response, capacity planning, and production readiness reviews to run the platform
  • Step back to observe patterns and develop innovative tools and automation to eliminate or minimize menial tasks. Use those learnings to drive the best operational practices
  • Develop and maintain solution and operational documentation and designs for all infrastructure and services within the scope of SRE
  • Preserve operational visibility and response capabilities fixing and improving our dashboards, alerts, and automation
  • Maintain operational uptime and reliability by participating in triage and issue support calls for mission critical systems
  • Partner with business and technical product owners to set SLOs / SLIs / error budgets to manage reliability of infrastructure and applications

Required Qualifications:

  • Software Engineering, Computer Science equivalent, or STEM degree (Desirable) or commensurate experience
  • 6+ years of total software engineering experience using Kubernetes, AWS Native components/Azure/GCP, CloudWatch, Dynatrace
  • 3+ years of support a production system on a DevOps team
  • 2+ yearsof experience Architecting using AWS Cloud
  • Strong experience setting SLOs / SLIs / error budgets and managing of reliability for infrastructure and applicationsusing Kubernetes, AWS Native components, CloudWatch, Dynatrace
  • Can mentor team of less experienced Full-stack developers who are learning the AWS environment.
  • Proficient in one or more of the following scripting languages: JavaScript, Nodejs, Python, Maven, Ansible, Bash, etc.
  • Experience handling large numbers of diverse systems with configuration management systems like Puppet, Chef, Ansible, GitLab CI
  • Understanding of standard networking protocols and components such as HTTP, DNS, ECMP, TCP/IP, ICMP, the OSI Model, Subnetting and Load Balancing strategies
  • Experience in Serverless Application Framework
  • Experience in containerized workloads and management platforms such as Docker or Kubernetes
  • Familiarity with distributed systems is a plus including Microservices
  • Experience in Infrastructure automation tools such as CloudFormation, Terraform
  • Understanding of CI/CD processes and experience with deployment automation tools such as Code Pipeline, Code Deploy, Jenkins, Bamboo
  • Strong debugging, troubleshooting, and problem-solving skills
  • Effective communication, collaboration & negotiation skills with the ability to interface with various business units and third parties
  • Must have the ability to listen to customers and colleagues; convey ideas effectively; prepare written documentation
  • Experience liaising with developers, operations staff and third-party resources
  • Experience with API integration projects
  • Proven history of toil elimination by leveraging automation
  • Strong background using tools like PagerDuty for managing incidents
  • Strong experience with monitoring and alerting systems like Prometheus, Grafana, Datadog.

Preferred Qualifications:

  • AWS Certified DevOps Engineer or equivalent cloud professional SRE certifications.
  • A mindset focused on automation, measurement and efficiency.


  • Bengaluru, Karnataka, India Infogain Full time

    SRE / Reliability Engineer (Lead) with skills ITSM Principles, AWS - EKS, AWS - CloudFormation, SRE Architecture, AWS-Apps, GCP-Apps, AWS-Infra, SRE Engineering, AWS DBA for location Any Infogain Base Location (Noida, Gurugram, Bangalore, Mumbai, Pune)ROLES & RESPONSIBILITIESCore Skills8 to 10 years of experience in DevOps role with focus on GCP Cloud...

  • Lead SRE

    2 weeks ago


    Bengaluru, Karnataka, India Delta Air Lines Full time

    Company: XYZ Tech SolutionsPosition: Senior Site Reliability EngineerResponsibilities:Engaging in and improving the entire lifecycle of services - from inception and design through deployment, operation, and refinementSupporting capacity planning, availability, scalability, security, and latency considerations for new infrastructure and service provisioning...


  • Bengaluru, Karnataka, India AQUASoft Full time

    AQUASoft is a software development company that specializes in creating custom-made products and software solutions for various clients, including Fortune 500 giants and medium-sized businesses. Our team of highly skilled and experienced software engineers across two continents utilize the latest frameworks and state-of-the-art technologies to build robust,...

  • SRE Consultant

    2 weeks ago


    Bengaluru, Karnataka, India Wipro Full time

    Job Title: Lead Cloud/SRE Consultant: Location: Pune/Bangalore Expected to drive and contribute to research, design, documentation, and modifications to software specifications throughout the production life cycle with optimal technical solutions across the Cloud Infrastructure platforms stack and also Work with the Engineering, Product, Delivery and...


  • Bengaluru, Karnataka, India AQUASoft Full time

    AQUASoft is a software development company that specializes in creating custom-made products and software solutions for various clients, including Fortune 500 giants and medium-sized businesses. Our team of highly skilled and experienced software engineers across two continents utilize the latest frameworks and state-of-the-art technologies to build robust,...


  • Bengaluru, Karnataka, India Virtusa Full time

    SRE with AIOP and Dynatrace - CREQ181002 DescriptionKnowledge & Experience:Minimum of 6 years of relevant work experience in critical production environments Experience in enabling observability within applications to extract appropriate telemetry into suitable back ends like Dynatrace Hands-on experience of curating Service Level Objectives, defining Error...

  • SRE - Bengaluru

    2 weeks ago


    Bengaluru, Karnataka, India Virtusa Full time

    SRE - CREQ189656 Description We are looking for senior SRE (Software Reliability Engineer) profiles for our squad with the capacity to become Tech lead.Strong hands-on skills required on:distributed architecture and high availabilityautomation and scriptingnetwork and systemperformance analysisCICD toolchainsInfrastructure services, esp. on...

  • Lead SRE

    2 weeks ago


    Bengaluru, Karnataka, India Thomson Reuters Full time

    About the Role:In this opportunity as Lead SRE - Global Command Center, you will:Run the production environment by monitoring availability and taking a holistic view of system health.Build software and systems to manage platform infrastructure and applicationsImprove reliability, quality, and time-to-market of our suite of software solutionsMeasure and...

  • Lead SRE

    2 weeks ago


    Bengaluru, Karnataka, India Thomson Reuters Full time

    As an employee at Thomson Reuters, you will play a role in shaping and leading the global knowledge economy. Our technology drives global markets and helps professionals around the world make decisions that matter. As the world's leading provider of intelligent information, we want your unique perspective to create the solutions that advance our...

  • Senior SRE

    2 weeks ago


    Bengaluru, Karnataka, India Dautom Full time

    Client Introduction: In this role, you will have the opportunity to work closely with one of our esteemed clients. This client is a global leader in the IT industry, known for its commitment to quality and innovation. They have chosen Dautom as their trusted partner for their upcoming projects. Job Title: Senior SRE - Cloud Administrator Job Description ...

  • Senior SRE

    2 weeks ago


    Bengaluru, Karnataka, India Dautom Full time

    Client Introduction:In this role, you will have the opportunity to work closely with one of our esteemed clients. This client is a global leader in the IT industry, known for its commitment to quality and innovation. They have chosen Dautom as their trusted partner for their upcoming projects.Job Title: Senior SRE - Cloud AdministratorJob...

  • Vice President- SRE

    2 weeks ago


    Bengaluru, Karnataka, India Angel One Full time

    Key ResponsibilitiesRun Engineering functions, including managing people and a team across multiple locationBuilding high-performing teams by developing and nurturing Engineering teams through cultural change,Supporting, challenging and building consensus on design directions/decisions to ensure they are viable from a Cloud perspective.Ability to work in a...

  • Senior SRE Engineer

    2 weeks ago


    Bengaluru, Karnataka, India Taggd Full time

    Key Skills Sets Linux Administration DeVops Docker Kubernetes AWS Python Ansible Jenkins Observability tools like New Relic Shift timings: 8am IST to 5pm IST Position Overview:The Senior SRE will be responsible for leading initiatives to improve system reliability, automate operational processes, and ensure the scalability and security of our...

  • Vice President- SRE

    2 weeks ago


    Bengaluru, Karnataka, India Angel One Full time

    Key Responsibilities Run Engineering functions, including managing people and a team across multiple location Building high-performing teams by developing and nurturing Engineering teams through cultural change, Supporting, challenging and building consensus on design directions/decisions to ensure they are viable from a Cloud perspective. Ability to work...

  • Vice President- Sre

    2 weeks ago


    Bengaluru, Karnataka, India Angel One Full time

    Key Responsibilities Run Engineering functions, including managing people and a team across multiple location Building high-performing teams by developing and nurturing Engineering teams through cultural change, Supporting, challenging and building consensus on design directions/decisions to ensure they are viable from a Cloud perspective.Ability to work in...

  • SRE Platform Engg

    2 weeks ago


    Bengaluru, Karnataka, India FIS Full time

    Position Type :Full timeType Of Hire :Experienced (relevant combo of work and education)Education Desired :Bachelor of Computer EngineeringTravel Percentage :0%SRE Platform Engg (Devops + Production Support)Are you curious, motivated, and forward-thinking? At FIS you'll have the opportunity to work on some of the most challenging and relevant issues in...

  • SRE Platform Engg

    2 weeks ago


    Bengaluru, Karnataka, India Jobs for Humanity Full time

    Job DescriptionPosition Type :Full timeType Of Hire :Experienced (relevant combo of work and education)Education Desired :Bachelor of Computer EngineeringTravel Percentage :0%SRE Platform Engg (Devops + Production Support)Are you curious, motivated, and forward-thinking? At FIS you'll have the opportunity to work on some of the most challenging and relevant...

  • Sre

    2 weeks ago


    Bengaluru, Karnataka, India Virtusa Full time

    We are looking for senior SRE (Software Reliability Engineer) profiles for our squad with the capacity to become Tech lead.Strong hands-on skills required on:distributed architecture and high availabilityautomation and scriptingnetwork and systemperformance analysisCICD toolchainsInfrastructure services, esp. on KubernetesMonitoring solutionAgile methodology...

  • SRE Platform Engg

    2 weeks ago


    Bengaluru, Karnataka, India Jobs for Humanity Full time

    Job Description Position Type : Full time Type Of Hire : Experienced (relevant combo of work and education) Education Desired : Bachelor of Computer Engineering Travel Percentage : 0% SRE Platform Engg (Devops + Production Support) Are you curious, motivated, and forward-thinking? At FIS you'll have the opportunity to work on some of the most...

  • SRE Platform Engg

    4 weeks ago


    Bengaluru, Karnataka, India FIS Global Full time

    Position Type : Full time Type Of Hire : Experienced (relevant combo of work and education) Education Desired : Bachelor of Computer Engineering Travel Percentage : 0%SRE Platform Engg (Devops + Production Support)Are you curious, motivated, and forward-thinking? At FIS you'll have the opportunity to work on some of the most challenging and relevant issues...