Lead SRE
4 weeks ago
Responsibilities:
- Engage in and improve the whole lifecycle of servicesfrom inception and design through deployment, operation, and refinement
- Support capacity planning, availability, scalability, security and latency considerations for new infrastructure and service provisioning as appropriate
- Responsible for improvements to end-to-end availability and performance of mission critical services and build automation to prevent problem recurrence.
- Partner with other SREs to bring best practices or learnings from across the organization to them
- Scale and optimize existing infrastructure and services sustainably through mechanisms, including automation, and evolve them by improving reliability and efficiency
- Manage end-to-end availability and performance of mission-critical services and build automation to prevent problem recurrence
- Maintain infrastructure (infrastructure as code) and services by measuring, and monitoring system metrics to proactively identify operational efficiencies, potential outages and security threats in Development, UAT, Staging and Production environments
- Practice sustainable incident responseand blameless postmortems
- Build infrastructure and drive projects that break things with the aim to improve the robustness of production systems
- Use the core Site Reliability Engineering principles of change management, monitoring, emergency response, capacity planning, and production readiness reviews to run the platform
- Step back to observe patterns and develop innovative tools and automation to eliminate or minimize menial tasks. Use those learnings to drive the best operational practices
- Develop and maintain solution and operational documentation and designs for all infrastructure and services within the scope of SRE
- Preserve operational visibility and response capabilities fixing and improving our dashboards, alerts, and automation
- Maintain operational uptime and reliability by participating in triage and issue support calls for mission critical systems
- Partner with business and technical product owners to set SLOs / SLIs / error budgets to manage reliability of infrastructure and applications
Required Qualifications:
- Software Engineering, Computer Science equivalent, or STEM degree (Desirable) or commensurate experience
- 6+ years of total software engineering experience using Kubernetes, AWS Native components/Azure/GCP, CloudWatch, Dynatrace
- 3+ years of support a production system on a DevOps team
- 2+ yearsof experience Architecting using AWS Cloud
- Strong experience setting SLOs / SLIs / error budgets and managing of reliability for infrastructure and applicationsusing Kubernetes, AWS Native components, CloudWatch, Dynatrace
- Can mentor team of less experienced Full-stack developers who are learning the AWS environment.
- Proficient in one or more of the following scripting languages: JavaScript, Nodejs, Python, Maven, Ansible, Bash, etc.
- Experience handling large numbers of diverse systems with configuration management systems like Puppet, Chef, Ansible, GitLab CI
- Understanding of standard networking protocols and components such as HTTP, DNS, ECMP, TCP/IP, ICMP, the OSI Model, Subnetting and Load Balancing strategies
- Experience in Serverless Application Framework
- Experience in containerized workloads and management platforms such as Docker or Kubernetes
- Familiarity with distributed systems is a plus including Microservices
- Experience in Infrastructure automation tools such as CloudFormation, Terraform
- Understanding of CI/CD processes and experience with deployment automation tools such as Code Pipeline, Code Deploy, Jenkins, Bamboo
- Strong debugging, troubleshooting, and problem-solving skills
- Effective communication, collaboration & negotiation skills with the ability to interface with various business units and third parties
- Must have the ability to listen to customers and colleagues; convey ideas effectively; prepare written documentation
- Experience liaising with developers, operations staff and third-party resources
- Experience with API integration projects
- Proven history of toil elimination by leveraging automation
- Strong background using tools like PagerDuty for managing incidents
- Strong experience with monitoring and alerting systems like Prometheus, Grafana, Datadog.
Preferred Qualifications:
- AWS Certified DevOps Engineer or equivalent cloud professional SRE certifications.
- A mindset focused on automation, measurement and efficiency.
-
SRE / Reliability Engineer (Lead)
2 weeks ago
Bengaluru, Karnataka, India Infogain Full timeSRE / Reliability Engineer (Lead) with skills ITSM Principles, AWS - EKS, AWS - CloudFormation, SRE Architecture, AWS-Apps, GCP-Apps, AWS-Infra, SRE Engineering, AWS DBA for location Any Infogain Base Location (Noida, Gurugram, Bangalore, Mumbai, Pune)ROLES & RESPONSIBILITIESCore Skills8 to 10 years of experience in DevOps role with focus on GCP Cloud...
-
Lead SRE
2 weeks ago
Bengaluru, Karnataka, India Delta Air Lines Full timeCompany: XYZ Tech SolutionsPosition: Senior Site Reliability EngineerResponsibilities:Engaging in and improving the entire lifecycle of services - from inception and design through deployment, operation, and refinementSupporting capacity planning, availability, scalability, security, and latency considerations for new infrastructure and service provisioning...
-
Lead Cloud SRE Engineer
2 weeks ago
Bengaluru, Karnataka, India AQUASoft Full timeAQUASoft is a software development company that specializes in creating custom-made products and software solutions for various clients, including Fortune 500 giants and medium-sized businesses. Our team of highly skilled and experienced software engineers across two continents utilize the latest frameworks and state-of-the-art technologies to build robust,...
-
SRE Consultant
2 weeks ago
Bengaluru, Karnataka, India Wipro Full timeJob Title: Lead Cloud/SRE Consultant: Location: Pune/Bangalore Expected to drive and contribute to research, design, documentation, and modifications to software specifications throughout the production life cycle with optimal technical solutions across the Cloud Infrastructure platforms stack and also Work with the Engineering, Product, Delivery and...
-
Lead Cloud SRE Engineer
2 weeks ago
Bengaluru, Karnataka, India AQUASoft Full timeAQUASoft is a software development company that specializes in creating custom-made products and software solutions for various clients, including Fortune 500 giants and medium-sized businesses. Our team of highly skilled and experienced software engineers across two continents utilize the latest frameworks and state-of-the-art technologies to build robust,...
-
SRE with AIOP and Dynatrace
2 weeks ago
Bengaluru, Karnataka, India Virtusa Full timeSRE with AIOP and Dynatrace - CREQ181002 DescriptionKnowledge & Experience:Minimum of 6 years of relevant work experience in critical production environments Experience in enabling observability within applications to extract appropriate telemetry into suitable back ends like Dynatrace Hands-on experience of curating Service Level Objectives, defining Error...
-
SRE - Bengaluru
2 weeks ago
Bengaluru, Karnataka, India Virtusa Full timeSRE - CREQ189656 Description We are looking for senior SRE (Software Reliability Engineer) profiles for our squad with the capacity to become Tech lead.Strong hands-on skills required on:distributed architecture and high availabilityautomation and scriptingnetwork and systemperformance analysisCICD toolchainsInfrastructure services, esp. on...
-
Lead SRE
2 weeks ago
Bengaluru, Karnataka, India Thomson Reuters Full timeAbout the Role:In this opportunity as Lead SRE - Global Command Center, you will:Run the production environment by monitoring availability and taking a holistic view of system health.Build software and systems to manage platform infrastructure and applicationsImprove reliability, quality, and time-to-market of our suite of software solutionsMeasure and...
-
Lead SRE
2 weeks ago
Bengaluru, Karnataka, India Thomson Reuters Full timeAs an employee at Thomson Reuters, you will play a role in shaping and leading the global knowledge economy. Our technology drives global markets and helps professionals around the world make decisions that matter. As the world's leading provider of intelligent information, we want your unique perspective to create the solutions that advance our...
-
Senior SRE
2 weeks ago
Bengaluru, Karnataka, India Dautom Full timeClient Introduction: In this role, you will have the opportunity to work closely with one of our esteemed clients. This client is a global leader in the IT industry, known for its commitment to quality and innovation. They have chosen Dautom as their trusted partner for their upcoming projects. Job Title: Senior SRE - Cloud Administrator Job Description ...
-
Senior SRE
2 weeks ago
Bengaluru, Karnataka, India Dautom Full timeClient Introduction:In this role, you will have the opportunity to work closely with one of our esteemed clients. This client is a global leader in the IT industry, known for its commitment to quality and innovation. They have chosen Dautom as their trusted partner for their upcoming projects.Job Title: Senior SRE - Cloud AdministratorJob...
-
Vice President- SRE
2 weeks ago
Bengaluru, Karnataka, India Angel One Full timeKey ResponsibilitiesRun Engineering functions, including managing people and a team across multiple locationBuilding high-performing teams by developing and nurturing Engineering teams through cultural change,Supporting, challenging and building consensus on design directions/decisions to ensure they are viable from a Cloud perspective.Ability to work in a...
-
Senior SRE Engineer
2 weeks ago
Bengaluru, Karnataka, India Taggd Full timeKey Skills Sets Linux Administration DeVops Docker Kubernetes AWS Python Ansible Jenkins Observability tools like New Relic Shift timings: 8am IST to 5pm IST Position Overview:The Senior SRE will be responsible for leading initiatives to improve system reliability, automate operational processes, and ensure the scalability and security of our...
-
Vice President- SRE
2 weeks ago
Bengaluru, Karnataka, India Angel One Full timeKey Responsibilities Run Engineering functions, including managing people and a team across multiple location Building high-performing teams by developing and nurturing Engineering teams through cultural change, Supporting, challenging and building consensus on design directions/decisions to ensure they are viable from a Cloud perspective. Ability to work...
-
Vice President- Sre
2 weeks ago
Bengaluru, Karnataka, India Angel One Full timeKey Responsibilities Run Engineering functions, including managing people and a team across multiple location Building high-performing teams by developing and nurturing Engineering teams through cultural change, Supporting, challenging and building consensus on design directions/decisions to ensure they are viable from a Cloud perspective.Ability to work in...
-
SRE Platform Engg
2 weeks ago
Bengaluru, Karnataka, India FIS Full timePosition Type :Full timeType Of Hire :Experienced (relevant combo of work and education)Education Desired :Bachelor of Computer EngineeringTravel Percentage :0%SRE Platform Engg (Devops + Production Support)Are you curious, motivated, and forward-thinking? At FIS you'll have the opportunity to work on some of the most challenging and relevant issues in...
-
SRE Platform Engg
2 weeks ago
Bengaluru, Karnataka, India Jobs for Humanity Full timeJob DescriptionPosition Type :Full timeType Of Hire :Experienced (relevant combo of work and education)Education Desired :Bachelor of Computer EngineeringTravel Percentage :0%SRE Platform Engg (Devops + Production Support)Are you curious, motivated, and forward-thinking? At FIS you'll have the opportunity to work on some of the most challenging and relevant...
-
Sre
2 weeks ago
Bengaluru, Karnataka, India Virtusa Full timeWe are looking for senior SRE (Software Reliability Engineer) profiles for our squad with the capacity to become Tech lead.Strong hands-on skills required on:distributed architecture and high availabilityautomation and scriptingnetwork and systemperformance analysisCICD toolchainsInfrastructure services, esp. on KubernetesMonitoring solutionAgile methodology...
-
SRE Platform Engg
2 weeks ago
Bengaluru, Karnataka, India Jobs for Humanity Full timeJob Description Position Type : Full time Type Of Hire : Experienced (relevant combo of work and education) Education Desired : Bachelor of Computer Engineering Travel Percentage : 0% SRE Platform Engg (Devops + Production Support) Are you curious, motivated, and forward-thinking? At FIS you'll have the opportunity to work on some of the most...
-
SRE Platform Engg
4 weeks ago
Bengaluru, Karnataka, India FIS Global Full timePosition Type : Full time Type Of Hire : Experienced (relevant combo of work and education) Education Desired : Bachelor of Computer Engineering Travel Percentage : 0%SRE Platform Engg (Devops + Production Support)Are you curious, motivated, and forward-thinking? At FIS you'll have the opportunity to work on some of the most challenging and relevant issues...