
Site Reliability Engineer, AVP
1 day ago
Site Reliability Engineer, AVP
Position Overview
Job Title: Site Reliability Engineer, AVP
Location: Bangalore, India
Corporate Title: AVP
Role Description
Technology/Service is responsible for delivering the business vision and strategy, at a global level, focusing on achieving consistent operational excellence and client/user satisfaction through industrialisation, price/value optionality and leveraging increased automation and the use of technology. Work includes: Creating a digital vision and strategy for the bank, and ensuring its integration with the organization's overall strategic plans
- Identifying opportunities for differentiating the bank's digital portfolio including capabilities and solutions
- Acting as a change agent in leading the organizational changes that are required to create and maintain the necessary digital portfolio
- Applying extensive knowledge and understanding of the evolving digital market, acts as a thought leader on emerging digital trends related to technology and business
What we'll offer you
As part of our flexible scheme, here are just some of the benefits that you'll enjoy
- Best in class leave policy
- Gender neutral parental leaves
- 100% reimbursement under childcare assistance benefit (gender neutral)
- Sponsorship for Industry relevant certifications and education
- Employee Assistance Program for you and your family members
- Comprehensive Hospitalization Insurance for you and your dependents
- Accident and Term life Insurance
- Complementary Health screening for 35 yrs. and above
Your key responsibilities
- As Senior Site Reliability Engineer you
- Orchestrate and contribute SRE activities across API Platforms and Integration services
- Introduce all engineering disciplines that combine software- and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems
- Implement the core of DevOps with specific principles and practices, focusing on what and how to improve reliability
- Establish and support capacity planning procedures and have a close eye on SLIs and SLOs for production readiness and in live environment
- Coordinate with the rest of the division and the teams working on different layers of the application and infrastructure, and you have full commitment to collaboration on problem solving
- ForInfrastructure & Service Managementyou
- Engage in and improve the whole lifecycle of services - from inception and design, deployment, operation, and refinement
- Maintain services once they are live by measuring and monitoring availability, latency, and overall system health
- Scale systems sustainably through mechanisms like automation evolve systems by pushing for changes that improve reliability and velocity
- Develop and enforce policies, standards and guidelines for site reliability
- Automate application and infrastructure deployment activities to production environments.
- ForIncident & Problem Managementyou
- Perform troubleshooting & Emergency Response
- Investigate root causes and suggest solutions
- Increase the productivity by leading blameless post-mortems
- ForApplication Maintenanceyou
- Collaboratively work with Product Owners and Engineers to run reliable services
- Configure and maintains application & monitoring
- Identify business objects for monitoring
- Track system performance, capacity, and use your experience to create effective strategies for maintaining and improving system performance and availability.
- ForOperational Continuous Improvementyou
- Identify issues and optimization potential and introduce related user stories
- Support with automation knowhow to reduce the risk of bad changes
- Identify, design, develop, deploy tools and processes to monitor, maintain, and report site performance and availability
- ForService Onboardingyou
- Support your Squad and your Chapter population in onboarding & promotions
Your skills and experiences
- Expert hands-on experience with on-premises
- Expert hands-on experience with cloud ecosystems run on Google Cloud
- Expert hands-on experience with Docker / Kubernetes operations with GKE or similar technology
- Expert experience with automated infrastructure provisioning based on Terraform/TerraGrunt, Terraform Enterprise, Ansible
- Advanced hands-on experience with Continuous Integration / Continuous Deployment (Github) and patterns for CI/CD pipelines.
- Advanced hands-on experience of monitoring tools like Prometheus, Grafana, Kibana and alerting tools like OpsGenie, NewRelic, DataDog, Splunk, Google Operations-Suite (Stackdriver)
- Very good knowledge of security capabilities (TLS, OAuth2, KMS, Vault, Admission Controllers, let's encrypt or similar technologies).
- Very good understanding of Microservice architectures and experience with API Management with Apigee or WSO2
- Experience in software development in at least one language (Java, JavaScript, Python, Go).
- Good Knowledge of the Software Development Life Cycle processes based on related tools such as
- TeamCity, BitBucket, Artifactory
- SonarQube, VeraCode, Crucible
- JIRA, Confluence, Service Now
How we'll support you
- Training and development to help you excel in your career
- Coaching and support from experts in your team
- A culture of continuous learning to aid progression
- A range of flexible benefits that you can tailor to suit your needs
About us and our teams
Please visit our company website for further information:
We strive for a in which we are empowered to excel together every day. This includes acting responsibly, thinking commercially, taking initiative and working collaboratively.
Together we share and celebrate the successes of our people. Together we are Deutsche Bank Group.
We welcome applications from all people and promote a positive, fair and inclusive work environment.
-
Site Reliability Engineer
3 days ago
Bengaluru, Karnataka, India Enterprise Minds, Inc Full timeWe're Hiring | Site Reliability Engineer | 8-10 years
-
site reliability engineer
1 day ago
Bengaluru, Karnataka, India Randstad Full timeRole: Site Reliability Engineer SummaryThe Network Engineer 2 provides technical design, planning, operation, maintenance, and advanced troubleshooting of the Bread Financials' network infrastructure. This position ensures continuity and alignment of the network administration/engineering direction. This position supports Bread Financials' strategies and...
-
Site Reliability Engineer
6 days ago
Bengaluru, Karnataka, India WhiteLotus Talent Partners Full timeWe are looking for a L0 and L1 Site Reliability Engineer (SRE) Support to join our Krutrim Cloud Site Reliability operations team and ensure the smooth functioning of our cloud infrastructure powered by OpenStack and Kubernetes . In this role, you will focus on monitoring , basic troubleshooting , and incident response , helping to maintain high...
-
Site Reliability Engineer
2 days ago
Bengaluru, Karnataka, India WhiteLotus Talent Partners Full timeWe are looking for a L0 and L1 Site Reliability Engineer (SRE) Support to join our Krutrim Cloud Site Reliability operations team and ensure the smooth functioning of our cloud infrastructure powered by OpenStack and Kubernetes. In this role, you will focus on monitoring, basic troubleshooting, and incident response, helping to maintain high system...
-
Site Reliability Engineer
7 days ago
Bengaluru, Karnataka, India beBeeReliability Full time ₹ 2,00,00,000 - ₹ 2,50,00,000Role OverviewAs a Site Reliability Engineer, you will play a pivotal role in driving innovation and modernizing complex systems by leveraging cutting-edge technologies and collaboration with cross-functional teams.
-
Site Reliability Engineer
3 days ago
Bengaluru, Karnataka, India Coforge Full timeJob Description- Design, implement, and maintain scalable infrastructure to ensure high availability and performance of software applications.- Collaborate with development teams to identify and resolve issues affecting application performance, stability, and reliability.- Develop automated monitoring scripts using tools like Prometheus, Grafana, etc. to...
-
Site Reliability Engineering
3 days ago
Bengaluru, Karnataka, India Infrasoft Technologies Limited Full timeJob DescriptionJob Title: DeveloperWork Location: Bangalore, KarnatakaExperience Range: 68 YearsJob Description:We are looking for a skilled Developer with strong hands-on experience in Site Reliability Engineering (SRE), Java, JavaScript, and Production Support. The ideal candidate should have a solid background in application monitoring and troubleshooting...
-
Site Reliability Engineer
2 weeks ago
Bengaluru, Karnataka, India Collabera Full timeJob Description As a Principal/Chief Site Reliability Engineer , you will play a critical role in designing, developing, and maintaining scalable and highly reliable systems. You'll work closely with development teams to improve system reliability, monitor critical applications, and design fail-proof infrastructure. Responsibilities Design and implement...
-
Site Reliability Engineer
4 days ago
Bengaluru, Karnataka, India Xebia Full timeWe are seeking an experienced AWS DevOps Engineer with strong expertise in Observability and Site Reliability Engineering (SRE) to design, build, and manage scalable, reliable, and secure cloud environments. The role requires hands-on experience with AWS services, Infrastructure as Code (IaC), CI/CD, monitoring & observability frameworks, and incident...
-
Site Reliability Engineer
4 weeks ago
Bengaluru, Karnataka, India NatWest Group Full timeJoin us as a Site Reliability Engineer In this key role you ll support the improvement of non-functional and operational characteristics such as availability performance efficiency change management monitoring security incident response and capacity planning of our products and services You ll enjoy significant stakeholder interaction working in...