Site Reliability Engineering

1 week ago

Bengaluru, Karnataka, India Brambles Full time ₹ 15,00,000 - ₹ 28,00,000 per year

CHEP helps move more goods to more people, in more places than any other organization on earth via our 347 million pallets, crates and containers. We employ approximately 13,000 people and operate in 60 countries. Through our pioneering and sustainable share-and-reuse business model, the world's biggest brands trust us to help them transport their goods more efficiently, safely and with less environmental impact.

What does that mean for you? You'll join an international organization big enough to take you anywhere, and small enough to get you there sooner. You'll help change how goods get to market and contribute to global sustainability. You'll be empowered to bring your authentic self to work and be surrounded by diverse and driven professionals. And you can maximize your work-life balance and flexibility through our Hybrid Work Model.

Job Description

Key Responsibilities May Include:

Design, build, and maintain cloud infrastructure for machine learning workflows, including data storage and bespoke frameworks supporting the BRIX platform.
Develop and implement automated CI/CD pipelines to ensure regular, reliable deployment of models into production using containerization tools to manage and package devices.
Implement monitoring and logging solutions to maintain high availability and performance of machine learning models, applications, and cloud infrastructure.
Contribute to automation efforts for infrastructure processes, including configuration management and provisioning to enhance operational efficiency and scalability.
Support Brambles' Serialization and Asset Digitization programs by setting up and maintaining IoT and Edge platforms, ensuring seamless integration with cloud services.
Ensure best practices in version control and documentation processes, contributing to the clarity and standardization of cloud operations.
Research and implement best practices for cloud infrastructure management, such as GitOps and Infrastructure as Code (IaC), enhancing the team's capabilities.
Mentor and support the Cloud Engineering team by sharing knowledge and staying up to date on emerging cloud technologies, driving continuous improvement and innovation.

Job Details

Position Purpose

At Brambles there is a need to make sure that platforms built on cloud hypervisors run smoothly as expected and can scale to the demand. The SRE Lead will monitor, maintain, and drive the software engineering required to ensure performance, scalability and reliability of cloud-based applications and infrastructure.

This role will proactively use observation data to identify improvement opportunities, not just across cloud services, but also for the platform itself. This role will drive a self-healing mentality across the global estate that can scale seamlessly.

The SRE Lead will work alongside the Cloud Platform Engineering Team Lead(s) and others, and may assist in the creation of modules but is focused on delivering performance and optimisation to maintain production services.

Major/Key Accountabilities

Using Brambles observability tools to detect platform and workload issues
Work closely with the native platform management team, product groups and technical leads to formulate and design systems to troubleshoot issues proactively and automatically.
Support cloud operations in postmortem reviews to identify mitigation for future failure.
Evaluate the key workloads and implement strategies to mitigate risk of failure.
Continuous monitoring to review effectiveness
Minimizing mean time to respond. (MTTR)
Supporting the maintenance of tools for bug tracking
Ensuring documentation and designs are kept relevant

Experience:

Significant experience in within a technology automation/ SRE role
10+ years working with scripting languages.
Proven success in improving the customer experience.
Experience working within a matrix structure.

Qualifications

Essential Qualifications

• Extensive experience with Python

• Strong experience with BASH

• Strong experience with automation of processes

• Experience with Kubernetes

• Strong knowledge of CI/CD

Desirable Qualifications

SRE Reliability Engineering Practitioner
SRE Reliability Engineering Foundations
Bachelor's degree in Computer Science, Information Systems, Business or related field, Masters preferred or equivalent combination of education/experience.

Skills and Knowledge

Python - Can guide others to write clean, reusable, scalable code
Build pipelines for continuous improvement, writing Python scripts to automate testing, deployment, and rollback processes to ensure a smooth and reliable CI/CD pipeline
Advanced monitoring, logging and custom tooling
Write scripts to interact with cloud APIs, handling authentication, error handling, and maximizing availability
System Programming Languages - Can guide and support others in the development, testing, and deployment of cloud-native applications, services and infrastructure
Troubleshoot issues with guidance from senior team members
Support the integration of cloud applications with edge devices using system programming languages for low-level interactions and communication
Kernel-level development and optimization
Develop and implement networking protocols
Design - Understanding and use of event-based design, object-oriented design, functional design, multi-tenant design, domain driven design – and knowing which design approach is best suited for the particular problem and abstraction to solve complex problems. Ability to design at both the high level (the forest) and the low level (the tree); and include understanding of current design approaches used "in the field", and when they are appropriate to the use cases relevant to the platform being built.
Tooling - Use of well-established tools such as databases and Structured Query Language (SQL), and new leading-edge tools such as Kubernetes and the eco-system of tools around a particular language or programming environment with continuous research and learning of emerging new tools in a rapidly changing computing landscape.
Systems Thinking - Thinking abstractly to incorporate multiple perspectives; work within a space where the boundary or scope of problem or system may be "fuzzy"; understand diverse operational contexts of the system; identify inter- and intrarelationships and dependencies; understand complex system behavior; and reliably predict the impact of change to the system.
Cloud Platforms - Ability to navigate cloud platforms such as AWS and Azure, and use them effectively as the technical landscape for building Brambles specific platforms (both multi-tenant and purely internal). The platforms built within Brambles Digital need to be "cloud-native" and run securely, effectively and correctly at scale.

Remote TypeHybrid RemoteSkills to succeed in the roleActive Learning, Active Learning, Adaptability, Agile Methodology, Amazon Web Services (AWS), Automation Cloud, Cloud Infrastructure, Continuous Integration and Continuous Delivery Methodologies, Cross-Functional Work, Curiosity, Digital Literacy, Docker (Software), Emotional Intelligence, Empathy, GitHub, Infrastructure As Code (IaC), Initiative, Kubernetes, Lambda, Linux, NoSQL, Problem Solving, Prometheus, Python (Programming Language), Scala (Programming Language) {+ 5 more}

We are an Equal Opportunity Employer, and we are committed to developing a diverse workforce in which everyone is treated fairly, with respect, and has the opportunity to contribute to business success while realizing his or her potential. This means harnessing the unique skills and experience that each individual brings and we do not discriminate against any employee or applicant for employment because of race, color, sex, age, national origin, religion, sexual orientation, gender identity, status as a veteran, and basis of disability or any other federal, state, or local protected class.

Individuals fraudulently misrepresenting themselves as Brambles or CHEP representatives have scheduled interviews and offered fraudulent employment opportunities with the intent to commit identity theft or solicit money. Brambles and CHEP never conduct interviews via online chat or request money as a term of employment. If you have a question as to the legitimacy of an interview or job offer, please contact us

Site Reliability Engineer

7 days ago

Bengaluru, Karnataka, India Programming Full time ₹ 1,04,000 - ₹ 1,30,878 per year

Role - Site Reliability Engineering.Location - BengaluruYears of Expereince - 4+ YearsProfessional & Technical Skills:Must To Have Skills: Proficiency in Site Reliability Engineering.Good To Have Skills: Experience with cloud service providers such as AWS, Azure, or Google Cloud.Strong understanding of CI/CD tools and practices.Experience with container...
Site Reliability Engineer

2 weeks ago

Bengaluru, Karnataka, India Enterprise Minds, Inc Full time

We're Hiring | Site Reliability Engineer | 8-10 years
Site Reliability Engineer

1 week ago

Bengaluru, Karnataka, India FOSS United Full time ₹ 1,04,000 - ₹ 1,30,878 per year

All JobsSite Reliability Engineer at ZEISS IndiaSite Reliability EngineerApplyPosted on September 11, 2025ZEISS IndiaKadubeesanahalli, BengaluruFull TImeJob DescriptionZEISS in IndiaZEISS in India is headquartered in Bengaluru and present in the fields of Industrial Quality Solutions, Research Microscopy Solutions, Medical Technology, Vision Care and Sports...
site reliability engineer

2 weeks ago

Bengaluru, Karnataka, India Randstad Full time

Role: Site Reliability Engineer SummaryThe Network Engineer 2 provides technical design, planning, operation, maintenance, and advanced troubleshooting of the Bread Financials' network infrastructure. This position ensures continuity and alignment of the network administration/engineering direction. This position supports Bread Financials' strategies and...
Site Reliability Engineer

2 weeks ago

Bengaluru, Karnataka, India WhiteLotus Talent Partners Full time

We are looking for a L0 and L1 Site Reliability Engineer (SRE) Support to join our Krutrim Cloud Site Reliability operations team and ensure the smooth functioning of our cloud infrastructure powered by OpenStack and Kubernetes. In this role, you will focus on monitoring, basic troubleshooting, and incident response, helping to maintain high system...
Site Reliability Engineer

1 week ago

Bengaluru, Karnataka, India WhiteLotus Talent Partners Full time

We are looking for a L0 and L1 Site Reliability Engineer (SRE) Support to join our Krutrim Cloud Site Reliability operations team and ensure the smooth functioning of our cloud infrastructure powered by OpenStack and Kubernetes. In this role, you will focus on monitoring, basic troubleshooting, and incident response, helping to maintain high system...
Site Reliability Engineer

2 weeks ago

Bengaluru, Karnataka, India TRUGlobal Full time ₹ 9,00,000 - ₹ 12,00,000 per year

Job Title: Site Reliability Engineer (SRE) with Python Development ExpertisePosition Overview: We are seeking a skilled Site Reliability Engineer (SRE) with strong Python development experience to join our team. The ideal candidate will be responsible for ensuring the reliability, availability, and performance of our services across both on-premises and...
Site Reliability Engineer

1 week ago

Bengaluru, Karnataka, India CorroHealth Full time

We are seeking a highly skilled Site Reliability Engineer (SRE) to join our team. The ideal candidate will have a deep understanding of both software engineering and systems administration, with a focus on creating scalable and reliable systems. You will work closely with development and operations teams to ensure the reliability, availability, and...
Site Reliability Engineer

1 week ago

Bengaluru, Karnataka, India ViewSonic Full time ₹ 1,04,000 - ₹ 1,30,878 per year

Job Requirements:Bachelor's degree in Computer Science, Engineering, or a related field.3+ year of experience in a relevant role, such as Site Reliability Engineer, DevOps Engineer, or similar, is preferred but not mandatory.Basic understanding of AWS solutions including EC2, S3, CloudWatch, Lambda, and RDS.Interest and understanding of Platform Engineering...
Site Reliability Engineer

6 days ago

Bengaluru, Karnataka, India ViewSonic Full time

Job Requirements:1. Bachelor's degree in Computer Science, Engineering, or a related field.2. 3+ year of experience in a relevant role, such as Site Reliability Engineer, DevOps Engineer, or similar, is preferred but not mandatory.3. Basic understanding of AWS solutions including EC2, S3, CloudWatch, Lambda, and RDS.4. Interest and understanding of Platform...

Americas

Europe

Asia / Oceania

Africa

Site Reliability Engineering