Site Reliability Engineer III
1 week ago
At American Express, our culture is built on a 175-year history of innovation, shared At American Express, our culture is built on a 175-year history of innovation, shared values and Leadership Behaviors, and an unwavering commitment to back our customers, communities, and colleagues. As part of Team Amex, you'll experience this powerful backing with comprehensive support for your holistic well-being and many opportunities to learn new skills, develop as a leader, and grow your career.
Here, your voice and ideas matter, your work makes an impact, and together, you will help us define the future of American Express.
How will you make an impact in this role?
Key Responsibilities:
- Manages the collaboration with Software Engineering teams to design, develop, and implement features that enhance system resilience, scalability, and performance, proactively identifying and resolving system bottlenecks and failure points
- Develops and refines sophisticated automation tools and frameworks, including advanced infrastructure as code (IaC) practices, to streamline operational workflows, deployment processes, and infrastructure management, ensuring high system efficiency
- Engages in architectural design discussions, ensuring that advanced reliability, scalability, and performance considerations are integrated into strategic decision-making processes
- Designs and executes comprehensive chaos engineering experiments and advanced resiliency testing, analyzing results to implement robust improvements that enhance system robustness and recovery capabilities
- Develops, optimizes, and maintains comprehensive disaster recovery plans and business continuity strategies, ensuring systems can recover quickly and effectively from complex and unexpected disruptions
- Advocates for observability practices by promoting and implementing best practices such as error budgeting, service-level objectives (SLOs), and service-level indicators (SLIs), contributing to a culture of continuous improvement and reliability
- Collaborates and co-creates effectively with teams in product and the business to align technology initiatives with business objectives
Qualifications
- Bachelor's degree in computer science, Information Technology, Engineering, and/or comparable experience; advance degree preferred
- Knowledge of modern observability stack - Splunk, Elastic Search, Prometheus, Grafana
- Knowledge of containerization technologies (e.g., Kubernetes, Docker) and microservices architecture
- Knowledge of observability tools and methodologies, including experience with logging, monitoring, tracing, and performance analysis platforms
- Knowledge of cloud-based Site Reliability Engineering (SRE) practices and experience with public cloud platforms such as AWS, Azure, or Google Cloud
We back you with benefits that support your holistic well-being so you can be and deliver your best. This means caring for you and your loved ones' physical, financial, and mental health, as well as providing the flexibility you need to thrive personally and professionally:
- Competitive base salaries
- Bonus incentives
- Support for financial-well-being and retirement
- Comprehensive medical, dental, vision, life insurance, and disability benefits (depending on location)
- Flexible working model with hybrid, onsite or virtual arrangements depending on role and business need
- Generous paid parental leave policies (depending on your location)
- Free access to global on-site wellness centers staffed with nurses and doctors (depending on location)
- Free and confidential counseling support through our Healthy Minds program
- Career development and training opportunities
American Express is an equal opportunity employer and makes employment decisions without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, veteran status, disability status, age, or any other status protected by law.
Offer of employment with American Express is conditioned upon the successful completion of a background verification check, subject to applicable laws and regulations.
-
Site Reliability Engineer III
6 days ago
Bengaluru, Karnataka, India CME Group Full time ₹ 15,00,000 - ₹ 28,00,000 per yearAs a Observability Engineer under Site Reliability Engineering Team, you will be a crucial part of the team responsible for the availability, performance, and scalability of our cloud platform. You will blend software engineering and systems administration expertise to build and run large-scale, distributed, fault-tolerant systems. Your mission is to ensure...
-
Bengaluru, Karnataka, India Google Full time ₹ 12,00,000 - ₹ 24,00,000 per yearMinimum qualifications:Bachelor's degree in Computer Science, a related field, or equivalent practical experience.2 years of experience working with administration (e.g. filesystems, inodes, system calls) or networking (e.g. TCP/IP, routing, network topologies and hardware, SDN).2 years of experience with data structures/algorithms and software development...
-
Bengaluru, Karnataka, India Google Full time ₹ 12,00,000 - ₹ 24,00,000 per yearMinimum qualifications:Bachelor's degree in Computer Science, a related field, or equivalent practical experience.2 years of experience working with administration (e.g. filesystems, inodes, system calls) or networking (e.g. TCP/IP, routing, network topologies and hardware, SDN).2 years of experience with data structures/algorithms and software development...
-
Site Reliability Engineering
2 weeks ago
Bengaluru, Karnataka, India Viraaj HR Solutions Private Limited Full time ₹ 12,00,000 - ₹ 36,00,000 per yearSite Reliability Engineer (SRE)About The OpportunityA fast-growing organization in the Enterprise Cloud Infrastructure & SaaS sector delivering highly available, mission-critical services to enterprise customers. We are hiring an on-site Site Reliability Engineer in India to own reliability, automation, and operational excellence across cloud-native...
-
Bengaluru, Karnataka, India Google Full time ₹ 12,00,000 - ₹ 36,00,000 per yearMinimum qualifications:Bachelor's degree in Computer Science, a related field, or equivalent practical experience.2 years of experience working with administration (e.g. filesystems, inodes, system calls) or networking (e.g. TCP/IP, routing, network topologies and hardware, SDN).2 years of experience with data structures/algorithms and software development...
-
Site Reliability Engineer
1 week ago
Bengaluru, Karnataka, India super Full time ₹ 12,00,000 - ₹ 24,00,000 per yearSite Reliability Engineer (SRE) Level 3Overview:A Site Reliability Engineer (SRE) Level 3 is a senior technical leadership role focused on designing, implementing, and maintaining large-scale, complex, and highly reliable systems. This role emphasizes a blend of software and systems engineering to ensure the availability, latency, performance, and capacity...
-
Site Reliability Engineer
6 days ago
Bengaluru, Karnataka, India eBay Full time ₹ 12,00,000 - ₹ 36,00,000 per yearAt eBay, we're more than a global ecommerce leader — we're changing the way the world shops and sells. Our platform empowers millions of buyers and sellers in more than 190 markets around the world. We're committed to pushing boundaries and leaving our mark as we reinvent the future of ecommerce for enthusiasts.Our customers are our compass, authenticity...
-
Site Reliability Engineer III
1 week ago
Bengaluru, Karnataka, India 6sense Full time ₹ 9,00,000 - ₹ 12,00,000 per yearOur Mission: 6sense is on a mission to revolutionize how B2B organizations create revenue by predicting customers most likely to buy and recommending the best course of action to engage anonymous buying teams. 6sense Revenue AI is the only sales and marketing platform to unlock the ability to create, manage and convert high-quality pipeline to revenue. Our...
-
Systems Reliability Engineer III
2 weeks ago
Bengaluru, Karnataka, India Nutanix Full time ₹ 12,00,000 - ₹ 24,00,000 per yearHungry, Humble, Honest, with Heart.The OpportunityAre you a top-tier Systems Reliability Engineer with a strong expertise in networking, virtualization, and cloud technologies, along with a passion for delivering exceptional customer support? If so, you'll thrive in our dynamic hybrid team at Nutanix, where you'll have the opportunity to work on cutting-edge...
-
Site Reliability Engineer
1 week ago
Bengaluru, Karnataka, India Zetamicron Full time ₹ 12,00,000 - ₹ 36,00,000 per yearJob Title: Site Reliability Engineer (SRE)About the RoleWe are seeking a highly skilled and proactive Site Reliability Engineer (SRE)to ensure the stability, scalability, and reliability of our platform. The ideal candidate will have strong experience in managing production environments, automating operational processes, and enhancing system performance...