![American Express](https://media.trabajo.org/img/noimg.jpg)
Director of Site Reliability Engineering
4 weeks ago
You Lead the Way. We’ve Got Your Back.
With the right backing, people and businesses have the power to progress in incredible ways. When you join Team Amex, you become part of a global and diverse community of colleagues with an unwavering commitment to back our customers, communities and each other. Here, you’ll learn and grow as we help you create a career journey that’s unique and meaningful to you with benefits, programs, and flexibility that support you personally and professionally.
At American Express, you’ll be recognized for your contributions, leadership, and impact—every colleague has the opportunity to share in the company’s success. Together, we’ll win as a team, striving to uphold our and powerful backing promise to provide the world’s best customer experience every day. And we’ll do it with the utmost integrity, and in an environment where everyone is seen, heard and feels like they belong.
As part of our diverse tech team, you can work alongside talented engineers in an open, supportive, inclusive environment where your voice is valued and you solve challenging tech problems. Amex offers a range of opportunities to work with the latest technologies and encourages you to back the broader engineering community through open source. And because we understand the importance of keeping your skills fresh and relevant, we give you dedicated time to invest in your professional development.
Join #TeamAmex and let's lead the way together.
Key Responsibilities:
SRE Strategy and Leadership: Develop and implement a comprehensive SRE strategy aligned with the company's goals and objectives. Lead a team of SRE professionals to drive the reliability, performance, and scalability of GRC technology solutions. Observability and Monitoring: Establish observability practices to ensure real-time insights into system performance, availability, and customer experience. Implement monitoring tools, metrics, and dashboards to proactively identify and address potential issues. Production Support Optimization: Lead all aspects of the end-to-end production support process, including incident management, problem resolution, and service-level agreement (SLA) compliance. Drive continuous improvement initiatives to enhance operational effectiveness and reduce mean time to resolution (MTTR). GRC Customer Journeys: Collaborate with cross-functional teams to enhance customer journeys through seamless and reliable technology experiences. Reliability Engineering Best Practices: Promote and implement standard methodologies, including error budgeting, chaos engineering, and disaster recovery planning. Foster a culture of resilience and reliability within technology. Automation and Efficiency: Champion automation initiatives to streamline operational workflows, deployment processes, and incident response tasks. Leverage automation tools and orchestration to improve reliability and reduce manual intervention.Qualifications:
Degree or equivalent experience in Computer Science, Information Technology, or related field. Advanced certifications in SRE or related are a plus. Deep understanding of observability tools and methodologies, including experience with logging, monitoring, tracing, and performance analysis platforms. Strong leadership and people management skills, with the ability to inspire and empower successful SRE teams.Preferred Skills:
Knowledge of cloud-based SRE practices and experience with public cloud platforms such as AWS, Azure, or Google Cloud. Familiarity with containerization technologies (e.g., Kubernetes, Docker) and microservices architecture. Demonstrated expertise in driving culture change, DevOps practices, and continuous improvement in SRE and production support functions.Join our innovative team and be at the forefront of advancing Site Reliability Engineering and production support in the Global Risk and Compliance Technology space. If you are passionate about driving reliability, observability, and excellence in customer experiences, we invite you to apply and join our mission to redefine the future of risk and compliance technology. Apply now and join us in shaping the reliability and performance of GRC solutions for a secure and compliant world.
We back our colleagues and their loved ones with benefits and programs that support their holistic well-being. That means we prioritize their physical, financial, and mental health through each stage of life. Benefits include:
Competitive base salaries Bonus incentives Support for financial-well-being and retirement Comprehensive medical, dental, vision, life insurance, and disability benefits (depending on location) Flexible working model with hybrid, onsite or virtual arrangements depending on role and business need Generous paid parental leave policies (depending on your location) Free access to global on-site wellness centers staffed with nurses and doctors (depending on location) Free and confidential counseling support through our Healthy Minds program Career development and training opportunitiesAmerican Express is an equal opportunity employer and makes employment decisions without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, veteran status, disability status, age, or any other status protected by law.
Offer of employment with American Express is conditioned upon the successful completion of a background verification check, subject to applicable laws and regulations.
-
Director of Site Reliability Engineering
4 weeks ago
gurugram, India AMEX Full timeYou Lead the Way. Weve Got Your Back. With the right backing, people and businesses have the power to progress in incredible ways. When you join Team Amex, you become part of a global and diverse community of colleagues with an unwavering commitment to back our customers, communities and each other. Here, youll learn and grow as we help you create a...
-
Director of Site Reliability Engineering
4 weeks ago
gurugram, India American Express Full timeYou Lead the Way. We’ve Got Your Back. With the right backing, people and businesses have the power to progress in incredible ways. When you join Team Amex, you become part of a global and diverse community of colleagues with an unwavering commitment to back our customers, communities and each other. Here, you’ll learn and grow as we help you create a...
-
Director of Site Reliability Engineering
4 weeks ago
Gurugram, India AMEX Full timeYou Lead the Way. Weve Got Your Back. With the right backing, people and businesses have the power to progress in incredible ways. When you join Team Amex, you become part of a global and diverse community of colleagues with an unwavering commitment to back our customers, communities and each other. Here, youll learn and grow as we help you create a career...
-
Site Reliability Engineer
3 weeks ago
Gurugram, India BayOne Solutions Full timeTechnical SkillsProficiency with cloud platforms AWS, Terraform, Kubernetes, Jira, CloudFlare, FreshService, Jira, DataDog, GitHub, Postgres, Java / Kotlin,Proficient in SRE principlesUnderstanding of CI / CD processesExperience6+ years of experience in a Site Reliability Engineering, DevOps or similar roleBachelor's degree in Computer Science, Information...
-
Site Reliability Engineer
3 weeks ago
gurugram, India BayOne Solutions Full timeTechnical Skills Proficiency with cloud platforms AWS, Terraform, Kubernetes, Jira, CloudFlare, FreshService, Jira, DataDog, GitHub, Postgres, Java / Kotlin, Proficient in SRE principles Understanding of CI / CD processes Experience 6+ years of experience in a Site Reliability Engineering, DevOps or similar role Bachelor's degree in Computer Science,...
-
Site Reliability Engineer
2 weeks ago
Gurugram, India BayOne Solutions Full timeTechnical SkillsProficiency with cloud platforms AWS, Terraform, Kubernetes, Jira, CloudFlare, FreshService, Jira, DataDog, GitHub, Postgres, Java / Kotlin,Proficient in SRE principlesUnderstanding of CI / CD processesExperience6+ years of experience in a Site Reliability Engineering, DevOps or similar roleBachelor's degree in Computer Science, Information...
-
Site Reliability Engineer
1 month ago
Gurugram, India FX Consulting Full timeA Site Reliability Engineer (SRE) plays a crucial role in maintaining the reliability and performance of computer systems within an organization. As a bridge between development and IT operations, an SRE takes on operational tasks and responsibilities typically handled by operations teams. Here's a comprehensive job description for an SRE with the...
-
Site Reliability Engineer
3 weeks ago
gurugram, India FX Consulting Full timeA Site Reliability Engineer (SRE) plays a crucial role in maintaining the reliability and performance of computer systems within an organization. As a bridge between development and IT operations, an SRE takes on operational tasks and responsibilities typically handled by operations teams. Here's a comprehensive job description for an SRE with the specified...
-
Site Reliability Engineer, AVP
4 weeks ago
Gurugram, India NatWest Group Full timeJoin us as a Site Reliability Engineer You’ll manage the provision of stable, resilient, reliable applications with the end goal of minimising disruption to Customer & Colleague Journeys (CCJ) We’ll look to you to identify and automate manual tasks and implement observability solutions, ensuring a thorough understanding of CCJ across...
-
Senior Site Reliability Engineer, Platform
4 weeks ago
gurugram, India GEMINI Full timeDepartment : Platform Our Platform organization’s purpose is to enable Gemini to scale effectively and empower our engineering teams to focus on building innovative financial products and experiences for individuals around the world. Within Platform, the Site Reliability Engineering team is responsible for partnering with Gemini’s other...
-
Senior Site Reliability Engineer, Platform
4 weeks ago
Gurugram, India GEMINI Full timeDepartment : Platform Our Platform organization’s purpose is to enable Gemini to scale effectively and empower our engineering teams to focus on building innovative financial products and experiences for individuals around the world. Within Platform, the Site Reliability Engineering team is responsible for partnering with Gemini’s other engineering...
-
Site Reliability Engineer
4 weeks ago
Gurgaon/Gurugram, India FX Consulting Full timeA Site Reliability Engineer (SRE) plays a crucial role in maintaining the reliability and performance of computer systems within an organization. As a bridge between development and IT operations, an SRE takes on operational tasks and responsibilities typically handled by operations teams. Here's a comprehensive job description for an SRE with the...
-
Site Reliability Engineer
1 month ago
Gurgaon/Gurugram, IN FX Consulting Full timeA Site Reliability Engineer (SRE) plays a crucial role in maintaining the reliability and performance of computer systems within an organization. As a bridge between development and IT operations, an SRE takes on operational tasks and responsibilities typically handled by operations teams. Here's a comprehensive job description for an SRE with the...
-
Site Reliability Engineer
2 months ago
gurugram, India StatusNeo Full timeJob Description: We are seeking a highly skilled and experienced Senior Site Reliability Engineer with expertise in Core Tools and DevOps to join our dynamic team. The ideal candidate will have a strong background in Linux administration, cloud infrastructure, Infrastructure as Code (IaC), Python programming, and be a subject matter expert in DevOps tools...
-
Site Reliability Engineer
1 month ago
gurugram, India IndusInd Bank Full timeAbout the Role As a Site Reliability Engineer you will troubleshoot, debug, evaluate and resolve customer impacting issues with a focus on detecting patterns and working with the engineering development and or product teams to eliminate defects. The position requires a combination of strong troubleshooting, technical, communication and problem solving...
-
Site Reliability Engineer
1 month ago
Gurugram, India IndusInd Bank Full timeAbout the RoleAs a Site Reliability Engineer you will troubleshoot, debug, evaluate and resolve customer impacting issues with a focus on detecting patterns and working with the engineering development and or product teams to eliminate defects. The position requires a combination of strong troubleshooting, technical, communication and problem solving skills....
-
Site Reliability Engineer
1 month ago
Gurugram, India IndusInd Bank Full timeAbout the RoleAs a Site Reliability Engineer you will troubleshoot, debug, evaluate and resolve customer impacting issues with a focus on detecting patterns and working with the engineering development and or product teams to eliminate defects. The position requires a combination of strong troubleshooting, technical, communication and problem solving skills....
-
Site Reliability Engineer
3 months ago
Gurugram, India StatusNeo Full timeJob Description: We are seeking a highly skilled and experienced Senior Site Reliability Engineer with expertise in Core Tools and DevOps to join our dynamic team. The ideal candidate will have a strong background in Linux administration, cloud infrastructure, Infrastructure as Code (IaC), Python programming, and be a subject matter expert in DevOps tools...
-
Site Reliability Engineer
2 weeks ago
Gurgaon / Gurugram, Mumbai, India Govind S (Proprietor of Vintage Fashions) Full timeHIRING FOR TECH MAHINDRAResponsibilities:Analyze metrics from operating systems and applications to optimize performance and troubleshoot issues effectively. Collaborate closely with development teams to enhance services through rigorous testing and streamlined release processes. Provide expertise in system design consulting, platform management, and...
-
Site Reliability Engineer
4 weeks ago
Gurugram, India Codersbrain technology pvt ltd Full timeKey Responsibilities :- Provide expert production support for application teams utilizing our platform, ensuring high availability, reliability, and performance.- Diagnose and resolve complex issues in production environments, collaborating closely with development teams and stakeholders.- Implement and maintain monitoring, alerting, and logging solutions to...