Platform Site Reliability Engineer

4 weeks ago

Chennai, Tamil Nadu, India World Bank Full time

Platform Site Reliability Engineer

Job #: req33873
Organization: World Bank
Sector: Information Technology
Grade: GF
Term Duration: 3 years 0 months
Recruitment Type: Local Recruitment
Location: Chennai, India
Required Language(s): English
Preferred Language(s): English
Closing Date: 8/12/2025 (MM/DD/YYYY) at 11:59pm UTC

Description

Do you want to build a career that is truly worthwhile? Working at the World Bank Group provides a unique opportunity for you to help our clients solve their greatest development challenges. The World Bank Group is one of the largest sources of funding and knowledge for developing countries; a unique global partnership of five institutions dedicated to ending extreme poverty, increasing shared prosperity, and promoting sustainable development. With 189 member countries and more than 120 offices worldwide, we work with public and private sector partners, investing in groundbreaking projects and using data, research, and technology to develop solutions to the most urgent global challenges. For more information, visit www.worldbank.org.

ITS Vice Presidency Context

Information and Technology Solutions (ITS) enables the WBG to achieve its mission of ending extreme poverty and promoting shared prosperity in a sustainable way by delivering transformative information and technologies to its staff working in over 150 locations. Our vision is to transform how the Bank Group accomplishes its mission through information and technology. In this fast-paced, ever-changing world, the formulation and implementation of the ITS strategy is an ongoing, iterative process of learning and adaptation, developed through extensive consultations with business partners throughout the World Bank Group.

ITS shapes its strategy in response to changing business priorities and leverages new technologies to achieve three high-level business outcomes:
- Business enablement, by providing Bank Group units with innovative digital tools and technologies to transform how they deliver value for their clients.
- Empowerment and effectiveness, by ensuring that all Bank Group staff are connected, able to find information, and productive, to accelerate the delivery of development solutions globally.
- Resilience, by equipping the Bank Group to provide risk-based cybersecurity and robust data protection for a global network and a growing cloud platform.

Implementation of the strategy is guided by three core principles. The first is to deliver solutions for business partners that are customer-centric, innovative, and transformative. The second is to provide the Bank Group with value for money with selective and standard technologies. The third is to excel at the basics by providing a high-performing, robust, and resilient IT environment for the organization.

The Technology Platforms Team (ITSPL) is anchored in the Chief Technology Officer (ITSTO) division in ITS. The ITS Technology Office (ITSTO) drives technology-enabled innovation and delivers the digital backbone for WBG's mission. It develops future-ready technology strategy, modernizes infrastructure, manages innovation, and fosters agility. The unit collaborates across the organization to leverage technology as a force multiplier, accelerating development impact and digital transformation globally.

ITSPL delivers secure, cloud-first IT platforms with automation, self-service, IAM, Platform Engineering, and IaC. It manages databases, integrations, and cloud ops to ensure reliable, scalable, cost-effective alignment with enterprise standards.

The ITSTO unit is primarily responsible for providing a wide range of technical infrastructure services to meet the institution's computing needs, from mid-range servers to large-scale servers and the respective system, network, and supporting software on those platforms. It provides engineering, integration, and system administration services for Server Administration, Server Security, Backup/Restore, Storage, Virtual infrastructure (on premise and in cloud), and Data Center Management.

This is a hands-on position in a very multicultural environment that supports diversity, continuous learning, enhancing skillsets, and collaboration. The candidate must demonstrate excellent communication skills, as the position requires interaction with other teams. The candidate must possess a strong sense of curiosity, adaptability, and the drive to learn and innovate.

We provide a meaningful, open, and collaborative environment. We have many interesting problems to solve, providing you an opportunity to develop your skills while contributing to the mission of the bank. We value teamwork, openness, curiosity, and persistence.

About the Position

The rapid shift to hybrid cloud environments necessitates a Site Reliability Engineer (SRE) with expertise in both legacy middleware systems (e.g., JBoss, Apache, WebSphere, IIS) and modern DevOps/DevSecOps pipelines (Terraform, Kubernetes, GitOps). This position focuses on ensuring operational continuity for enterprise platforms by stabilizing traditional systems while driving modernization through Infrastructure-as-Code (IaC) practices and automation. As a bridge between legacy and cloud-native technologies, this role will implement robust SRE practices such as observability, error budgets, and incident response to maintain a highly reliable environment with minimal downtime. Additionally, the SRE will lead knowledge transfer efforts to prevent single points of failure and optimize platform performance through auto-remediation playbooks and chaos engineering. This position will implement SRE practices to achieve resilience, automate standards, and contribute to the success of a future-proof "Platform as a Product" strategy.

Competencies Required (Technical Proficiency / Cognitive Skills)

- Experience as a Site Reliability Engineer with hands-on knowledge of Site Reliability Engineering (SRE) practices and principles, including implementing and managing SLOs, error budgets, observability, incident response, and automation in high-availability environments.
- Proven experience with legacy middleware (JBoss, Apache, WebSphere, IIS) and modern stacks (.NET, Java, NodeJS, Angular).
- Strong knowledge of and working experience in multi-cloud platforms (AWS, Azure, GCP).
- Strong database skills (PostgreSQL, MySQL, other RDBMS, NoSQL).
- Proficiency in DevOps/DevSecOps tools (Terraform, Kubernetes, GitOps, Chef, CI/CD, GitHub, GitLab, Azure Repos).
- Experience with containerization (Docker, Kubernetes, AKS) and web service management.
- Strong scripting/automation skills (Python, PowerShell, Bash, etc.).
- Experience with monitoring/observability tools (Splunk, Prometheus, Grafana, etc.).
- Experience in setting up and managing PaaS and COTS solutions.
- Experience working in Agile environments with a strong understanding of Agile principles and practices. Exposure to the Scaled Agile Framework (SAFe) is highly desirable.
- Client Understanding and Advising: Advocates for client needs and perspectives.
- Learning Orientation: Keeps up with new SRE, cloud, middleware, and automation trends.
- Analytical Thinking: Strong diagnostic and troubleshooting skills.
- Foundation Architecture Knowledge: Supports standards for hybrid cloud and middleware.
- Strategic Technology Planning: Contributes to technological roadmaps, especially for SRE and cloud "Platform as a Product".
- Technology Knowledge: Deep understanding of hybrid cloud, containerization, and middleware.
- Modernize and Innovate: Develops innovative solutions in automation, observability, and cloud migration.
- Deliver Results for Clients: Ensures high reliability and performance.
- Collaboration: Works effectively across teams and locations.
- Knowledge Sharing: Actively participates in knowledge transfer and documentation.
- Decision Making: Makes informed decisions, especially in incident response.
- Communication: Excellent written and verbal English; able to explain complex technical concepts.

Roles and Responsibilities

- Infrastructure Operations and Support: Manage and support both legacy and cloud-native middleware across on-premises and hybrid cloud environments.
- SRE Implementation: Apply SRE principles (observability, error budgets, incident response, chaos engineering) to ensure reliability and performance. Promote SRE practice and culture across the team and apply SRE principles across all deliverables.
- Automation and DevOps/DevSecOps: Build and maintain CI/CD pipelines, automate manual tasks (toil), develop self-service tools, and implement Infrastructure-as-Code (IaC) using tools like Terraform and GitOps.
- Cloud Adoption and Modernization: Guide migration to cloud/container platforms and adoption of cloud-native services.
- Knowledge Management and Collaboration: Document and communicate changes, lead knowledge transfer, and promote SRE culture across teams and stakeholders.
- Compliance and Business Continuity: Ensure adherence to SLAs and compliance, and contribute to business continuity and disaster recovery planning.
- Performance Optimization: Develop auto-remediation playbooks, conduct chaos engineering, and optimize platform performance.
- Stakeholder Engagement: Support organizational IT strategy and deliverables, especially during team transitions or critical staffing changes.

Selection Criteria

- Bachelor's or Master's degree with at least 4-5 years of relevant experience.
- Experience in applying Site Reliability Engineering practices at work. An SRE certification is mandatory.
- Experience working in Agile environments. A SAFe Agile certification is mandatory.
- Strong experience configuring and supporting .NET, Java, NodeJS, and Angular applications.
- Good understanding of multiple middleware technologies and custom/COTS product hosting.
- Experience with Azure DevOps as both developer and administrator.
- Solid knowledge of modern DevOps practices, including CI/CD, git, Docker, and Kubernetes.
- Familiarity with Artifactory solutions (e.g., JFrog).
- Experience with Infrastructure as Code tools (Terraform, Chef, etc.).
- Knowledge of Azure AD authentication and authorization.
- Proficient with monitoring tools and Splunk.
- Demonstrated experience working in Agile environments.
- Hands-on experience with AWS and Azure cloud services.
- A cloud certification in Azure or AWS is an added advantage.

WBG Culture Attributes

1. Sense of Urgency - Anticipating and quickly reacting to the needs of internal and external stakeholders.
2. Thoughtful Risk Taking - Taking informed and thoughtful risks and making courageous decisions to push boundaries for greater impact.
3. Empowerment and Accountability - Engaging with others in an empowered and accountable manner for impactful results.

The World Bank Group offers comprehensive benefits, including a retirement plan; medical, life, and disability insurance; and paid leave, including parental leave, as well as reasonable accommodations for individuals with disabilities.

We are proud to be an equal opportunity and inclusive employer with a dedicated and committed workforce, and do not discriminate based on gender, gender identity, religion, race, ethnicity, sexual orientation, or disability.

Learn more about working at the World Bank Group, including our values and inspiring stories.

Cloud Site Reliability Engineer

1 week ago

Chennai, Tamil Nadu, India Ford Global Career Site Full time ₹ 15,00,000 - ₹ 25,00,000 per year

Be at the Forefront of Mobility's Future: Join Ford as a Site Reliability Engineer
Enterprise Technology is the engine driving the future of transportation, and we're looking for a talented Site Reliability Engineer (SRE) to help us redefine mobility. In this role, you'll leverage cutting-edge technology to enhance customer experiences, improve lives, and...
Associate, Site Reliability Engineer

3 weeks ago

Chennai, Tamil Nadu, India Pfizer Full time

ROLE SUMMARY: At Pfizer, we make medicines and vaccines that change patients' lives, with a global reach of over 780 million patients. Pfizer Digital is the organization charged with winning the digital race in the pharmaceutical industry. We apply our expertise in technology, innovation, and our business to support Pfizer in this mission. Our team, the Global Supply...
Site Reliability Engineer II

3 weeks ago

Chennai, Tamil Nadu, India Trimble Full time

Your Title: Site Reliability Engineer II. Job Location: Chennai, India. Our Department: Trimble Platform. Are you interested in cutting-edge cloud technologies and ready to get your hands dirty in the cloud world? Do you like to be part of a core team with industry-leading site reliability engineering standards? About the Role: Are you passionate about cutting-edge cloud...
Site Reliability Engineer

2 weeks ago

Chennai, Tamil Nadu, India Elgebra Full time ₹ 6,00,000 - ₹ 18,00,000 per year

Hiring: Site Reliability Engineer – 7+ Years
Location: Bangalore / Chennai
Payroll: Elgebra
Client: Qincline
Joining: Immediate to 15 Days
Role Overview: We are looking for an experienced Site Reliability Engineer (SRE) with over 6 years of expertise to join our team. The ideal candidate will have strong technical skills, a problem-solving mindset, and the...
Civil Site Engineer

1 day ago

Kundrathur, Chennai, Tamil Nadu, India The Chennai Engineer Full time ₹ 1,20,000 - ₹ 3,00,000 per year

Supervise and oversee construction projects to ensure they meet specifications and timelines. Provide technical support and direction to construction teams. Ensure projects comply with health and safety regulations. Coordinate and manage site activities and resources. Handle day-to-day problems that arise on the construction site. Liaise with clients,...
Senior Site Reliability Engineer

2 weeks ago

Tamil Nadu, India Tata Consultancy Services Full time

Dear Candidates,
Greetings from TCS!
TCS is looking for Senior Site Reliability Engineer – AWS
Experience: 8-12 years
Location: Chennai
Must have skills:
- Design, implement, and maintain scalable, secure, and highly available infrastructure on AWS
- Develop and improve CI/CD pipelines, Infrastructure as Code (IaC) using Terraform, Harness
- Own and implement...
Site Reliability Engineer

3 days ago

Tamil Nadu, India Tata Consultancy Services Full time

TCS has been a great pioneer in feeding the fire of young techies like you. We are a global leader in the technology arena and there’s nothing that can stop us from growing together.
What we are looking for
Role: Digital: Site Reliability Engineering (SRE)
Experience Range: 4 – 7 Years
Location: Chennai/Pune/Kolkata
SRE Team Skills: (Must have) In...
Senior Site Reliability Engineer

2 weeks ago

Tamil Nadu, India Poshmark Full time

We’re looking for an experienced Site Reliability Engineer to fill the mission-critical role of ensuring that our complex, web-scale systems are healthy, monitored, automated, and designed to scale. You will use your background as an operations generalist to work closely with our development teams from the early stages of design all the way through...
Mainframe SRE

4 days ago

Chennai, Tamil Nadu, India Kyndryl Full time

Who We Are: At Kyndryl, we design, build, manage, and modernize the mission-critical technology systems that the world depends on every day. So why work at Kyndryl? We are always moving forward, always pushing ourselves to go further in our efforts to build a more equitable, inclusive world for our employees, our customers, and our communities. The Role: Join us as a...
Senior Site Reliability Engineer

1 week ago

Tamil Nadu, India Poshmark Full time

We’re looking for an experienced Site Reliability Engineer to fill the mission-critical role of ensuring that our complex, web-scale systems are healthy, monitored, automated, and designed to scale. You will use your background as an operations generalist to work closely with our development teams from the early stages of design all the way through...
