[Urgent Search] Senior Site Reliability Engineer

2 weeks ago


Bangalore Karnataka, India Booking Holdings Full time

Role Description Our mission at Booking com is to create transformative innovative and personalised travel experiences for millions of customers all across the world We want customers to have an amazing experience wherever and whenever they choose mobile web and through partners and 3rd parties About the team - Private cloud The Private Cloud group operates orchestrates and optimizes Booking-managed cloud infrastructure The Private Cloud capabilities are provided on platform instances that are privately owned and centrally managed by Booking com These platform instances and the workloads running on them are hosted both in Booking data centers on-premises and on public cloud infrastructure AWS The Private Cloud platform has three primary internal customer-facing verticals virtualisation containerisation and server-less corresponding to the three types of workloads it supports At the highest level the Booking Private Cloud drives three primary business outcomes Agility in provisioning and using cloud infrastructure Efficiency in cost and utilisation of cloud infrastructure as well as toil reduction for developers and engineers Trust in the safety reliability and performance of our cloud infrastructure Key Job Responsibilities and Duties The core premise for the Booking SRE lies in treating operational and reliability problems of software systems as a software engineering problem We code our way out of problems where operations are concerned addressing availability scalability latency and efficiency challenges within the vast infrastructure here at Booking We expect our SRE engineers to be software engineers that optimize systems rather than be system operators You will impact millions of people all over the globe with your creative solutions You work in one of the biggest e-commerce companies in the world You will solve exciting problems at scale by writing and deploying code across tens of thousands of servers Ensuring an everything as code mindset for yourself and your team You will have the opportunity to collaborate with many of the world s leading SREs You will be free to launch your own ideas and solutions within our sophisticated production environment Here are some of the tools and technologies we use to achieve this Python Go Puppet Kubernetes Elasticsearch Prometheus HAProxy Cassandra Kafka etc What you ll be doing Design develop and implement software that improves the stability scalability availability and latency of the Booking com products Take ownership of one or more services and have the freedom to do what is best for our business and customers Solve problems occurring with our highly available production systems and build solutions and automation to prevent them from happening again Build effective monitoring to supervise the health of your system and jump in to handle outages Build and run capacity tests to manage the growth of your systems Plan for reliability by designing systems to work across our multinational data centers Develop tools to assist the product development teams with successfully deploying 1000s of change sets every day Be an advocate of engineering standard processes Share the on-call rotation and be an escalation contact for incidents Contribute to Booking com s growth through interviewing on-boarding or other recruitment efforts What you ll bring 8 years hands-on experience in software and site reliability engineering within the technology sector Coupled with expertise with building operating and maintaining sophisticated and scalable systems Solid experience in at least one programming language We use Java Python Go Ruby Perl Experience with Infrastructure as Code technologies Knowledge of cloud computing fundamentals Solid foundation in Linux administration and troubleshooting Understanding of Service level agreements and objectives Additional experience in OpenStack Kubernetes Networking Security or Storage is desirable Supervising observability technologies like Prometheus Graphite Grafana Kibana Elasticsearch are a plus Good interpersonal skills Proficient command of the English language both written and spoken



  • bangalore, India Synechron Full time

    We have immediate opportunity for SRE (Senior Site Reliability Engineer) 5+ years.Synechron – MumbaiJob Role: - SRE (Senior Site Reliability Engineer)Job Location: - MumbaiAbout SynechronWe began life in 2001 as a small, self-funded team of technology specialists. Since then, we've grown our organization to 14,500+ people, across 58 offices, in 21...


  • bangalore, India Josys Full time

    Senior Site Reliability Engineer (SRE)About JOSYSJosys, a dynamic B2B SaaS platform startup, has embarked on a mission to revolutionize IT operations globally, following an exceptional launch in Japan and securing $125 million in Series A and B funding. Our platform enables businesses to conquer the complexities of work-from-anywhere setups, rapid digital...


  • bangalore, India Tata Consultancy Services Full time

    Role**: Senior Site Reliability Engineer (SRE) Required Technical Skill Set: Senior Site Reliability Engineer (SRE) Desired Experience Range: 7 - 10 yrs Notice Period: Immediate to 90Days only Location of Requirement: Bangalore We are currently planning to do a Virtual Interview Job Description: Key Responsibilities Infrastructure & Application Support...


  • bangalore, India Okta Full time

    Join our team Were building a world where Identity belongs to you.Oktas Workforce Identity Cloud Security Engineering group is looking for a Senior Site Reliability Engineer with a passion for DevSecOps , Infrastructure Security , and SRE . Join a team that is not just building solutions but redefining the standards for cloud security. If you have a proven...


  • bangalore, India super Full time

    Site Reliability Engineer (SRE) Level 3Overview:A Site Reliability Engineer (SRE) Level 3 is a senior technical leadership role focused on designing, implementing, and maintaining large-scale, complex, and highly reliable systems. This role emphasizes a blend of software and systems engineering to ensure the availability, latency, performance, and capacity...


  • Bangalore, Karnataka, India Empower Annuity Insurance Full time

    Our vision for the future is based on the idea that transforming financial lives starts by giving our people the freedom to transform their own We have a flexible work environment and fluid career paths We not only encourage but celebrate internal mobility We also recognize the importance of purpose well-being and work-life balance Within Empower and our...


  • Bangalore, Karnataka, India Delta Air Lines Full time

    About Delta Air Lines About the Company Delta Air Lines NYSE DAL is the U S global airline leader in safety innovation reliability and customer experience Powered by our employees around the world Delta has for a decade led the airline industry in operational excellence while maintaining our reputation for award-winning customer service With our mission of...


  • Bangalore, Karnataka, India Pearson Full time

    Job Category Technology Role Overview Learning The Associate Site Reliability Engineer s SRE primary focus will be on acquiring and honing the essential skills required to excel in the role They will work closely with more experienced engineers who will mentor and guide them throughout their journey The responsibilities will encompass various facets of site...


  • Bangalore, Karnataka, India NatWest Group Full time

    Join us as a Site Reliability Engineer In this key role you ll support the improvement of non-functional and operational characteristics such as availability performance efficiency change management monitoring security incident response and capacity planning of our products and services You ll enjoy significant stakeholder interaction working in...


  • Bangalore, Karnataka, India Akamai Full time

    Job Category Site Reliability Do you love collaborating with teams to solve complex problems Are you passionate about cutting-edge technology and ensuring customer success Join our critical Nameserver SRE team We re responsible for defining measuring publishing and optimizing key performance indicators of Akamai s nameserver platform We take a holistic view...