Senior Staff Site Reliability Engineer

2 weeks ago


india Palo Alto Networks Full time
Our Mission

At Palo Alto Networks® everything starts and ends with our mission:

Being the cybersecurity partner of choice, protecting our digital way of life.

Our vision is a world where each day is safer and more secure than the one before. We are a company built on the foundation of challenging and disrupting the way things are done, and we’re looking for innovators who are as committed to shaping the future of cybersecurity as we are.

Who We Are

We take our mission of protecting the digital way of life seriously. We are relentless in protecting our customers and we believe that the unique ideas of every member of our team contributes to our collective success. Our values were crowdsourced by employees and are brought to life through each of us everyday - from disruptive innovation and collaboration, to execution. From showing up for each other with integrity to creating an environment where we all feel included.

As a member of our team, you will be shaping the future of cybersecurity. We work fast, value ongoing learning, and we respect each employee as a unique individual. Knowing we all have different needs, our development and personal wellbeing programs are designed to give you choice in how you are supported. This includes our FLEXBenefits wellbeing spending account with over 1,000 eligible items selected by employees, our mental and financial health resources, and our personalized learning opportunities - just to name a few

At Palo Alto Networks, we believe in the power of collaboration and value in-person interactions. This is why our employees generally work full time from our office with flexibility offered where needed. This setup fosters casual conversations, problem-solving, and trusted relationships. Our goal is to create an environment where we all win with precision.

Job Description

Your Career

Palo Alto Networks is looking for a talented Senior Site Reliability Engineer for our ever expanding Infrastructure & Cloud Operations. This position will be a part of the Infrastructure team, you will be working and partnering with our Network, Compute, Security, Database, Applications, and other teams to provide availability, reliability, and observability for our global IT infrastructure environments. You will help with building our next-generation IT operations through Automation, Code, Analytics, and continuous improvement. We are looking for analytical, agile, and influential leaders who can quickly deliver meaningful results and solutions with the flexibility to accommodate evolving business needs and shifting priorities. Are you a motivated, intelligent, creative, and hardworking individual who wants to contribute and make a difference? If yes, this job is for you

The ideal candidate enjoys working in a fast-paced environment with highly innovative technologies. Our team partners closely with IT and Engineering groups and requires individuals to bring a can-do, positive attitude, with a focus on delivering exceptional customer support.

Your Impact

Implementing and supporting the Linux infrastructure as code where our globally distributed customer-facing platform runs. Provision, configure & support resilient hybrid cloud deployment architecture using the automation framework and make it more efficient Manage Linux infrastructure CI/CD platform, work with other SREs in deploying and maintaining automation framework, capacity planning, create and review PKI operational runbooks. Manage scalability, capacity planning, redundancy, and resiliency. Maintain service availability and performance SLAs based on business and product requirements. Contribute to documentation related to design, deployment, validation, operations and DR/BCP. Design proactive service monitoring, alerting and trend analysis of underlying infrastructure, and support the operations team in implementation. Build and operate compute fabric for 1000s of VMs, Kubernetes Clusters. Develop scripts, build tools and write code to automate routine tasks. Provide technical support to platform users Respond to security implementation and audits of the environment. Plan maintenance windows, write up change requests, present technical updates. Participate in On-Call support including participating in RCA as required. Design and implement network, compute and application-level monitoring solutions Implement integrated and automated processes that drive operational excellence Advise on industry best practices as it relates to new product selection Drive operational cadences around business planning and performance management to ensure the efficient running of the IT org

Qualifications

Your Experience

First-hand experience with Enterprise infrastructure and application monitoring and reporting tools Strong working experience and exposure to containers and orchestration ( Docker, Kubernetes) Infrastructure as Code knowledge - Terraform, Ansible, Git, Puppet Fluent Scripting skills preferably Python OR Shell OR Bash Exposure to Public Cloud Platforms - GCP (Google cloud) OR AWS Proficient in CI/CD platforms like Jenkins, CircleCI, etc Excellent problem-solving skills; ability to multi-task and prioritize Ability to work independently; works well under pressure Possess solid communication skills, and will be comfortable working in a fast-paced technical environment Background knowledge of network and security technologies Strong hands-on Linux experience in managing and supporting Linux server infrastructure in CentOS/RHEL/Ubuntu. Bachelors/Masters degree in Computer Science, Information Technology or technical stream with the equivalent combination of work experience required. Design and performance tuning for Linux infrastructure and API, in-depth knowledge of multi-tier web applications. Experience in developing and managing APIs, understanding of API infrastructure optimization and security. In-depth knowledge of Certificate Lifecycle Management Fluent in Linux security & system hardening, vulnerability management & patching process. Familiarity with CIS compliance levels. Must be comfortable with Ansible, Chef or similar configuration management tool to manage infrastructure as code and source code control systems such as GIT or SVN. Ability to work cross-functionally across multiple business units, such as product development and engineering Must be able to collaborate with a global team spread across multiple time zones. Passion, drive, energy, a sense of humour and a great attitude 6+ years of relevant experience, Bachelor or Master’s degree in Computer Science or a related technical field. Experience with administration and orchestration of cloud computing (AWS, GCP, etc.) running virtual or container environments. Good user and admin Linux skills (Ubuntu a plus).Experience with virtual networking. Working experience with IaC tools like Terraform and Ansible. Knowledge of Python and shell scripting. Experience with CI/CD development using platforms like - Jenkins, Harness, Artifactory. Solid problem solving, troubleshooting, critical thinking, communication, and teamwork skills. Passion for automation and monitoring instrumentation in the code. Fluency in coding with one or more - Python, Go, Java, You will have to take coding and design tests as required. Experience in Infrastructure as Code environment - Terraform, Ansible.You will be asked to write and troubleshoot IaC code during interview. Proficient in Kubernetes based deployments, CI/CD platforms like Jenkins, Harness etc.. Takes great care in documenting conceptual work, detailed design specifications and can present ideas to engineers and engineering leaders. Knowledge of AIOps, Application of Machine Learning/Artificial Intelligence in Cloud Infrastructure or IT Operations.

Additional experience in one or more of the following areas is a big plus

Development of self-healing infrastructure and applications. Understanding of Big data, data analytics theory and application. Exposure to Enterprise Business Applications, ITSM frameworks and tools is a big plus.

On an everyday basis bring the following traits to succeed:

Self-motivated, decisive, with the ability to work through ambiguity, and adapt to change and competing demands. Excellent problem-solving skills; ability to multitask and prioritize Ability to work independently; works well under pressure Possess solid communication skills, and will be comfortable working in a fast-paced technical environment

Additional Information

The Team

We’re problem solvers that take risks and challenge cybersecurity’s status quo. It’s simple: we can’t accomplish our mission without diverse teams innovating, together.

We are committed to providing reasonable accommodations for all qualified individuals with a disability. If you require assistance or accommodation due to a disability or special need, please contact us at accommodations@paloaltonetworks.com.

Palo Alto Networks is an equal opportunity employer. We celebrate diversity in our workplace, and all qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or other legally protected characteristics.

All your information will be kept confidential according to EEO guidelines.

Our Commitment

We’re problem solvers that take risks and challenge cybersecurity’s status quo. It’s simple: we can’t accomplish our mission without diverse teams innovating, together.

We are committed to providing reasonable accommodations for all qualified individuals with a disability. If you require assistance or accommodation due to a disability or special need, please contact us at accommodations@paloaltonetworks.com.

Palo Alto Networks is an equal opportunity employer. We celebrate diversity in our workplace, and all qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or other legally protected characteristics.

All your information will be kept confidential according to EEO guidelines.

Covid-19 Vaccination Information for Palo Alto Networks Jobs

Vaccine requirements and disclosure obligations vary by country. Unless applicable law requires otherwise, you must be vaccinated for COVID or qualify for a reasonable accommodation if: The job requires accessing a company worksite The job requires in-person customer contact and the customer has implemented such requirements You choose to access a Palo Alto Networks worksite If you have questions about the vaccine requirements of this particular position based on your location or job requirements, please inquire with the recruiter.



  • india Synechron Full time

    We have immediate opportunity forSRE (Senior Site Reliability Engineer) 5 to 9 years. Synechron –BangaloreJob Role: -SRE (Senior Site Reliability Engineer) Job Location: -Bangalore Notice Period:Within 30daysAbout Synechron We began life in 2001 as a small, self-funded team of technology specialists. Since then, we’ve grown our organization to 14,500+...


  • India Akamai Technologies Full time

    Job Description Job Description Do you have the passion to architect and lead the next generation of public cloud infrastructure Would you like to lead modernization initiatives while building a public cloud platform from scratch Join our IaaS Site Reliability Engineering (SRE) team. We design, develop, and operate infrastructure and services that power...


  • India Akamai Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    Would you enjoy improving stability and safety of one of the largest global networks?Would you enjoy hands-on network operations work on a global scale to improve our operational efficiency?Join the Platform Cloud Services Engineering TeamThe Platform Cloud Services SRE team supports globally distributed hosting and database systems for Akamai. These systems...


  • India Akamai Full time

    Do you have the passion to architect and lead the next generation of public cloud infrastructure? Would you like to lead modernization initiatives while building a public cloud platform from scratch? Join our IaaS Site Reliability Engineering (SRE) team. We design, develop, and operate infrastructure and services that power the backbone of our cloud...


  • India Akamai Full time ₹ 8,00,000 - ₹ 25,00,000 per year

    Do you have the passion to architect and lead the next generation of public cloud infrastructure?Would you like to lead modernization initiatives while building a public cloud platform from scratch?Join our IaaS Site Reliability Engineering (SRE) team.We design, develop, and operate infrastructure and services that power the backbone of our cloud platform....


  • India Akamai Technologies Full time

    Job Description Job Description Do you like collaborating across teams to solve complex problems Do you enjoy solving large scale distributed systems problems Join the Mapping SRE team The Mapping SRE team manages availability, reliability, performance, and change processes for Akamai's mapping system. This system routes trillions of daily client...


  • India Akamai Full time

    Do you want to grow your career in Linux and Site Reliability Engineering? Would you like to contribute to the foundation of a new public cloud platform? Join our IaaS Site Reliability Engineering (SRE) team. We design, develop, and operate infrastructure and services that power the backbone of our cloud platform. This is a rare opportunity to help build a...


  • India Akamai Full time ₹ 5,00,000 - ₹ 15,00,000 per year

    Do you want to grow your career in Linux and Site Reliability Engineering?Would you like to contribute to the foundation of a new public cloud platform?Join our IaaS Site Reliability Engineering (SRE) team.We design, develop, and operate infrastructure and services that power the backbone of our cloud platform. This is a rare opportunity to help build a...


  • India PayPal Full time

    Job DescriptionThe CompanyPayPal has been revolutionizing commerce globally for more than 25 years. Creating innovative experiences that make moving money, selling, and shopping simple, personalized, and secure, PayPal empowers consumers and businesses in approximately 200 markets to join and thrive in the global economy.We operate a global, two-sided...


  • India IVedha Inc. Full time

    Senior Site Reliability Engineer (SRE) – ELK Expert | Platform Engineering Practice Location: India (Remote) - Must be available to work in the EST (US/Canada) Time Zone. Role Summary: Are you a Senior Site Reliability Engineer (SRE) with deep ELK expertise, ready to take ownership of large-scale observability infrastructure? We're looking for an...