Reliability Engineer

2 weeks ago


Bengaluru India CrowdStrike Full time

Job Description

As a global leader in cybersecurity, CrowdStrike protects the people, processes and technologies that drive modern organizations. Since 2011, our mission hasn't changed - we're here to stop breaches, and we've redefined modern security with the world's most advanced AI-native platform. We work on large scale distributed systems, processing almost 3 trillion events per day and this traffic is growing daily. Our customers span all industries, and they count on CrowdStrike to keep their businesses running, their communities safe and their lives moving forward. We're also a mission-driven company. We cultivate a culture that gives every CrowdStriker both the flexibility and autonomy to own their careers. We're always looking to add talented CrowdStrikers to the team who have limitless passion, a relentless focus on innovation and a fanatical commitment to our customers, our community and each other. Ready to join a mission that matters The future of cybersecurity starts with you.

About the Role:

CrowdStrike is looking to hire an Engineer III to the TechOps SRE team that will have a focus on our Commercial Cloud. We're looking for a deeply-technical, hands-on engineer, who loves to develop automation and tooling through software to ensure delivery of mission critical solutions and services for large-scale distributed systems.

What You'll Do:

- Expertise with Linux engineering and administration for thousands of bare metal servers and virtual machines

- Responsible for troubleshooting server hardware issues

- Responsible for all operational aspects of our platform - Availability, Latency, Throughput, Monitoring, Issue Response (analysis, remediation, deployment) and Capacity Planning with respect to Latency and Throughput

- Work in a team of highly motivated engineers distributed across the globe

- On-call rotation with other team members

- Use your passion for technology, automation, and tooling to ensure our platform operates 24x7

- Obsess about learning, and champion the newest technologies & tricks with others, raising the technical IQ of the team. We don't expect you to know all the technology we use but you will be able to get up to speed on new technology quickly

- Have broad exposure to our entire architecture and become one of our experts in our overall process flow

- Have an intrinsic drive to make things better

- Bias towards small/medium development projects and the occasional larger projects

- Have experience with modern monitoring and telemetry stacks (ELK, Prometheus, Grafana)

- Gather and analyze metrics from both operating systems and applications to assist in performance tuning

- Ability to lead incident analysis for incidents, champion incident response practices and assist in correlating incidents to systemic problems, and drive towards resolution.

What You'll Need:

- Bachelor's degree and/or equivalent experience in Computer Science

- 6+ years of experience in software engineering

- 6+ years of experience in one or more of: Java, Python, Go

- Experience with storage technologies (Examples: SAN, NAS, NFS, Object Storage, FreeNAS, iSCSI)

- Experience with Infrastructure technologies (Examples: Linux, Windows, VMware, Docker, Kubernetes, etc.)

- Experience writing technical documentation

- Configuration management experience with one or more tools such as Puppet, Chef, Ansible

- Solid understanding of application design, including operational trade-offs of various designs

- Analytical skills coupled with a strong sense of urgency, ownership, and drive

- Ability to work with well in a diverse, team-focused environment with other SREs and Engineers

- Ability to broadly communicate and present recommended conventions defined by the reliability team broadly

#LI-VJ1

Benefits of Working at CrowdStrike:

- Remote-friendly and flexible work culture

- Market leader in compensation and equity awards

- Comprehensive physical and mental wellness programs

- Competitive vacation and holidays for recharge

- Paid parental and adoption leaves

- Professional development opportunities for all employees regardless of level or role

- Employee Networks, geographic neighborhood groups, and volunteer opportunities to build connections

- Vibrant office culture with world class amenities

- Great Place to Work Certified across the globe

CrowdStrike is proud to be an equal opportunity employer. We are committed to fostering a culture of belonging where everyone is valued for who they are and empowered to succeed. We support veterans and individuals with disabilities through our affirmative action program.

CrowdStrike is committed to providing equal employment opportunity for all employees and applicants for employment. The Company does not discriminate in employment opportunities or practices on the basis of race, color, creed, ethnicity, religion, sex (including pregnancy or pregnancy-related medical conditions), sexual orientation, gender identity, marital or family status, veteran status, age, national origin, ancestry, physical disability (including HIV and AIDS), mental disability, medical condition, genetic information, membership or activity in a local human rights commission, status with regard to public assistance, or any other characteristic protected by law. We base all employment decisions--including recruitment, selection, training, compensation, benefits, discipline, promotions, transfers, lay-offs, return from lay-off, terminations and social/recreational programs--on valid job requirements.

If you need assistance accessing or reviewing the information on this website or need help submitting an application for employment or requesting an accommodation, please contact us at for further assistance.



  • Bengaluru, Karnataka, India, Karnataka Chevron Full time

    Job DescriptionThe Subsurface Reliability Engineer is part of the Production Engineering team within the Chevron ENGINE Center and is responsible for ensuring the reliability and efficiency of subsurface operations across Chevron’s Shale and Tight (S&T) assets. The successful candidate will work closely with multidisciplinary teams to optimize production...


  • Bengaluru, India Landmark Group Full time

    Job Description Job Title: SRE Lead (Engineering & Reliability) Job Summary: We are seeking an experienced and dynamic Site Reliability Engineering (SRE) Lead to oversee the reliability, scalability, and performance of our critical systems. As an SRE Lead, you will play a pivotal role in establishing and implementing SRE practices, leading a team of...


  • Bengaluru, Karnataka, India, Karnataka ViewSonic Full time

    Job Requirements:Bachelor's degree in Computer Science, Engineering, or a related field.3+ year of experience in a relevant role, such as Site Reliability Engineer, DevOps Engineer, or similar, is preferred but not mandatory.Basic understanding of AWS solutions including EC2, S3, CloudWatch, Lambda, and RDS.Interest and understanding of Platform Engineering...


  • Bengaluru, India VidPro Consultancy Services Full time

    Job Description Experience: 2.55 Years Location: Bangalore (On-site) Work Mode: 5 Days WFO Mandatory Skills: Site Reliability engineer or SRE ,Linux, System architecture, TCP/IP. HTTP,DNS ,Grafana, Prometheus and Loki Troubleshooting ,Root cause, complex systems ,Ci/CD, Docker, Kubernetes Experience : 2-4 years of relevant experience Key Skills...


  • Chennai, Tamil Nadu, India, Tamil Nadu Supply Chain Resources Group, Inc. Full time

    ResponsibilitiesTranslate product management reliability goals into appropriate testable goals.Perform statistical data analysis, Accelerated Life Testing (ALT) and modeling, and risk assessment.Develop reliability performance metrics and lead management reviews to review progress against those metrics.Drive the failure analysis process for all failures...


  • Bengaluru, India BNP Paribas Full time

    Job Description Dear Candidate, BNP Paribas is hiring for Sire Reliability Engineer for Bangalore location! Kindly apply on the below link asap if interested, we shall take your candidature ahead post the application is submitted: https://bwelcome.hr.bnpparibas/su/cba292db5cf89f02 Technical & Behavioral Competencies : Mandatory skills: Site Reliability...


  • India Concord Full time

    SRE Sr. Engineers (Individual Contributors) Key Attributes: - Strong SRE (Site Reliability Engineering) experience - DevOps skills – CI/CD, monitoring, automation, infrastructure as code, etc. - Excellent troubleshooting and debugging skills (infrastructure + application level) - Perseverance – must push through complex/challenging issues without...


  • India Employ Full time

    Role - Site Reliability Engineer (SRE)/ Platform Engineering/ or Dev Ops Engineering roles Location – Fully Remote Type - 6 months Contract Work Ex - 5+ Yrs We’re working with a AI product company that’s building the next generation of Gen AI powered developer platforms . We’re looking for an experienced Site Reliability Engineer to join...


  • India Concord Full time

    SRE Sr. Engineers (Individual Contributors) Key Attributes : Strong SRE (Site Reliability Engineering) experience Dev Ops skills – CI/CD, monitoring, automation, infrastructure as code, etc. Excellent troubleshooting and debugging skills (infrastructure + application level) Perseverance – must push through complex/challenging issues without...


  • Bengaluru, India ExxonMobil Corporation Full time

    What you will do Knowledgeable and hands on practice in performing the below activities: Criticality Assessment Equipment Strategy development – Fleet based, RCM Based Data analysis techniques that can include: Reliability modeling and prediction Fault Tree Analysis Weibull Tree Analysis Root-Cause Failure Analysis Single Point of Failure studies LCCA...