
Service Reliability Infra Specialist
4 weeks ago
Job ID 2025-14310 Date posted 16/07/2025 Location Bengaluru, India Category IT
Job Overview
We are seeking a skilled and experienced Service Reliability Specialist to join our diverse team as part of newly created Service Reliability Centre (SRC). In this role, you will help improve the availability and performance of Arm infrastructure by utilising Arms AI Operations (AIOPS) and observability platforms. You will collaborate closely with development and platform teams to build and maintain robust observability and response processes.
Responsibilities
- Serve as the primary technical contact during critical incidents for both on-premise and cloud infrastructure.
- Lead Root Cause Analysis (RCA) for major incidents, identifying contributing factors and actionable remediation.
- Utilize Dynatrace and ServiceNow for correlation analysis, system tracing, and optimizing alerts and visibility.
- Perform detailed diagnostics for virtualization, storage, operating systems, and cloud services during incidents.
- Develop clear and comprehensive runbooks, diagnostic guides, and incident documentation.
- Collaborate post-incident with platform teams to implement improvements via automation, tuning, or design enhancements.
- Coordinate improvements in monitoring, event correlation, and response processes with platform and tooling teams.
- Automate routine diagnostic tasks using scripting (Ansible, Python).
- Provide technical expertise during service onboarding, including setting alert rules, thresholds, and RCA guidelines.
Required Skills And Experience
- 5+ years in Infrastructure Operations or Platform Support.
- Skilled in detailed root cause analysis and impact assessments in complex environments (cloud-native and legacy).
- Expertise with observability tools (Dynatrace, Datadog, Splunk).
- Proficient in managing Linux/Windows servers, virtualization, storage, and identity platforms (LDAP, Azure AD).
- Strong scripting skills (Python, PowerShell, Bash) and infrastructure automation experience using Ansible.
- Familiarity with ITSM processes and incident management using ServiceNow.
- Comfortable with independent work and flexible shift schedules (including off-hours/weekends) as part of a global team.
- Excellent documentation and communication skills to translate technical issues into actionable insights.
- Capable of analyzing incident trends and recommending reliability improvements.
- Knowledge of virtualization, storage infrastructure, high-performance computing, and cloud services.
- Experience with User Access Management (UAM) and Identity Access Management (IAM) on-premise (OUD LDAP) and Azure AD.
- Experience maintaining Windows and Linux operating systems.
- Proficient with engineering tools (GitHub, Jira, Confluence).
Nice To Have Skills And Experience
- Exposure to high performance computing or cloud-native services
- Knowledge to CI/CD tooling (e.g., Jenkins, GitLab) or container-based systems
- Experience defining SLIs, SLOs, and building service health dashboards
In Return
Accommodations at Arm
At Arm, we want to build extraordinary teams. If you need an adjustment or an accommodation during the recruitment process, please email [Confidential Information]. To note, by sending us the requested information, you consent to its use by Arm to arrange for appropriate accommodations. All accommodation or adjustment requests will be treated with confidentiality, and information concerning these requests will only be disclosed as necessary to provide the accommodation. Although this is not an exhaustive list, examples of support include breaks between interviews, having documents read aloud, or office accessibility. Please email us about anything we can do to accommodate you during the recruitment process.
Equal Opportunities at Arm
Arm is an equal opportunity employer, committed to providing an environment of mutual respect where equal opportunities are available to all applicants and colleagues. We are a diverse organization of dedicated and innovative individuals, and dont discriminate on the basis of race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or status as a protected veteran.
Hybrid Working at Arm
Arms hybrid approach to working is centred around flexibility, where we split our time between the office and other locations to get our work done. Within that framework, we empower groups and teams to determine their own particular hybrid working pattern, depending on the work and the teams needs. Details of what this means for each role will be shared upon application. In some cases, the flexibility we can offer is limited by local legal, regulatory, tax, or other considerations, and where this is the case, we will collaborate with you to find the best solution. Please talk to us to find out more about what this could look like for you.
Accommodations at Arm
At Arm, we want to build extraordinary teams. If you need an adjustment or an accommodation during the recruitment process, please email
Hybrid Working at Arm
Arms approach to hybrid working is designed to create a working environment that supports both high performance and personal wellbeing. We believe in bringing people together face to face to enable us to work at pace, whilst recognizing the value of flexibility. Within that framework, we empower groups/teams to determine their own hybrid working patterns, depending on the work and the teams needs. Details of what this means for each role will be shared upon application. In some cases, the flexibility we can offer is limited by local legal, regulatory, tax, or other considerations, and where this is the case, we will collaborate with you to find the best solution. Please talk to us to find out more about what this could look like for you.
Equal Opportunities at Arm
Arm is an equal opportunity employer, committed to providing an environment of mutual respect where equal opportunities are available to all applicants and colleagues. We are a diverse organization of dedicated and innovative individuals, and dont discriminate on the basis of race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or status as a protected veteran.
-
Infra Architect
3 weeks ago
Bengaluru, Karnataka, India Espire Infolabs Private Limited Full timeJob Description- Upgrade/Consolidation/merger of different forest Active directory domains, DNS, DHCP, users, groups, and security policies- Consolidation/merger of applications and other services from another domain to the parental domain. Like Wi-Fi, SharePoint etc.- Configuration management for Infra projects- Project Management (Assessment, Planning,...
-
Service Reliability Infra Advisor
3 weeks ago
Bengaluru, Karnataka, India Arm Full timeJob DescriptionJob ID 2025-14309 Date posted 16/07/2025 Location Bengaluru, India Category ITJob OverviewWe are seeking a skilled and experienced Service Reliability Analyst to join our diverse team as part of newly created Service Reliability Centre (SRC). In this role, you will help improve the availability and performance of Arm infrastructure by...
-
Service Reliability Infra Analyst
3 weeks ago
Bengaluru, Karnataka, India Arm Full timeJob DescriptionJob ID 2025-14314 Date posted 16/07/2025 Location Bengaluru, India Category ITJob OverviewWe are seeking a skilled and experienced Service Reliability Advisor to join our diverse team as part of newly created Service Reliability Centre (SRC). In this role, you will help improve the availability and performance of Arm infrastructure by...
-
AI Infra Architect
3 weeks ago
Bengaluru, Karnataka, India People Prime Worldwide Full timeAbout Company:Our Client Corporation provides digital engineering and technology services to Forbes Global 2000 companies worldwide. Our Engineering First approach ensures we can execute all ideas and creatively solve pressing business challenges. With industry expertise and empowered agile teams, we prioritize execution early in the process for impactful...
-
AI Infra Architect
2 weeks ago
Bengaluru, Karnataka, India People Prime Worldwide Full timeAbout Company: Our Client Corporation provides digital engineering and technology services to Forbes Global 2000 companies worldwide. Our Engineering First approach ensures we can execute all ideas and creatively solve pressing business challenges. With industry expertise and empowered agile teams, we prioritize execution early in the process for impactful...
-
Urgent) AI Infra Architect
3 weeks ago
Bengaluru, Karnataka, India People Prime Worldwide Full timeAbout Company:Our Client Corporation provides digital engineering and technology services to Forbes Global 2000 companies worldwide. Our Engineering First approach ensures we can execute all ideas and creatively solve pressing business challenges. With industry expertise and empowered agile teams, we prioritize execution early in the process for impactful...
-
Highly Experienced Reliability Specialist
3 days ago
Bengaluru, Karnataka, India beBeeReliability Full time ₹ 9,00,000 - ₹ 12,00,000Job Title: Highly Experienced Reliability SpecialistAbout Us:We are a leading global consulting company, providing innovative solutions to our clients. Our team of experts works together to deliver exceptional results and exceed customer expectations.Position Summary:We are seeking a highly experienced reliability specialist to join our technology team. This...
-
Cloud Reliability Specialist
7 hours ago
Bengaluru, Karnataka, India beBeeCloudReliability Full time ₹ 1,04,000 - ₹ 1,30,878Job Title: Cloud Reliability SpecialistAbout the Role:We are seeking a highly skilled Cloud Reliability Specialist to join our team. As a key member of our infrastructure support team, you will be responsible for maintaining and improving the reliability, availability, and performance of AWS-based infrastructure and applications.Key Responsibilities:Maintain...
-
Reliable Infrastructure Specialist
2 days ago
Bengaluru, Karnataka, India beBeeInfrastructure Full time ₹ 9,00,000 - ₹ 12,00,000Job OverviewWe are seeking a highly skilled Reliable Infrastructure Specialist to join our team. The ideal candidate will have a strong background in system administration and network configuration.About the RoleThis is a critical position that requires exceptional technical skills, attention to detail, and effective communication abilities. The Reliable...
-
Reliable Home Systems Specialist
5 days ago
Bengaluru, Karnataka, India beBeeSpecialist Full timeReliable Home Products SpecialistThis position requires a skilled professional to design, launch, and maintain reliable Home products and services. The ideal candidate will have experience with software development, data structures, and algorithms.The successful candidate will be responsible for identifying opportunities to improve the reliability of Home...