Home
Jobs

Service Reliability Infra Advisor

3 - 6 years

12 - 17 Lacs

Posted:1 day ago| Platform: Naukri logo

Apply

Work Mode

Work from Office

Job Type

Full Time

Job Description

Job Overview:


We are seeking a skilled and experienced Service Reliability Analyst to join our diverse team as part of newly created Service Reliability Centre (SRC). In this role, you will help improve the availability and performance of Arm infrastructure by utilising Arms AI Operations (AIOPS) and observability platforms. You will collaborate closely with development and platform teams to build and maintain robust observability and response processes.

Responsibilities:


  • Lead the

    analysis and resolution of infrastructure incidents

    across physical and virtual servers, storage, identity, and engineering platforms.
  • Work with platform and engineering teams to

    expand monitoring coverage

    , define alert thresholds, and onboard new applications and services into SRC support.
  • Drive

    proactive monitoring, tuning, and optimization

    of systems using

    Dynatrace

    and other observability tools.
  • Look for opportunities to adapt automation to support the AIOps platform
  • Conduct root cause analysis of incidents and implement preventive measures.
  • Management of incidents to suppliers and Arms technical on-call rotas as appropriate
  • To log all issues in the Service Management Tool and manage them to completion within EIT service levels and quality criteria matrix
  • Work on a shift pattern, on a 24/7/365 operating model, while being able to work independently and flexibly in response to emergencies or critical issues

Required Skills and Experience:


  • 3 6 years of hands-on experience in Platform

    Operations

    , or

    Infrastructure Support

    roles.
  • Solid experience with observability tools managing and optimising an enterprise observability (e.g.,

    Dynatrace

    ,

    Datadog

    ,

    Splunk

    ) for real-time monitoring, alerting, and diagnostics.
  • Proficiency in one or more scripting or programming languages (e.g., Python, Java, .NET, Node.js, Ansible or JavaScript).
  • Practical knowledge of

    infrastructure automation

    using

    Ansible

    , including writing and managing playbooks.
  • Understanding of UAM and IAM across on Premise, OUD/LDAP and Azure AD, including fault finding and access issues.
  • Experience supporting Windows and Linux operating systems
  • Experience with engineering tools such as Github, Jira, and Confluence
  • Virtualization and Storage infrastructure, High Performance computing and Cloud services in an enterprise environment.
  • Proficient in ticket management via an ITSM platform such as ServiceNow
  • Experience leading

    incident response

    , driving service restoration and coordinating root cause analysis under pressure.
  • Effective communicator within a team with a proactive approach and personal accountability for outcomes.
  • Ability to analyze incident patterns and metrics to proactively recommend reliability improvements.

Nice To Have Skills and Experience:


  • Exposure to

    high performance computing

    or cloud-native services
  • Proven background in automation and DevOps practices

In Return:



Accommodations at Arm
At Arm, we want to build extraordinary teams. . To note, by sending us the requested information, you consent to its use by Arm to arrange for appropriate accommodations. All accommodation or adjustment requests will be treated with confidentiality, and information concerning these requests will only be disclosed as necessary to provide the accommodation. Although this is not an exhaustive list, examples of support include breaks between interviews, having documents read aloud, or office accessibility. Please email us about anything we can do to accommodate you during the recruitment process.

Equal Opportunities at Arm

Hybrid Working at Arm
#LI-LK2

Accommodations at Arm


At Arm, we want to build extraordinary teams. . To note, by sending us the requested information, you consent to its use by Arm to arrange for appropriate accommodations. All accommodation or adjustment requests will be treated with confidentiality, and information concerning these requests will only be disclosed as necessary to provide the accommodation. Although this is not an exhaustive list, examples of support include breaks between interviews, having documents read aloud, or office accessibility. Please email us about anything we can do to accommodate you during the recruitment process.


Equal Opportunities at Arm


Mock Interview

Practice Video Interview with JobPe AI

Start Service Management Interview Now
cta

Start Your Job Search Today

Browse through a variety of job opportunities tailored to your skills and preferences. Filter by location, experience, salary, and more to find your perfect fit.

Job Application AI Bot

Job Application AI Bot

Apply to 20+ Portals in one click

Download Now

Download the Mobile App

Instantly access job listings, apply easily, and track applications.

coding practice

Enhance Your Python Skills

Practice Python coding challenges to boost your skills

Start Practicing Python Now
ARM Embedded Technologies
ARM Embedded Technologies

Technology / Embedded Systems

San Jose

50-200 Employees

29 Jobs

    Key People

  • Jane Doe

    CEO
  • John Smith

    CTO

RecommendedJobs for You

Kolkata, Mumbai, New Delhi, Hyderabad, Pune, Chennai, Bengaluru

Kolkata, Mumbai, New Delhi, Hyderabad, Pune, Chennai, Bengaluru