Principal Network Reliability Engineer

1 week ago


India Oracle Full time

Job Description

The Oracle Cloud Infrastructure (OCI) delivers mission-critical applications for top-tier enterprises around the world. Our cloud offers unmatched hyper-scale, multi-tenant services deployed in more than 40 regions worldwide. The mission of our Network Reliability Engineering team is to provide exceptional network reliability and automation services that enable our customers to drive operational excellence in OCI networks at scale. By focusing on both reactive and proactive functions, we aim to minimize downtime, quickly resolve incidents, and continuously enhance network performance through automation, advanced monitoring, and a customer-centric approach.

As a Principal Network Reliability Engineer, you will play a critical role in designing, building, testing, deploying, and operating highly reliable, scalable network solutions to support Oracle's next-generation Cloud Infrastructure. You will help ensure the reliability and availability of large-scale distributed systems, managing hundreds of thousands of networking devices.

You will contribute to both proactive and reactive initiatives: automating processes, implementing advanced monitoring, swiftly resolving incidents, and continuously improving network performance. You possess strong coding abilities, a deep understanding of networking and distributed systems, and a passion for automation to drive operational excellence.

You thrive in a collaborative, agile environment, effectively manage multiple projects and priorities, and consistently deliver results in fast-paced, dynamic conditions. Most importantly, you are a dedicated team player, eager to learn and adapt, and committed to helping the team achieve exceptional standards of network reliability.

What you will bring:

  • Bachelor's degree in CS or a related engineering field with 10+ years of network engineering experience, or a Master's degree or equivalent experience with 8+ years of network engineering experience.
  • Experience working in a large ISP or cloud provider environment.
  • Experience working in a network operations/reliability engineering role.
  • Solid understanding of protocols such as MPLS, BGP, OSPF, IS-IS, TCP, IPv4, IPv6, DNS, and DHCP. Knowledge of VXLAN and EVPN is an added advantage.
  • Extensive experience with scripting or automation and with data center design. Python is preferred, but candidates must demonstrate expertise in a scripting or compiled language.
  • Experience with networking protocols such as TCP/IP, VPN, DNS, DHCP, and SSL.
  • Experience with network monitoring and telemetry solutions.
  • Experience with network modeling and programming (YANG, OpenConfig, NETCONF).
  • Ability to use professional concepts and company objectives to resolve sophisticated issues in creative and effective ways. Capable of working under limited supervision.
  • Excellent organizational, verbal, and written communication skills.
  • Excellent judgment in influencing product roadmap direction, features, and priorities.
  • Participate in an on-call rotation.

Responsibilities

Support the design, deployment, and operations of large-scale global Oracle Cloud Infrastructure (OCI). The role is primarily focused on the development and support of network fabric and systems, combining a deep understanding of networking at the protocol level with programming skills. As OCI is a cloud-based network with a global footprint, this support will include hundreds of thousands of network devices supporting millions of servers, connected over a mix of dedicated backbone infrastructure, Clos networks, and the Internet.

  • Ownership mindset: delivering results, embracing ambiguity, and driving continuous improvements.
  • Collaborate with program/project managers to develop breakthroughs and results.
  • Primarily use existing procedures and tools to develop and safely implement network changes; develop new procedures when needed.
  • Develop solutions to enable front line support teams to act on network failure conditions.
  • Mentor junior engineers.
  • Participate in the network solution and architecture design process.
  • Participate in operational rotations as either primary or secondary.
  • Provide break-fix support for events. Serve as the point of contact for partner concerns during event remediation. Lead post-event root cause analysis.
  • Coordinate with networking automation services for the development and integration of support tooling.
  • Coordinate with network monitoring to capture telemetry and build alert rules from it.
  • Build dashboards that represent data at various network layers and device roles to help identify network issues and anomalies.
  • Frequently develop scripts to automate routine tasks for the team and business units.
  • Serve as SME on software development projects for network automation and network monitoring.
  • Collaborate with the network vendor technical account team and the internal Quality Assurance team to drive bug resolution and assist in the qualification of new firmware and/or operating systems.

Qualifications

Career Level - IC4



  • India Oracle Full time

    Job Description The NRE (Network Reliability Engineering) team is accountable for ensuring the robustness of the Oracle Cloud Network Infrastructure. A Network Reliability Engineer (NRE) role is primarily focused on applying an engineering approach to measure a network's reliability to align with the organization's service-level objectives,...


  • Karnataka, Karnataka, India NIKE Full time

    PRINCIPAL SITE RELIABILITY ENGINEER India Technology Center WHO YOU WILL WORK WITH The Principal Site Reliability Engineer will work alongside a talented team of Site Reliability Engineers focused on delivering reliable and observable software used by millions of athletes* around the world. You will be a part of the Resilience Engineering organization which...

  • Vice Principal

    2 weeks ago


    India Aga Khan Development Network Full time

    Sector: Social Development About the Agency: The Aga Khan Education Services is one of the largest private, not-for-profit, non-denominational educational networks in the Global South. AKES currently operates over 190 pre-primary, primary, secondary, and higher secondary schools and more than 100 non-formal education programmes in diverse geographic locations in...


  • India Vantive Manufacturing Full time

    Vantive is a vital organ therapy company on a mission to extend lives and expand possibilities for patients and care teams everywhere. For 70 years, our team has driven meaningful innovations in kidney care. As we build on our legacy, we are deepening our commitment to elevating the dialysis experience through digital solutions and advanced services, while...


  • Bengaluru, India Oracle Full time

    Job Description The NRE (Network Reliability Engineering) team is accountable for ensuring the robustness of the Oracle Cloud Network Infrastructure. A Network Reliability Engineer (NRE) role is primarily focused on applying an engineering approach to measure and automate a network's reliability to align with the organization's service-level objectives,...


  • India S&P Global Full time

    About the Role Grade Level (for internal use): 12 S&P Global - Corporate About the Role: Principal Network Engineer The Team: Global Network Services is looking for a key player in the Data Center and Cloud Network Engineering team to design and implement networking solutions in data center and cloud environments. This role requires a seasoned engineer who works...


  • Pune, India Amadeus Full time

    Job Description Job Title Principal Service Reliability Engineer Common Accountabilities - Proficient in technical knowledge to ensure team performs at a high level. Is recognized as a leader in own area and may formally train Specialists/Senior Specialists. - Understands how main business drivers may impact on own area. Can assess complex problems with...


  • Hyderabad, India Cubic Corporation Full time

    Job Description Business Unit: Cubic Transportation Systems Company Details: When you join Cubic, you become part of a company that creates and delivers technology solutions in transportation to make people's lives easier by simplifying their daily journeys, and defense capabilities to help promote mission success and safety for those who serve their nation....


  • India Prophecy Full time ₹ 12,00,000 - ₹ 24,00,000 per year

    About Prophecy: Prophecy is a rapidly growing startup enabling all data users to visually build data pipelines with modern software practices, including code on GitHub, using its Low-Code Data Engineering Platform. Prophecy is trusted by top Fortune 500 firms to replace their legacy ETL tools as they re-platform to the Cloud or Apache Spark. We're very well...

  • Network Engineer

    3 weeks ago


    Chennai, India Antal International Network Full time

    Job Description Key Responsibilities: - Design, implement, and manage Illumio-based Zero Trust and microsegmentation solutions across hybrid environments (cloud, on-premises, containers). - Develop and enforce Zero Trust Network Architecture (ZTNA) policies to reduce attack surfaces and prevent lateral movement. - Collaborate with security,...