Site Reliability Engineering Technical Leader

1 month ago


Bengaluru, India Cisco Full time

Who we are

Today’s challenging business environment is more than that – it’s a period of disruption between the pandemic, global business change and internal process complexity. For us to focus on simplicity and the best customer experience, we need great talent and the right abilities to be successful. This is now a mantra for our Cisco leadership team and for us.

Cisco is redefining its platforms to run the next generation of cloud-native and multi-cloud services. This role offers a superb opportunity to transform how infrastructure platforms are developed, handled with full software automation and at the same time is highly available with self-healing, full lifecycle monitoring, and management capabilities.

What you’ll Do

As a Site Reliability Engineer Technical Leader within the platform engineering group in Network Engineering Operation, a division of CISCO IT, you will use an array of tools and integrations to deliver and lead a suite of foundational services pivotal to Cisco's essential business functions through GitOps. We are in search of a technically astute leader who is proficient in DevOps and GitOps to helm a flexible and adaptable team of multi-skilled engineers responsible for maintaining CISCO's internal network infrastructure. Your role will include taking charge of agile leadership responsibilities and acting as a Partner Security Advocate to safeguard both the infrastructure and application stack.

Responsibilities:

Manage CISCO Network Management and Orchestrator tools (Catalyst Center, NSO etc.) Integrate Observability Stack and handle lifecycle and operations of network infra Ensure the quality, performance, robustness, and scalability of the services that are implemented, perform bug fixes and triaging issues Automate the development, testing, and deployment processes through CI/CD pipelines (GitHub, GitHub Action, Jenkins, Helm, ArgoCD) Champion and drive the adoption of Infrastructure as Code (IaC) practices and demeanour Write terraform automations for infrastructure and application deployment. Lead Software development lifecycle including design, development, testing, packaging, deployment, upgrade, and support (Python). Collaborate with other core services team members to define roadmaps, write clear user stories with well-defined acceptance criteria, design, and build solutions Applies global knowledge of IT Infrastructure to develop standard solutions that can be leveraged across multiple areas; Supplies to the development of new technical principles and concepts Evaluate new and emerging technology and determine applicability in collaboration with technical leaders Proactively engages and/or creates multi-functional teams to tackle problems or add business value Generates ideas and/or technical strategies and presents them to peers, leaders for feedback Influences others to support/implement ideas and/or technical strategies through collaboration with leaders and peers in the organization Creating standards and policies and influencing technology decisions beyond own functional area or project; Practice DevOps supporting application from development through the operation lifecycle Responsible for resolving and setting SLO’s, creating adequate monitoring and logging for features so that SLO can successfully be measured Provide on-call support

Who you will work with

Within the Network Engineering Operation sector of CISCO IT, our team handles the management and upkeep of CISCO-developed network management and orchestration products, such as Catalyst Center, NSO, BPA, Matrix, EPNM, and CISCO Spaces, which are integral to our Foundation Services.

As a member of our team, you'll collaborate with enthusiastic Site Reliability Engineers dedicated to developing cloud-based applications and advancing our utilization of multi-cloud and on-premises platforms within the company. Your role will involve creating new microservices and infrastructure enhancements to optimize our business workflows. We operate within a dynamic, agile setting, leveraging CISCO's suite of products to enhance our operational processes.

Who You Are

We are on the lookout for a skilled and accomplished Site Reliability Engineering Technical Leader who possesses exceptional leadership capabilities and a fervent desire to spearhead the automation of enterprise network infrastructure and the implementation of DevOps practices. Your professional history should demonstrate expertise in architecting, programming, and overseeing operations code for cloud infrastructures utilizing open-source technologies. Additionally, you have a background in guiding the development of software applications across both private and public cloud computing environments.

Required Technical Skills and Experience

8+ years of solid hands-on software development leading experience with a focus on continuous delivery and deployment and cloud automation Software programming experience in one or more programming languages (Python preferred) IaC experience – Terraform, Ansible, Github, Github Actions, Jenkins, Helm, ArgoCD, Conjur/Vault Public & Private Cloud experience (AWS/GCP/OpenStack) Software design patterns, SDLC, OpenSource Development, Test Driven Development (TDD), Continuous Integration and Continuous Delivery Very Good understanding of Linux/RHEL systems and hands-on Kubernetes(k8S) working knowledge Monitoring & Logging systems – Prometheus/ELK Security protocols including OS hardening, firewalls, iptables, and working with Infosec. Experience building cloud-based application using micro-services and deploying in containerized environments Excellent knowledge of building cloud-native and server-side RESTful applications, APIs and automation tools Domain knowledge about contemporary network technologies, network management and protocols CCNA/CCNP/CISSP is a plus Bachelor’s degree in CS/CE/EE or equivalent is required

Non-Technical Requirements:

Leadership in building and maintaining SRE technologies. Mentor/Coach team Experience working in an agile development environment. Work with geographically distributed teams. Understand IT processes, including Design, implementation, and Operations. Strong analytical and problem-solving skills Effective communication and collaboration skills with ability to engage and influence Self-motivated, able, and willing to help where help is needed. Able to build and establish relationships, be culturally sensitive, have goal alignment and learning agility. Ambitious to work with geographically distributed teams

  • Bengaluru, India First American (India) Full time

    The Role: A SRE Manager is ultimately responsible for system reliability, developer productivity and reducing time to market by striving to reduce technical debt of the services your SRE team supports. We seek managers who are passionate about site reliability to influence and drive the strategic SRE mission. As a Site Reliability Engineering Manager...


  • Bengaluru, India Ensono Full time

    About Role Ensono is continuing its growth and building a cloud-native managed service offering for our clients. We are looking for energetic and skilled remote Site Reliability Engineers to join us on this exciting new journey. As a Site Reliability Engineer, you and your team will be responsible for between four and ten of Ensono cloud-native managed...


  • Bengaluru, India First American (India) Full time

    The Role:A SRE Manager is ultimately responsible for system reliability, developer productivity and reducing time to market by striving to reduce technical debt of the services your SRE team supports. We seek managers who are passionate about site reliability to influence and drive the strategic SRE mission.As a Site Reliability Engineering Manager working...


  • Bengaluru, India Ensono Full time

    About RoleEnsono is continuing its growth and building a cloud-native managed service offering for our clients. We are looking for energetic and skilled remote Site Reliability Engineers to join us on this exciting new journey. As a Site Reliability Engineer, you and your team will be responsible for between four and ten of Ensono cloud-native managed...


  • Bengaluru, India Vistex Full time

    Vistex is currently hiring a Site Reliability Engineer. The Vistex Site Reliability Engineer will be primarily responsible for service availability, performance, monitoring, incident response, and capacity planning. This is a highly technical, hands-on role with a strong focus on automation, accurate monitoring, actionable alerting, resilient design,...


  • Bengaluru, Karnataka, India Vistex Full time

    Vistex is currently hiring a Site Reliability Engineer. The Vistex Site Reliability Engineer will be primarily responsible for service availability, performance, monitoring, incident response, and capacity planning. This is a highly technical, hands-on role with a strong focus on automation, accurate monitoring, actionable alerting, resilient design,...


  • Bengaluru, Karnataka, India Ensono Full time

    About RoleEnsono is continuing its growth and building a cloud-native managed service offering for our clients. We are looking for energetic and skilled remote Site Reliability Engineers to join us on this exciting new journey. As a Site Reliability Engineer, you and your team will be responsible for between four and ten of Ensono cloud-native managed...


  • Bengaluru, India First American (India) Full time

    The Role:A SRE Manager is ultimately responsible for system reliability, developer productivity and reducing time to market by striving to reduce technical debt of the services your SRE team supports. We seek managers who are passionate about site reliability to influence and drive the strategic SRE mission.As a Site Reliability Engineering Manager working...


  • Bengaluru, India First American (India) Full time

    The Role:A SRE Manager is ultimately responsible for system reliability, developer productivity and reducing time to market by striving to reduce technical debt of the services your SRE team supports. We seek managers who are passionate about site reliability to influence and drive the strategic SRE mission.As a Site Reliability Engineering Manager working...


  • Bengaluru, Karnataka, India First American (India) Full time

    The Role: A SRE Manager is ultimately responsible for system reliability, developer productivity and reducing time to market by striving to reduce technical debt of the services your SRE team supports. We seek managers who are passionate about site reliability to influence and drive the strategic SRE mission. As a Site Reliability Engineering Manager...


  • Bengaluru, Karnataka, India First American (India) Full time

    The Role:A SRE Manager is ultimately responsible for system reliability, developer productivity and reducing time to market by striving to reduce technical debt of the services your SRE team supports. We seek managers who are passionate about site reliability to influence and drive the strategic SRE mission.As a Site Reliability Engineering Manager working...


  • Bengaluru, India First American (India) Full time

    The Role:A SRE Manager is ultimately responsible for system reliability, developer productivity and reducing time to market by striving to reduce technical debt of the services your SRE team supports. We seek managers who are passionate about site reliability to influence and drive the strategic SRE mission.As a Site Reliability Engineering Manager working...


  • Bengaluru, India First American (India) Full time

    The Role: A SRE Manager is ultimately responsible for system reliability, developer productivity and reducing time to market by striving to reduce technical debt of the services your SRE team supports. We seek managers who are passionate about site reliability to influence and drive the strategic SRE mission. As a Site Reliability Engineering Manager...


  • Bengaluru, Karnataka, India First American (India) Full time

    The Role:A SRE Manager is ultimately responsible for system reliability, developer productivity and reducing time to market by striving to reduce technical debt of the services your SRE team supports. We seek managers who are passionate about site reliability to influence and drive the strategic SRE mission.As a Site Reliability Engineering Manager working...


  • Bengaluru, Karnataka, India Ensono Full time

    About Role Ensono is continuing its growth and building a cloud-native managed service offering for our clients. We are looking for energetic and skilled remote Site Reliability Engineers to join us on this exciting new journey. As a Site Reliability Engineer, you and your team will be responsible for between four and ten of Ensono cloud-native managed...


  • Bengaluru, India Staffopedia Consulting LLP Full time

    Looking for a Head of Engineering & Site Leader from a SAAS Background .15+ years of experience in engineering leadership roles, with a proven track record of driving successful AI system integration efforts, with a focus on delivering high-performance, scalable, and reliable solutions that meet customer needs.Deep technical expertise in AI infrastructure,...


  • Bengaluru, India Staffopedia Consulting LLP Full time

    Looking for a Head of Engineering & Site Leader from a SAAS Background .15+ years of experience in engineering leadership roles, with a proven track record of driving successful AI system integration efforts, with a focus on delivering high-performance, scalable, and reliable solutions that meet customer needs.Deep technical expertise in AI infrastructure,...


  • Bengaluru, India Staffopedia Consulting LLP Full time

    Looking for a Head of Engineering & Site Leader from a SAAS Background .15+ years of experience in engineering leadership roles, with a proven track record of driving successful AI system integration efforts, with a focus on delivering high-performance, scalable, and reliable solutions that meet customer needs.Deep technical expertise in AI infrastructure,...


  • Bengaluru, India Staffopedia Consulting LLP Full time

    Looking for a Head of Engineering & Site Leader from a SAAS Background . 15+ years of experience in engineering leadership roles, with a proven track record of driving successful AI system integration efforts, with a focus on delivering high-performance, scalable, and reliable solutions that meet customer needs. Deep technical expertise in AI...


  • Bengaluru, India Staffopedia Consulting LLP Full time

    Looking for a Head of Engineering & Site Leader from a SAAS Background .15+ years of experience in engineering leadership roles, with a proven track record of driving successful AI system integration efforts, with a focus on delivering high-performance, scalable, and reliable solutions that meet customer needs.Deep technical expertise in AI infrastructure,...