
Observability Support Engineer
1 week ago
Position Purpose
The main responsibility of Stability & Resilience division is to support the IT strategy & Production and gathers activities contributing directly to the stability and integrity of the Production and to the Information Systems resilience.
Within the division, the domain Global Monitoring & Log Analytics oversees Global Production Observability Systems and provides platforms and services around Elasticsearch, Splunk and Dynatrace technologies.
This domain includes the following key services:
Global Monitoring, providing Dynatrace services
Splunk (decommissioning by and of the year)
Logs As a Service, providing log management platform as a service based on Elastic stack (Elasticsearch, Kibana, Fleet, Elastic Agent, Logstash, Ingest pipelines) and Kafka technology.
Elastic As a Service, providing
o Elasticsearch (+Kibana) dedicated specific clusters for some applications on its servers
o Elasticsearch dedicated standard clusters on dMZR (based on an IBM Cloud product)
CyberSOC central data platform (Databus based on Kafka+Logstash, and DAP based on Elasticsearch)
Leveraging BNP Paribas Paris teams expertise and ISPL IT skills, the goal is to enable applications flawless production by providing secure and stable environments and by ensuring that all actions on production environments are done in a controlled manner.
The Observability Support Engineer will be integrated closely in the STA04 domains SRE & Data Engineering team members which are in charge of:
Keeping a monitoring/alerting system to correctly manage infrastructure of our internal services (Log as a Service, Dedicated Elasticsearch cluster as a service, Global Monitoring)
Manage data preparation on observability metrics to take maximum benefit from it
Create and make evolutions of specific alerts and dashboards on our components and services, with high level and top/down approach, to provide best quality of service
Define house keeping procedures and surveillance, including morning and evening checks
Implement SRE approch (SLI/SLO for quality/perf improvment and reduction of incidents rate and impacts)
Responsibilities
Direct Responsibilities
Take care of our infrastructure of our internal services (Log as a Service, Dedicated Elasticsearch cluster as a service, Global monitoring) and ensure that performance and features are ok for our customers
Provide support for our customers (incident management, help on usages)
Make evolutions and enrich our end-user and internal documentation
Identify bugs or needed evolutions on our services for our customers to have an easier and richer solution, and for our team to reduce manual and/or recurrent actions
For a predefined applications scope take care of ITSM processes based on ITIL framework:
o Incidents
o Requests
o Changes
Ensure that SLA targets are met for above activities
Handover to Paris teams if knowledge and skills are not available in ISPL
General Responsibilities
Contribute to the knowledge transfer with Paris teams
Contribute to the definition of procedures and processes necessary for the team
Help build team spirit and integrate into BNP Paribas culture
Contribute to the regular activity reporting and KPI calculation
Contribute to continuous improvement actions
Work with cross-functional teams to ensure IT services align with business needs and service level agreements (SLAs).
Technical & Behavioral Competencies
Mandatory Skills
Expertise on usage and administration of observability systems (Elasticsearch, Kibana, Grafana, Logstash, Kafka, Dynatrace, or others)
Decent knowledge of modern observability practices (SRE, Log Management, SLI/SLO, Synthetic, APM, RUM )
Good knowledge on containerization technologies (Docker, Kubernetes, Nomad, OpenShift)
Medium level knowledge in script development (Python, Shell, PowerShell, )
Medium level knowledge in Ansible and Ansible Tower
Common knowledge of CI/CD tools like gitlab, gitlab runner, jenkins, .
Understanding of ITIL or similar ITSM frameworks & tools
Experience with Service Now ticketing system
Experience in Agile framework and tools (e.g., Jira, Confluence, etc)
Basic about microsegmentation (Illumio) and secured environments and safes (Vault)
Good written and spoken English
Ability to measure and identify areas for improving Quality and overall Delivery
Capable of communicating efficiently
Good to have Skills
Knowledge of IT production backup and resilience setup (High Availability setup, Disaster Recovery Plan, etc.)
Basic knowledge of RedHat Linux administration and performance management
Experience with any cloud platform (preferably IBM Cloud).
Ability to make contact with Paris team in case of difficulties, lack of information or any other problem where getting more information could help on solving issues or risk limitation
Good Team Player
-
High-Performance Observability Engineer
4 days ago
Mumbai, Maharashtra, India beBeeObservability Full time ₹ 20,40,000 - ₹ 24,96,000Job DescriptionWe are looking for a skilled Engineer to join our team in the field of Observability. As an Engineer Site Reliability, you will be responsible for building and maintaining our platform components for Observability.You will work closely with our Lead Engineer, performance team, data ingestion, platform DevOps and data visualization teams under...
-
Senior PostgreSQL Support Engineer
2 weeks ago
Mumbai, Maharashtra, India CYBERTEC PostgreSQL Services and Support Full timeSenior PostgreSQL Support Engineer (Full time, Austria or Remote)CYBERTEC PostgreSQL is seeking a Senior PostgreSQL Support Engineer (Level 3/4 Escalation) to join our team. If you live and breathe PostgreSQL, thrive on solving the toughest performance puzzles, and enjoy mentoring the next generation of database pros, this role is for you. About CYBERTEC...
-
Observability Delivery Manager
1 week ago
Mumbai, Maharashtra, India beBeeDelivery Full time ₹ 2,00,00,000 - ₹ 2,50,00,000Job OverviewThis is a seasoned, high-impact leadership role that will lead customer implementations of our mission-critical Business Journey Observability Platform. The successful candidate will have a strong focus on large BFSI clients and will be responsible for managing complex integrations, high-SLA go-lives, and cross-functional teams across...
-
Cloud Infrastructure Architect
7 days ago
Mumbai, Maharashtra, India beBeeElk Full time US$ 1,80,000 - US$ 2,00,000Large Scale Observability Engineer OpportunityWe are seeking a skilled Large Scale Observability Engineer to join our team. As a key member of our cloud infrastructure team, you will be responsible for designing, managing, and scaling large-scale observability infrastructure using ELK clusters.This is an exciting opportunity for a seasoned engineer to take...
-
Mumbai, Maharashtra, India Tata Consultancy Services Full timeTCS Hiring for Observability Tools Tech Lead_PAN IndiaExperience: 8 to 12 Years OnlyJob Location: PAN IndiaTCS Hiring for Observability Tools Tech Lead_PAN IndiaRequired Technical Skill Set:Core Responsibilities:Designing and Implementing Observability Solutions:This involves selecting, configuring, and deploying tools and platforms for collecting,...
-
It Support Engineer
6 days ago
Mumbai, Maharashtra, India Sirius Global Full time ₹ 1,04,000 - ₹ 1,30,878 per yearRole & responsibilitiesPerform Root Cause Analysis (RCA).Incident reporting of technical issues.Problem Management and documentation.Coordinate/ Escalate unresolved issues with OEM/ Vendors/Technical team for in-depth analysis and problem resolution.Technical training to Support Technician on implemented solutions/ new observations.Management of technical...
-
Application Support
2 weeks ago
Mumbai, Maharashtra, India Menschen Consulting Pvt. Ltd. Full timeJob DescriptionApplication Support (Production) Engineer ManagerMumbai | Bachelors/Masters in Computer Science | 5+ years in production supportSeeking a skilled professional to manage Level 1 (L1) production support for a high-availability trading platform. This role demands strong technical expertise, incident management skills, and the ability to thrive in...
-
Application Support
1 week ago
Mumbai, Maharashtra, India Menschen Consulting Pvt. Ltd. Full time ₹ 15,00,000 - ₹ 25,00,000 per yearApplication Support (Production) Engineer – Manager Mumbai | Bachelor's/Master's in Computer Science | 5+ years in production supportSeeking a skilled professional to manage Level 1 (L1) production support for a high-availability trading platform. This role demands strong technical expertise, incident management skills, and the ability to thrive in a...
-
Senior Application Support Engineer
2 weeks ago
Mumbai, Maharashtra, India 3Pillar Full time US$ 90,000 - US$ 1,20,000 per yearAt 3Pillar, we focus on providing exceptional support for cutting-edge technologies that revolutionize industries. As a Senior Application Support Engineer, you'll play a crucial role in our dynamic team, contributing to critical projects that redefine urban living, establish new media channels for enterprise companies, or drive innovation in healthcare....
-
Senior Application Support Engineer
1 week ago
Mumbai, Maharashtra, India 3Pillar Global Full time US$ 90,000 - US$ 1,20,000 per yearAt 3Pillar, we focus on providing exceptional support for cutting-edge technologies that revolutionize industries. As a Senior Application Support Engineer, you'll play a crucial role in our dynamic team, contributing to critical projects that redefine urban living, establish new media channels for enterprise companies, or drive innovation in healthcare....