Senior Reliability Engineer
2 weeks ago
Role Profile In this role you ll be joining our Corporate Engineering Reliability Engineering Enablement within First Level Support team as a Senior Reliability Engineer This team serves as a link between Corporate Engineering LSEG central engineering operations and compliance teams with the key focus of crafting the monitoring view for CT building the new L1 team driving the team with building all required documentation and continuous improvements First Level Support Engineer do this by 24 7 support Ensuring an acceptable level of IT operations performance and error risk in line with service level agreements Monitor all alerts and report any issue that may significantly impact the business Follow established set of processes while handling issues and for proper customer concern of unresolved issues to appropriate internal external teams Continuously review operating metrics and data to find opportunities to improve Monitors production servers scheduled jobs and incident both from monitoring set up end users analyse them respond with a solution or call out them to relevant teams for further investigation Ensuring accurate tooling standards are being followed as well as automation of operational tasks Detailed Responsibilities We engage and collaborate with global teams in time zones spanning UK US APAC areas Partner with Capability Business REE squads and Technology teams globally to understand and fix complex problems which improves product availability Manage and improve production Environment We provide 24 7 Production support eyes on glass monitoring and candidate has to cover 3 shifts on rotational basis Performs independently and in team environment Enabling monitoring for all the application batch jobs and servers in Production environment and excellent troubleshooting skills to take necessary action in fixing the issue Participate in transition of technology and services into production support Performs day to day operations tasks for management of production environment Maintain services once they are live by measuring and monitoring availability latency and overall system health Responsible for configuration maintenance and changes in Unix and Cloud environment Strong presentation skills written and oral with analytical and negotiation skills The ability to perform a multi-task job and adjust priorities as appropriate Possess excellent time management skills Able to take initiatives that meet deadlines and ability to prioritize workload with minimal direction In-depth knowledge of computer hardware systems including circuit boards memory modules and processors Ability to effectively communicate to various support teams and Leadership teams Coordination implementation of infrastructure changes upgrades patches maintenance etc Knowledge of ITIL production support standard processes Good Knowledge in Unix Linux OS and shell scripting Control M monitoring jobs monitoring Problem solving critical thinking Ability to multitask following priorities and adhering to SLA Troubleshooting Application and Batch job issues Ability to handle the Major Bridge calls of customer impacting incidents Provide a high level of customer service whilst working under pressure Setting up alarm view working with capabilities monitor them as per agreement and report any issue that may significantly impact the business We follow and adhere to established Incident Change SR and Problem Management procedures Excellent track record of continuous service improvements contributing to the goals and objectives rolling up to organization To ensure knowledge Articles of all supported systems is continually updated Tech Profile Essential Skills 7 year administrative and hands on experience in Windows Linux Servers Cloud services AWS Azure GCP Proficient in Power Apps Power Automate SharePoint Power Platform Power BI Understanding of Microsoft Dataverse data modelling management Role-based access in Dataverse security roles and solutions concept in Power Platform Preferred Skills and Experience Leading incidents change requests service requests and driving Technical Recovery calls Strong solving skills on these platforms Minimum knowledge and understanding of financial markets are desirable Ability to work independently and in a team environment Effective communication in English with all levels of staff both orally and written Prioritise our work multitask to meet tight deadlines without losing focus on overall goals under minimum supervision Highly motivated self-directed individual with a positive pro-active attitude to work Customer and service focused with determination to meet their needs and expectations Be driven and committed to the goals and objectives of the team and organization Ability to provide Production support as per shift rota and Point of escalation in event of a major incident Education and Professional Skills Professional qualification on BS MS degree in Computer Science Software Engineering or STEM degree Desirable Proven English reading writing capability alongside good communication articulation skills Curious about new technologies and tools creative thinking and initiative taking Deep technical understanding of development and systems platform engineering Good interpersonal skills Customer centricity Strong collaborator and partner relationship Strategic ability to define technical direction in the portfolio of products including technical strategies and plans and execution plans LSEG is a leading global financial markets infrastructure and data provider Our purpose is driving financial stability empowering economies and enabling customers to create sustainable growth Our purpose is the foundation on which our culture is built Our values of Integrity Partnership Excellence and Change underpin our purpose and set the standard for everything we do every day They go to the heart of who we are and guide our decision making and everyday actions Working with us means that you will be part of a dynamic organisation of 25 000 people across 65 countries However we will value your individuality and enable you to bring your true self to work so you can help enrich our diverse workforce You will be part of a collaborative and creative culture where we encourage new ideas and are committed to sustainability across our global business You will experience the critical role we have in helping to re-engineer the financial ecosystem to support and drive sustainable economic growth Together we are aiming to achieve this growth by accelerating the just transition to net zero enabling growth of the green economy and creating inclusive economic opportunity LSEG offers a range of tailored benefits and support including healthcare retirement planning paid volunteering days and wellbeing initiatives We are proud to be an equal opportunities employer This means that we do not discriminate on the basis of anyones race religion colour national origin gender sexual orientation gender identity gender expression age marital status veteran status pregnancy or disability or any other basis protected under applicable law Conforming with applicable law we can reasonably accommodate applicants and employees religious practices and beliefs as well as mental health or physical disability needs Please take a moment to read this carefully as it describes what personal information London Stock Exchange Group LSEG we may hold about you what its used for and how its obtained If you are submitting as a Recruitment Agency Partner it is essential and your responsibility to ensure that candidates applying to LSEG are aware of this privacy notice
-
Senior Site Reliability Engineer
3 weeks ago
India BQE Software Full timeWe are seeking a Senior Site Reliability Engineer to lead reliability efforts across our application stack, focusing on high availability, performance, and scalability.This role will own the health and uptime of our mission-critical application , Cloud infrastructure , database system , and monitoring infrastructure . About Us At BQE, our mission...
-
Senior Site Reliability Engineer- ELK Expert
4 weeks ago
India iVedha Inc. Full timeSenior Site Reliability Engineer (SRE) – ELK Expert | Platform Engineering PracticeLocation: India (Remote) - Must be available to work in the EST (US/Canada) Time Zone.Role Summary:Are you a Senior Site Reliability Engineer (SRE) with deep ELK expertise, ready to take ownership of large-scale observability infrastructure?We're looking for an SRE with 7+...
-
Senior Site Reliability Engineer
10 hours ago
India Cimpress Full timeSenior Site Reliability EngineerWho We Are:Cimpress Technology develops cutting-edge, best-in-world software that our mass customization businesses use to create personalized products for over 17 million global customers. Our Mass Customization Platform consists of modular, multi-tenant services. Our businesses can choose the solutions that work for them, or...
-
Site Reliability Engineering Manager
4 weeks ago
India CloudHire Full timeJob SummaryThe Technical Manager for Site Reliability Engineering (SRE) will lead a remote team of Site Reliability Engineers, ensuring operational excellence and fostering a high-performing team culture. Reporting to the US-based Director of Systems and Security, this role is responsible for overseeing day-to-day operations, technical mentorship, and...
-
Senior Site Reliability Engineer
3 days ago
India beBeeCloud Full time £ 1,00,870 - £ 1,41,290**Reliable Engineering Expert Wanted**Key Responsibilities:Collaborate with operations and development teams to automate repetitive tasks.Ensure end-to-end quality for application migrations to cloud, from high-quality documentation to observability flows, to operational acceptance testing and beyond.Craft reusable dashboards and alerting pipelines from raw...
-
Reliability Engineer
2 weeks ago
India London Stock Exchange Group Full timeRole Profile In this role you ll be joining our Corporate Engineering Reliability Engineering Enablement within First Level Support team as a Reliability Engineer This team serves as a link between Corporate Engineering LSEG central engineering operations and compliance teams with the key focus of crafting the monitoring view for CT building the new...
-
Site Reliability Engineer
10 hours ago
India Xebia Full timeWe are looking for a highly skilled AWS Engineer with strong Python development and Chaos Engineering expertise to design, build, and validate resilient, scalable, and automated cloud-native environments. The ideal candidate will combine cloud engineering, DevOps, and chaos experimentation to improve reliability, fault tolerance, and operational efficiency...
-
Senior Site Reliability Engineer
2 weeks ago
India MindBrain Full timePosition SITE Reliability Engineer Budget- 1.7 LPMExp- 10 yrsDuration- 6 monthsTechnical Skills:Programming: Proficiency in languages like Python.Operating Systems: Deep understanding of Linux/Windows operating systems and networking concepts. Cloud Technologies: Experience with Azure including services, architecture, and best practices. Containerization and...
-
Senior Cloud Reliability Engineer
3 hours ago
India beBeeCloudReliability Full time ₹ 1,80,00,000 - ₹ 2,20,00,000Key Responsibilities:Design and implement scalable, secure, and cost-efficient cloud infrastructure.Automate infrastructure provisioning using Terraform / CloudFormation and AWS SDKs.Manage containerized workloads (Docker, Kubernetes) and automate deployment utilities.Develop automation scripts and tooling using Python (Boto3).Implement monitoring/alerting...
-
Site Reliability Engineer
14 hours ago
Remote, India Rackspace Technology Full timeJob DescriptionSite Reliability Engineer / Observability EngineerPublic Cloud - Offerings and Delivery - Workforce Mgmt & Delivery Ops /Full - Time / RemoteRackspace is building up its Professional Services Center of Excellence on Application Performance Monitoring Suites.If you enjoy solving complex business problems and can contribute to building next...