Manager - SRE

3 months ago


Hyderabad, India PepsiCo Full time
Overview

Event Management: Set up and manage monitoring tools to track MS Power BI and downstream Application performance andhealth. Monitor, maintain, and optimize Power BI dashboards to ensure they are functioning correctly and efficiently. Generate reports and provide insights performance, incidents, and improvements. Collaborate with cross-functional teams to implement preventive measures and address emerging concernspromptly.Incident Management: Respond to and manage incidents related to Power BI and associated downstream systems, investigate andtrack incidents to resolution in a timely manner and within predefined SLAs. Perform Root Cause Analysis (RCA) to the underlying causes of issues. Implement long-term solutions toprevent recurrence of incidents. Support and maintain Azure Data Factory pipelines, ensuring data ingestion and transformation processes runsmoothly. Monitor and troubleshoot Databricks environments, optimizing performance and resolving any issues thatarise. Manage and maintain UiPath automation workflows, ensuring they operate reliably and efficiently. Execute and document post-incident summaries, root cause analysis and mitigation protocols to lessen thelikelihood of repeat incidents.Collaboration and Communication: Execute the communication of incidents to relevant stakeholders, relaying information on business impact,risks, prioritization, mitigation, and estimated time to resolution. Participate in on-call rotations to provide timely responses to production incidents and contribute to swiftissue & Release Management: Execute Service Introduction & Service Acceptance process, to validate and test the Business Application (MSPower BI & digital products) prior to production deployment / redeployment. Deployment of products and enhancements with minimal disruption to production systems.Knowledge Management: Documentation of processes, procedures, standards, and SLAs of S&T BI & Reporting Services in ServiceKnowledge Management System (SKMS)Continual Service Improvement: Continually seek opportunities for improvement, automate repetitive tasks and reduce manual intervention.

Responsibilities

Qualification:Education: Degree in Computer Science, Computer Engineering, or related field preferredExperience: 6-9 years of experience; with minimum 3+ years of experience in Site Reliability (SRE roles) / IT ApplicationSupport role Candidates must have strong background in supporting and managing MS Power BI Application and hands onexperience with least one of the specified technologies (Azure Data Factory, Databricks, Uipath). Candidate must have proven experience as a Site Reliability Engineer, DevOps Engineer, or similar role. Experience in Developing and implementing automation scripts and tools to improve Application reliabilityand operational efficiency. Candidate must be willingness to be an integral part of the Production Support team, to work in UK, US shifthours and weekend shift on rotation. Candidate must demonstrate a willingness to learn and adapt to new technologies as needed.Preferred Qualifications: Certifications in Azure, Power BI, or related technologies. Experience with CI/CD pipelines and infrastructure as code (IaC) tools. Familiarity with ITIL practices and principles.Technical Skills: Candidate must have experience with monitoring and logging tools such as Azure Monitor, Prometheus,Grafana, or similar. Strong understanding of cloud platforms, particularly Microsoft Azure. Proficiency in scripting languages such as Python, PowerShell, or Skills: Ability to work is a fast paced, agile environment with large cross-functional teams. Ability to manage multiple priorities at the same time. Strong problem-solving skills and the ability to work under pressure. Excellent interpersonal and communication skills, both written and verbal Attention to detail and a proactive approach to identifying and resolving issues.

Qualifications

.


  • Sre Implementation

    3 months ago


    Hyderabad, India Alignity Solutions Full time

    Do you love a career where you Experience, Grow & Contribute at the same time, while earning at least 10% above the market? If so, we are excited to have bumped onto you. Learn how we are redefining the meaning of work, and be a part of the team raved by Clients, Job-seekers and Employees. Jobseeker Video Testimonials Employee Glassdoor Reviews If you...

  • SRE Engineer

    2 months ago


    Hyderabad, India Virtusa Full time

    SRE Engineer - CREQ196069 Description Requirements: Bachelors degree, preferably in Computer Science, or relevant work experience. 3+ years of experience as an SRE. Wants to automate everything. Familiarity in ITIL or other Information Technology operations foundations. Interest and ability in mentoring staff - improving technical capabilities. Knowledge...

  • Sre/devops

    4 months ago


    Hyderabad, India Luxoft Full time

    **Project** Description**: Group Technology and Operations (T&O) enables and empowers the bank with an efficient, nimble and resilient infrastructure through a strategic focus on productivity, quality & control, technology, people capability and innovation. In Group T&O, we manage the majority of the Bank's operational processes and inspire to delight our...


  • Hyderabad, India ADP Pvt Ltd - India Full time

    As a Senior SRE, you will be part of our global team and is expected to actively work in SRE team to design, implement CICD pipeline and Cloud & Database Initiatives, working side-by-side with our Architecture & Application Development teams. This position is responsible for developing innovative, flexible and scalable solutions, driving the automation and...

  • SRE Devops aws

    2 months ago


    Hyderabad, India Virtusa Full time

    SRE Devops aws - CREQ194972 Description System Reliability Design, build, and maintain reliable and scalable infrastructure solutions to support our applications and services. Automation: Develop automation tools and processes to improve efficiency, streamline operations, and reduce manual intervention. Monitoring and Alerting: Implement robust monitoring...


  • Hyderabad, India Bristol Myers Squibb Full time

    Description Working With Us Challenging. Meaningful. Life-changing. Those aren’t words that are usually associated with a job. But working at Bristol Myers Squibb is anything but usual. Here, uniquely interesting work happens every day, in every department. From optimizing a production line to the latest breakthroughs in cell therapy, this is...

  • Cloud SRE

    2 months ago


    Hyderabad, India Experian Full time

    Job DescriptionRole SummaryThe Cloud SRE would play a key role in project delivery with hands on experience in managing enterprise cloud platform. The role will work closely to design and build automated operational processes focusing on scalable cloud deployments with paradigm of infrastructure as a code.Knowledge, Skills and ExperienceA strong background...


  • Hyderabad, India Thomson Reuters Full time

    We are looking for a Senior SRE Engineer to join our Product Reliability and Operations team. We provide innovative evidence management software to some of the world's leading law firms, governments and corporations. We are leaders in legal digital transformation around the world.  As a Senior SRE Engineer you will be part of the core Platform...

  • Cloud & SRE Engineer

    3 months ago


    Hyderabad, India Experian Full time

    Job Description Cloud Site Reliability Engineers should have knowledge of automating Infrastructure creation, deployment, and management in private, public and hybrid cloud models. You will have a solid grasp of automating Infrastructure within AWS using Terraform and Ansible as well as proficiency in other scripting languages such Python and...


  • Hyderabad, India Persistent Systems Full time

    About Position:As a Site Reliability Manager, you will play a pivotal role in ensuring the scalability, performance, and reliability of systems. Responsible for ensuring the scalability, performance, and reliability of our software systems. You will work closely product development team to design, build, and maintain the infrastructure and tools needed to...


  • Hyderabad, India Genpact Full time

    Genpact (NYSE: G) is a global professional services and solutions firm delivering outcomes that shape the future. Our 125,000+ people across 30+ countries are driven by our innate curiosity, entrepreneurial agility, and desire to create lasting value for clients. Powered by our purpose – the relentless pursuit of a world that works better for people –...


  • Hyderabad, India Talentiser Full time

    The SRE team with our client works across Engineering disciplines and Product Management to continually improve and support our product and services. The candidate will be responsible for defining the functional responsibilities of the SRE Platform Operations team, the team's operating model, standards, practices and help to instill observability-first...


  • hyderabad, India LivePerson, Inc Full time

    Overview: LivePerson is looking for a Devops engineer for the GPT (Global Product & Technology) Division. You will be part of the LiverPerson SRE team building and managing highly available, distributed systems. You will have the opportunity to be part of a strong team and enjoy the work environment of a start-up, with a robust product and the...


  • Hyderabad, India LivePerson, Inc Full time

    Overview: LivePerson is looking for a Devops engineer for the GPT (Global Product & Technology) Division. You will be part of the LiverPerson SRE team building and managing highly available, distributed systems. You will have the opportunity to be part of a strong team and enjoy the work environment of a start-up, with a robust product and the benefits...


  • hyderabad, India LivePerson, Inc Full time

    Overview: LivePerson is looking for a Devops engineer for the GPT (Global Product & Technology) Division. You will be part of the LiverPerson SRE team building and managing highly available, distributed systems. You will have the opportunity to be part of a strong team and enjoy the work environment of a start-up, with a robust product and the benefits...

  • Software Engineer III

    2 months ago


    Hyderabad, India NCR Corporation Full time

    About NCR VOYIX NCR VOYIX Corporation (NYSE: VYX) is a leading global provider of digital commerce solutions for the retail, restaurant and banking industries. NCR VOYIX is headquartered in Atlanta, Georgia, with approximately 16,000 employees in 35 countries across the globe. For nearly 140 years, we have been the global leader in consumer transaction...


  • Hyderabad, India Persistent Systems Full time

    About Position: As a Site Reliability Manager, you will play a pivotal role in ensuring the scalability, performance, and reliability of systems. Responsible for ensuring the scalability, performance, and reliability of our software systems. You will work closely product development team to design, build, and maintain the infrastructure and tools needed...


  • hyderabad, India Persistent Systems Full time

    About Position: As a Site Reliability Manager, you will play a pivotal role in ensuring the scalability, performance, and reliability of systems. Responsible for ensuring the scalability, performance, and reliability of our software systems. You will work closely product development team to design, build, and maintain the infrastructure and tools needed to...


  • Hyderabad, India Persistent Systems Full time

    About Position: As a Site Reliability Manager, you will play a pivotal role in ensuring the scalability, performance, and reliability of systems. Responsible for ensuring the scalability, performance, and reliability of our software systems. You will work closely product development team to design, build, and maintain the infrastructure and tools needed to...


  • Hyderabad, India Persistent Systems Full time

    About Position: As a Site Reliability Manager, you will play a pivotal role in ensuring the scalability, performance, and reliability of systems. Responsible for ensuring the scalability, performance, and reliability of our software systems. You will work closely product development team to design, build, and maintain the infrastructure and tools needed to...