Principal Site Reliability Developer

3 weeks ago


hyderabad, India Oracle Full time

SaaS Cloud CPQ is seeking a motivated Site Reliability Engineer that thrives in a fast-paced rapidly evolving technology environment. This individual will be a member of the CPQ System Administration team and focused on driving for those quality standards across all projects. The purpose of this position is to support build, operations, customer support, and DevOps within the organization. As part of the CPQ System Administration group, you will be instrumental in fostering a culture of SRE for horizontal activities and DevOps for products and tools across our global operations teams. The team you work in will have diverse expertise in systems, networking, and software development to provide the stability, performance and reliability our customers need. We work with multiple service development teams, identifying cross-team issues which create risk for operations across the organization and resolving those issues with a mixture of engineering, troubleshooting expertise, and general operational guidance. Your role also requires communication and organizational skills. You are an interface between DevOps Tools, application teams that implement OCI services. You will deliver the solutions that directly contribute to our customer's success. As a member of our global team, you will:

Deploy, operate and maintain large scale cloud build in a cloud native environment Improve our offerings through performance and reliability analysis Assist in building and maintaining Cl/CD pipeline Diagnose and resolve issues across cloud services such as database, network, compute, storage, and application services such as java, Weblogic, OHS (Apache), etc.. Participate in system design consulting, platform management, and capacity planning Anticipate the future and deliver those concepts to reality Participate in a global break-fix alert calls

Key qualifications of an ideal internal candidate:

Must Have:

Good Understanding of Cloud Infrastructure and Virtual Networking Experience working in closely held/confidential environments Experience in operating Cl/CD pipelines that build and deliver services on cloud A mind focused on systems reliability, automation, and improvement Experience with desktop support, VDI and troubleshoot issues with their workstations/laptops Motivation to collaborate with your local and global teams Experience with Linux 3-8 years' experience in Systems Engineering, DevOps or SRE roles supporting large scale infrastructure, cloud or web services

Nice to have:

Proficient with Git source code management (SCM) Oracle Database Administration experience OS image build for Linux, Windows and patch automation using Python, Terraform, Ansible, PowerShell Good understanding of Agile software development principles including using common tools such as JIRA Aptitude to be a good team player and the desire to learn and implement new Cloud technologies as needed Excellent organizational, verbal, and written communication skills Experience in compute, network, storage, database troubleshooting for improving capacity, reliability, scalability, availability Experience working with fault tolerant, highly available, high throughput, distributed, scalable systems A history of working with Cl/CD related systems (Kubernetes, Terraform, or similar)

Career Level -

Work with other teams in the SaaS organization to identify cross team issues and come up with solutions that improve the customer experience and raise the application performance and SLA. Understand the end-to-end configuration, technical dependencies, and overall behavioral characteristics of application integration to customer services. Responsible for the design and delivery of the automated processes, with a focus on security, resiliency, scale, and performance. Partner with development teams in defining and implementing improvements in service architecture. Articulate technical characteristics of services and technology areas and guide Development Teams to engineer and add premier capabilities to the Oracle Cloud service portfolio. Understand and communicate the scale, capacity, security, performance attributes, and requirements of the service and technology stack. Demonstrate clear understanding of automation and orchestration principles. Act as ultimate escalation point for complex or critical issues that have not yet been documented as Standard Operating Procedures (SOPs). Utilize a deep understanding of service topology and their dependencies required to troubleshoot issues and define mitigations. Understand and explain the affect of product architecture decisions on distributed systems. Professional curiosity and a desire to a develop deep understanding of services and technologies.



  • Hyderabad, India Microsoft Full time

    Overview Every minute of every day, customers stake their entire business and reputation on the Microsoft Cloud. The Azure Customer Experience (CXP) team believes that when we meet our high standards for quality and reliability, our customers win. If we falter, our customers fail their end-customers. Our vision is to turn Microsoft Cloud customers into...


  • hyderabad, India Microsoft Full time

    Overview Every minute of every day, customers stake their entire business and reputation on the Microsoft Cloud. The Azure Customer Experience (CXP) team believes that when we meet our high standards for quality and reliability, our customers win. If we falter, our customers fail their end-customers. Our vision is to turn Microsoft Cloud customers...


  • Hyderabad, India Oracle Full time

    Job Description:  Oracle Database Engineer - Join Oracle Cloud Infrastructure and play a vital role in our rapid expansion and adoption of ground breaking cloud technologies. We're actively developing scalable public cloud services, including Compute, Storage, Networking, Governance, Database, and Load Balancing Services. If you're a skilled database...


  • hyderabad, India Oracle Full time

    Job Description:  Oracle Database Engineer - Join Oracle Cloud Infrastructure and play a vital role in our rapid expansion and adoption of ground breaking cloud technologies. We're actively developing scalable public cloud services, including Compute, Storage, Networking, Governance, Database, and Load Balancing Services. If you're a skilled database...


  • Hyderabad, India Korn Ferry Full time

    Role - Site Reliability EngineerExp - 5+ years RequiredLocation - Hyderabad ( Work from Office-Hybrid)Shift Timings - 5AM -1 PM IST We are looking for a Site Reliability Engineer with strong development background to join our team. In this role, you will be responsible for ensuring the reliability and performance of our systems. You will work closely to our...


  • Hyderabad, India Korn Ferry Full time

    Role - Site Reliability EngineerExp - 5+ years RequiredLocation - Hyderabad ( Work from Office-Hybrid)Shift Timings - 5AM -1 PM IST We are looking for a Site Reliability Engineer with strong development background to join our team. In this role, you will be responsible for ensuring the reliability and performance of our systems. You will work closely to our...


  • hyderabad, India Insight Global Full time

    Required Skills and Experience *- Bachelor's or master's degree in computer science, Software Engineering, or a related field.- Proven experience (7+ years) in SRE, automation testing- Strong skills in developing and implementing automation testing strategies and frameworks.- Solid understanding of site reliability principles and best practices.- Leadership...


  • Hyderabad, India Genpact Full time

    Genpact (NYSE: G) is a global professional services and solutions firm delivering outcomes that shape the future. Our 125,000+ people across 30+ countries are driven by our innate curiosity, entrepreneurial agility, and desire to create lasting value for clients. Powered by our purpose - the relentless pursuit of a world that works better for people - we...


  • hyderabad, India Genpact Full time

    Genpact (NYSE: G) is a global professional services and solutions firm delivering outcomes that shape the future. Our 125,000+ people across 30+ countries are driven by our innate curiosity, entrepreneurial agility, and desire to create lasting value for clients. Powered by our purpose - the relentless pursuit of a world that works better for people - we...


  • Hyderabad, India Insight Global Full time

    Required Skills and Experience *- Bachelor's or master's degree in computer science, Software Engineering, or a related field.- Proven experience (7+ years) in SRE, automation testing- Strong skills in developing and implementing automation testing strategies and frameworks.- Solid understanding of site reliability principles and best practices.- Leadership...


  • Hyderabad, India Insight Global Full time

    Required Skills and Experience * - Bachelor's or master's degree in computer science, Software Engineering, or a related field. - Proven experience (7+ years) in SRE, automation testing - Strong skills in developing and implementing automation testing strategies and frameworks. - Solid understanding of site reliability principles and best practices. -...


  • hyderabad, India Microsoft Full time

    Overview Microsoft Digital (MSD)’s  mission is to power, protect, and transform the employee experience at Microsoft across the globe. Come, build community, explore your passions, pursue your AI and ML aspirations, do your best work and be a part of the team within Microsoft’s Data Platform & Growth (DPG) organization and Experiences &...


  • Hyderabad, India Microsoft Full time

    Overview Microsoft Digital (MSD)’s  mission is to power, protect, and transform the employee experience at Microsoft across the globe. Come, build community, explore your passions, pursue your AI and ML aspirations, do your best work and be a part of the team within Microsoft’s Data Platform & Growth (DPG) organization and Experiences & Devices...


  • Hyderabad, India ValueLabs Full time

    Experienced in SRE or Site Reliability Engineer Design, implement, and maintain automated processes for deploying, monitoring, and managing applications on Azure DevOps. Collaborate with cross-functional teams to optimize system performance, reliability, and scalability. Develop and maintain tools for continuous integration, continuous deployment (CI/CD),...


  • hyderabad, India Virtusa Full time

    Site Reliability engineer - CREQ188641 Description Position : SRE Primary skills: devops CI/CD pipeline Location: Hyderabad Should have proficiency in understanding of application monitoring stack(Logs, Events, Metrics and Alerts) and ability to visualize and setup end-to-end observability.Should have proficiency in industry standard monitoring...


  • Hyderabad, India Virtusa Full time

    Site Reliability engineer - CREQ188641 Description Position : SRE Primary skills: devops CI/CD pipeline Location: Hyderabad Should have proficiency in understanding of application monitoring stack(Logs, Events, Metrics and Alerts) and ability to visualize and setup end-to-end observability. Should have proficiency in industry standard monitoring tools...


  • Hyderabad, India FedEx ACC Full time

    Skill Required: Under general supervision, assists in the development and design of deliverables that support the resolution of moderately complex problems and technical design gaps. Supports improvement initiatives that are aligned with overarching global reliability of the company‘s systems, including capacity planning, failover strategies, performance...


  • hyderabad, India ValueLabs Full time

    Experienced in SRE or Site Reliability EngineerDesign, implement, and maintain automated processes for deploying, monitoring, and managing applications on Azure DevOps.Collaborate with cross-functional teams to optimize system performance, reliability, and scalability.Develop and maintain tools for continuous integration, continuous deployment (CI/CD), and...


  • Hyderabad, India SID Global Solutions Full time

    Job Title: Site Reliability EngineerLocation: Hyderabad - OnsiteWork Mode: 5 Days Working from OfficeJOB DESCRIPTION6-7 years of experience in 24x7 support of enterprise level applicationsGraduate in Computers, Engineering or similar educational qualificationFamiliarity with Kubernetes and container orchestration.Knowledge of Apigee development tools &...


  • Hyderabad, India ValueLabs Full time

    Experienced in SRE or Site Reliability Engineer Design, implement, and maintain automated processes for deploying, monitoring, and managing applications on Azure DevOps.Collaborate with cross-functional teams to optimize system performance, reliability, and scalability.Develop and maintain tools for continuous integration, continuous deployment (CI/CD), and...