Principal Site Reliability Engineer

2 weeks ago


India Oracle Full time

We are looking for dynamic and forward-looking engineers to join our database cloud engineering team. Candidate must have Oracle Database Administration experience as a Site Reliability Engineer or DBA on large production environments. Understand the end-to-end configuration, technical dependencies, and overall behavioral characteristics of large-scale implementation of Oracle database. Engineers with good Oracle Application DBA experience will be added advantage to the role with experience in Oracle E-biz suite, Oracle Fusion Apps or Netsuite

Responsibility includes development and implementation of critical test cases with focus on security, resiliency, scale, and performance. Partner with development teams and work towards addressing and fixing production issues on cloud, defining and implementing product enhancements. Collaborate with various cloud operations teams to understand the production issues and work towards to create a reproducible test cases in the lab environment to present to the development teams.

Detailed Responsibilities

Certification of Database products for cloud integration

Design & Develop Highly Automated Multi-Tier/Multi-Stack Stress Test Suites/Workloads for Testing and Certifications Conduct extensive testing of Oracle database HA, data protection and disaster recovery. Implement and validate all Maximum Availability Architecture and how these helps to prevent, detect, tolerate and repair from various outages. Work on large engineered cloud environments on OCI and EXaCS Independently setup and configure large database environments with RAC, data-guard, and Oracle Goldengate. Independently install, configure, patch and upgrade large database clusters on Linux Develop test cases and configurations to simulate application/business critical workloads and usage scenarios. Develop and Maintain Test Specs/Plans/Methodologies and then Design, and Implement End-to-End Test Suites/Frameworks simulating Real-world production systems. Extensive upgrade and Patching testing while simulating production like load scenarios.

Research and Development

Reviewing production issues and identifying gaps in the test suite and implementing these as test cases. This may include DB Schema Design/Normalisation, Data generation, Load generation and Application/Business Logic programming in Oracle SQL, PL-SQL, Perl/Shell/Python and JDBC/JMS or Python.

Carry-out independent research and review new functionality/features in nextgen Oracle Database releases and other products. Performance Tuning and pro-active measurements of future planning. Backup & Recovery Strategy. Log and track product defects (bugs), Collaborating closely with Development teams to resolve problems encountered in these Multi-tier test simulations. Develop Automation tools, Simulation Apps and Re-usable Framework for efficient System/Stress Testing. Participate in Product Feature Review, Certification experiments and User Document reviews. Research and acquire skills on new technologies as needed from time to time

Technical qualifications

Minimum Requirements for this job role

10-14 years of Oracle database administration experience on large production environments Database hands on skills especially around database, system and administration GoldenGate setup, administration and tuning RAC setup and administration Strong Linux/UNIX OS understanding including OS Architecture & Internals (Networking, File Systems, Process/Memory Monitoring/Tuning/Linux Virtualisation etc). Programming/Scripting skills in one or more of below languages is required. Scripting - Perl / Shell / Python, Microservices/REST APIs Programming - Oracle SQL, PL/SQL, Java/JDBC, Python , ./MS in CS/ECE/EE, MCA from Reputed Engineering Colleges preferred.

Preferred Requirements for this job role

Architect and administrating large Oracle database systems Data Guard administration and tuning Experience on any ERP applications like Oracle E-biz suite, Oracle Fusion Apps or Netsuite Experience in developing test-cases, automation and orchestration

Career Level - IC4

Work with Site Reliability Engineering (SRE) team on the shared full stack ownership of a collection of services and/or technology areas. Understand the end-to-end configuration, technical dependencies, and overall behavioral characteristics of production services. Responsible for the design and delivery of the mission critical stack, with focus on security, resiliency, scale, and performance. Authority for end-to-end performance and operability. Partner with development teams in defining and implementing improvements in service architecture. Articulate technical characteristics of services and technology areas and guide Development Teams to engineer and add premier capabilities to the Oracle Cloud service portfolio. Understand and communicate the scale, capacity, security, performance attributes, and requirements of the service and technology stack. Demonstrate clear understanding of automation and orchestration principles. Act as ultimate escalation point for complex or critical issues that have not yet been documented as Standard Operating Procedures (SOPs). Utilize a deep understanding of service topology and their dependencies required to troubleshoot issues and define mitigations. Understand and explain the affect of product architecture decisions on distributed systems. Professional curiosity and a desire to a develop deep understanding of services and technologies.



  • india IKAI Technology Solutions Full time

    Company Description IKAI Technology Solutions is a leading provider of IT services, supporting businesses across various industries to harness the full potential of information technology. With extensive experience in managing the intricate systems and operations of global enterprises, IKAI is committed to revolutionizing the way businesses navigate the...


  • India IKAI Technology Solutions Full time

    Company Description IKAI Technology Solutions is a leading provider of IT services, supporting businesses across various industries to harness the full potential of information technology. With extensive experience in managing the intricate systems and operations of global enterprises, IKAI is committed to revolutionizing the way businesses navigate the...


  • India IKAI Technology Solutions Full time

    Company Description IKAI Technology Solutions is a leading provider of IT services, supporting businesses across various industries to harness the full potential of information technology. With extensive experience in managing the intricate systems and operations of global enterprises, IKAI is committed to revolutionizing the way businesses navigate the...


  • India Unilog Full time

    Job Title : Site Reliability EngineerJob Summary :As a Site Reliability Engineer (SRE) specializing in Google Cloud Platform (GCP), you will be responsible for designing, implementing, and maintaining highly scalable and reliable systems. You will collaborate with development teams to ensure that applications are designed with reliability and performance in...


  • India Serendipity Recruiting Full time

    Job Description As a Site Reliability Engineer (SRE), you will play a vital role in continuously driving improvements in observability, performance, and reliability, aiming to make a substantial impact across the federal government.Our client firmly believes that exceptional technology services are built upon exceptional individuals. For over two decades,...


  • India Circles Life Full time

    Job Description Role: Site Reliability Engineer (SRE) Title: Software Engineer II, SRE Location: Bangalore About Circles Founded in 2014, Circles is a global technology company reimagining the telco industry with its SaaS platform - Circles X, helping telco operators launch and operate successful digital brands through its offerings. ...


  • India Exoscale Full time

    Job Description Exoscale is the leading Swiss/European cloud service provider.With services covering the full cloud infrastructure spectrum - from fast deploying virtual machines to S3 compatible object storage - Exoscale provides a simple and scalable experience in order to let its clients focus on their core business.Join a dynamic working environment with...


  • India Exoscale Full time

    Job Description Exoscale is the leading Swiss/European cloud service provider.With services covering the full cloud infrastructure spectrum - from fast deploying virtual machines to S3 compatible object storage - Exoscale provides a simple and scalable experience in order to let its clients focus on their core business.As part of its ongoing efforts to grow...


  • india iScale Solutions Full time

    Job Description This is a remote position. Key Responsibilities: Design, implement, and maintain highly available and scalable infrastructure on AWS cloud platform. Develop and manage Infrastructure as Code (IaC) using Terraform for provisioning and managing cloud resources. Implement containerization strategies using Docker for packaging and deploying...


  • India Agensi Pekerjaan BTC Sdn Bhd Full time

    Job Description Open Position: Site Reliability Engineer (MNC Tech Company) A well-known MNC Tech Company is hiring Site Reliability Engineer to join them in the Kuala Lumpur office.Key responsibilities include: Develop and provide operational support for full-stack software applicationsCollaborate with development operations staff to create, monitor, and...


  • India System Soft Technologies Full time

    Title: Site Reliability Engineer 100% REMOTE The Site Reliability Engineer (SRE) is a technician who utilizes an array of skills to enhance reliability in critical customer facing digital assets. The SRE is responsible for maintaining the availability and performance of relevant systems through supporting, building, and enhancing applications, tools and...


  • India Aventurine Technologies Inc Full time

    Job Description SRE (Site Reliability Engineer) Dallas, TX – Hybrid (F2F interview will be requested) 6+ Mon Contract Note: Look for candidates with over 9+ Years' experience.Job Description (SRE) • Collaborating closely with engineering teams on building and enhancing tooling and automation solutions for faster resolution of issues impacting SLO's and...


  • india Aventurine Technologies Inc Full time

    Job Description SRE (Site Reliability Engineer) Dallas, TX – Hybrid (F2F interview will be requested)   6+ Mon Contract  Note: Look for candidates with over 9+ Years’ experience. Job Description (SRE) • Collaborating closely with engineering teams on building and enhancing tooling and automation solutions for faster resolution of issues impacting...


  • India Mobile Programming LLC Full time

    Location : Pune NP : Immediate / Serving Notice Period Years of Experience : 12+ Role : Site Reliability Engineer Mandatory Skill : Java, GCP, AWS, CICD Job Description : Requirements : Minimum 12+ years experience as a Site Reliability engineer supporting different application and application infrastructure in a Hybrid-cloud platforms with mix of...


  • india Thoucentric Full time

    Job Description Job Description:We are seeking a skilled and dedicated Site Reliability Engineer (SRE) to join our team. The SRE will be responsible for ensuring the reliability, performance, and scalability of our systems and applications. This role combines software development and systems engineering to build and run large-scale, distributed,...


  • india Azilen Technologies Full time

    Job purpose: Design & implement the best engineered technical solutions using latest technologies and tools. Who you are: Bachelors degree in Computer Science, E&C Engineering, IT Engineering or related field. (2023-2024 passout) Any professional certification in area like Cloud Administration (AWS, Azure, GCP etc.), Site Reliability Engineering, Security...


  • india Encora Inc. Full time

    Description Sr. Software Engineer (Site Reliability Engineer) Important Information Location: Ahmedabad Experience: 5+ years Job Mode: Full-time Work Mode: Remote Job Summary Working with DevOps SRE with good experience in Site Reliability Engineer. Responsibilities and Duties Design, implement, and maintain highly...


  • India Encora Inc. Full time

    Description Sr. Software Engineer (Site Reliability Engineer) Important Information Location: Ahmedabad Experience: 5+ years Job Mode: Full-time Work Mode: Remote Job Summary Working with DevOps SRE with good experience in Site Reliability Engineer. Responsibilities and Duties Design, implement, and maintain highly available and...


  • India System Soft Technologies Full time

    Title: Site Reliability Engineer 100% REMOTE Applications written in .NET (python or any other scripting would be good) we need more of a dev background then operations. Automation experience: Ansible preferred but good with Terraform as well. Doesn’t need to come from a 24x7 environment but needs to be okay working in that environment. AWS preferred but...


  • India System Soft Technologies Full time

    Title: Site Reliability Engineer100% REMOTEApplications written in .NET (python or any other scripting would be good) we need more of a dev background then operations.Automation experience: Ansible preferred but good with Terraform as well.Doesn’t need to come from a 24x7 environment but needs to be okay working in that environment.AWS preferred but any...