Principal Site Reliability Engineer

1 week ago


Delhi, India Oracle Full time
We are looking for dynamic and forward-looking engineers to join our database cloud engineering team. Candidates must have Oracle Database administration experience as a Site Reliability Engineer or DBA on large production environments, and must understand the end-to-end configuration, technical dependencies, and overall behavioral characteristics of large-scale Oracle Database implementations. Oracle Applications DBA experience with Oracle E-Business Suite, Oracle Fusion Apps, or NetSuite is an added advantage.
Responsibilities include developing and implementing critical test cases with a focus on security, resiliency, scale, and performance. Partner with development teams to address and fix production issues on cloud and to define and implement product enhancements. Collaborate with various cloud operations teams to understand production issues and create reproducible test cases in the lab environment to present to the development teams.
Detailed Responsibilities
Certification of Database products for cloud integration
Design & Develop Highly Automated Multi-Tier/Multi-Stack Stress Test Suites/Workloads for Testing and Certifications
Conduct extensive testing of Oracle database HA, data protection and disaster recovery.
Implement and validate Maximum Availability Architecture (MAA) configurations and how they help prevent, detect, tolerate, and repair various outages.
Work on large engineered cloud environments on OCI and ExaCS.
Independently set up and configure large database environments with RAC, Data Guard, and Oracle GoldenGate.
Independently install, configure, patch and upgrade large database clusters on Linux
Develop test cases and configurations to simulate application/business critical workloads and usage scenarios.
Develop and Maintain Test Specs/Plans/Methodologies and then Design, and Implement End-to-End Test Suites/Frameworks simulating Real-world production systems.
Perform extensive upgrade and patching tests while simulating production-like load scenarios.
Research and Development
Review production issues, identify gaps in the test suite, and implement them as test cases. This may include DB schema design/normalisation, data generation, load generation, and application/business logic programming in Oracle SQL, PL/SQL, Perl/Shell/Python, and JDBC/JMS.
Carry out independent research and review new functionality/features in next-generation Oracle Database releases and other products.
Performance tuning and proactive measurement for future planning.
Backup & Recovery Strategy.
Log and track product defects (bugs), collaborating closely with development teams to resolve problems encountered in these multi-tier test simulations.
Develop Automation tools, Simulation Apps and Re-usable Framework for efficient System/Stress Testing.
Participate in Product Feature Review, Certification experiments and User Document reviews.
Research and acquire skills on new technologies as needed from time to time
Technical qualifications
Minimum Requirements for this job role
10-14 years of Oracle database administration experience on large production environments
Hands-on database skills, especially in database and system administration
GoldenGate setup, administration and tuning
RAC setup and administration
Strong Linux/UNIX OS understanding including OS Architecture & Internals (Networking, File Systems, Process/Memory Monitoring/Tuning/Linux Virtualisation etc).
Programming/scripting skills in one or more of the languages below are required.
Scripting - Perl / Shell / Python, Microservices/REST APIs
Programming - Oracle SQL, PL/SQL, Java/JDBC, Python
, ./MS in CS/ECE/EE, MCA from Reputed Engineering Colleges preferred.
Preferred Requirements for this job role
Architecting and administering large Oracle database systems
Data Guard administration and tuning
Experience with ERP applications such as Oracle E-Business Suite, Oracle Fusion Apps, or NetSuite
Experience in developing test-cases, automation and orchestration
Career Level - IC4
Work with the Site Reliability Engineering (SRE) team on the shared full stack ownership of a collection of services and/or technology areas. Understand the end-to-end configuration, technical dependencies, and overall behavioral characteristics of production services. Responsible for the design and delivery of the mission critical stack, with a focus on security, resiliency, scale, and performance. Authority for end-to-end performance and operability. Partner with development teams in defining and implementing improvements in service architecture. Articulate technical characteristics of services and technology areas and guide development teams to engineer and add premier capabilities to the Oracle Cloud service portfolio. Understand and communicate the scale, capacity, security, performance attributes, and requirements of the service and technology stack. Demonstrate a clear understanding of automation and orchestration principles. Act as the ultimate escalation point for complex or critical issues that have not yet been documented as Standard Operating Procedures (SOPs). Utilize a deep understanding of service topology and dependencies to troubleshoot issues and define mitigations. Understand and explain the effect of product architecture decisions on distributed systems. Professional curiosity and a desire to develop a deep understanding of services and technologies.

  • Delhi, India Cricbuzz.com Full time

    Site Reliability Engineer. We are looking for a highly skilled and motivated Web Server Site Reliability Engineer to join our team. As a Web Server Site Reliability Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our web server infrastructure and CDN services. Experience: 3-5 years. Responsibilities: ● Design,...


  • Delhi, India Career Stone Consultant Full time

    PRINCIPAL ACCOUNTABILITIES: 1. AWS Infrastructure Design: o Lead the design and implementation of scalable, reliable, and secure AWS infrastructure. o Provide expertise in architecting solutions that maximize the benefits of AWS services. o Lead the upgrade of Apache web servers for improved performance and security. o Oversee the database (DB) upgrade process,...


  • Delhi, India ViewSonic Full time

    Job Requirements: Bachelor’s degree in Computer Science, Engineering, or a related field. 3+ years of experience as a Site Reliability Engineer, DevOps Engineer, or similar role. Proficient in AWS solutions including but not limited to EC2, S3, CloudWatch, Lambda, and RDS. Strong understanding of Platform Engineering concepts and principles. Experience with...


  • Delhi, India Korn Ferry Full time

    Role - Site Reliability Engineer. Exp - 5+ years required. Location - Hyderabad (Work from Office-Hybrid). Shift Timings - 5 AM - 1 PM IST. We are looking for a Site Reliability Engineer with a strong development background to join our team. In this role, you will be responsible for ensuring the reliability and performance of our systems. You will work closely to our...


  • Delhi, India SID Global Solutions Full time

    Dear Candidates, we are looking for immediate joiners with 8 to 9 years of experience for our Hyderabad location: a talented Site Reliability Engineer-Manager to join our dynamic team and contribute to the development of our cutting-edge web applications. If you're passionate about the role and have experience in SRE, GCP and Kubernetes, send me your updated CV: Please...


  • Delhi, India Daxko Full time

    Company Description: Daxko powers health & wellness throughout the world. Every day our team members focus their passion and expertise in helping health & wellness facilities operate efficiently and engage their members. Whether a neighborhood yoga studio, a national franchise with locations in every city, a YMCA or JCC--and every type of organization in...


  • Delhi, India SID Global Solutions Full time

    Dear Candidates, we are looking for immediate joiners with 8 to 9 years of experience for our Hyderabad location: a talented Site Reliability Engineer-Manager to join our dynamic team and contribute to the development of our cutting-edge web applications. If you're passionate about the role and have experience in SRE, GCP and Kubernetes, send me your updated CV: find below the...


  • Delhi, Delhi, India Serendipity Recruiting Full time

    Job Description: As a Site Reliability Engineer (SRE), you will play a vital role in continuously driving improvements in observability, performance, and reliability, aiming to make a substantial impact across the federal government. Our client firmly believes that exceptional technology services are built upon exceptional individuals. For over two decades, our...


  • Delhi, India Quiktrak, LLC Full time

    Job Title: Azure Site Reliability Engineer (SRE) / DevOps Engineer. Job Description: Summary: As an Azure Site Reliability Engineer (SRE) / DevOps Engineer, you will be responsible for ensuring the reliability, scalability, and performance of our cloud infrastructure on the Azure platform. This role involves managing deployments, implementing continuous...


  • Delhi, Delhi, India Exoscale Full time

    Job Description: Exoscale is the leading Swiss/European cloud service provider. With services covering the full cloud infrastructure spectrum - from fast deploying virtual machines to S3 compatible object storage - Exoscale provides a simple and scalable experience in order to let its clients focus on their core business. Join a dynamic working environment with...


  • Delhi, India System Soft Technologies Full time

    Title: Site Reliability Engineer 100% REMOTE The Site Reliability Engineer (SRE) is a technician who utilizes an array of skills to enhance reliability in critical customer facing digital assets. The SRE is responsible for maintaining the availability and performance of relevant systems through supporting, building, and enhancing applications, tools and...


  • Delhi, India System Soft Technologies Full time

    Title: Site Reliability Engineer. 100% REMOTE. The Site Reliability Engineer (SRE) is a technician who utilizes an array of skills to enhance reliability in critical customer facing digital assets. The SRE is responsible for maintaining the availability and performance of relevant systems through supporting, building, and enhancing applications, tools and...


  • Delhi, India WaferWire Cloud Technologies Full time

    Role: SRE (Site Reliability Engineer). Experience: 4+ Years. About WaferWire Cloud Technologies: WaferWire Cloud Technologies is a leading provider of innovative cloud solutions aimed at transforming businesses and driving digital growth. With a focus on cutting-edge technology and customer-centric approaches, we empower organizations to thrive in the digital...


  • Delhi, India System Soft Technologies Full time

    Title: Site Reliability Engineer. 100% REMOTE. Applications written in .NET (Python or any other scripting would be good); we need more of a dev background than operations. Automation experience: Ansible preferred but good with Terraform as well. Doesn’t need to come from a 24x7 environment but needs to be okay working in that environment. AWS preferred but any...


  • Delhi, Delhi, India WaferWire Cloud Technologies Full time

    Role: SRE (Site Reliability Engineer). Experience: 4+ Years. About WaferWire Cloud Technologies: WaferWire Cloud Technologies is a leading provider of innovative cloud solutions aimed at transforming businesses and driving digital growth. With a focus on cutting-edge technology and customer-centric approaches, we empower organizations to thrive in the digital era....


  • New Delhi, India dentsu Full time

    The purpose of this role is to ensure the availability and stability of production and test platforms. Job Title: Site Reliability Engineer. Job Description: Key responsibilities: Troubleshoots and owns issues in our development, test and production environments, including performance optimisation and continuous tuning. Works alongside the DevOps team in...


  • Delhi, India System Soft Technologies Full time

    Job Summary: The Site Reliability Engineer (SRE) is a technician who utilizes an array of skills to enhance reliability in critical customer facing digital assets. The SRE is responsible for maintaining the availability and performance of relevant systems through supporting, building, and enhancing applications, tools and engaging with infrastructure teams....


  • Delhi, India Next-Link Full time

    Job Description: Senior Site Reliability Engineer. Desirable Skills: Experience with additional programming languages and technologies beyond Python and Ruby. Familiarity with cloud platforms such as AWS, Azure, or GCP. Proficiency in additional logging and monitoring tools. Experience with other Infrastructure as Code (IaC) tools and practices. Knowledge of...


  • Delhi, Delhi, India SkySys Full time

    Role: Site Reliability Engineer (SRE). Position Type: Full-Time Contract (40 hrs/week). Contract Duration: Long Term. Work Time Zone: IST. Work Schedule: 8 hours/day (Mon-Fri). Location: 100% remote (candidate can work from anywhere in India). Must haves: Monitoring and deploying .NET applications. Maintaining code, writing scripts. Monitor application...