Current jobs related to Database Reliability Engineer - Hyderabad, Telangana - Splunk


  • Hyderabad, Telangana, India Splunk Inc Full time

    Database Reliability EngineerSplunk Inc is seeking a highly skilled Database Reliability Engineer to join our team. As a key member of our infrastructure software engineering team, you will be responsible for designing, implementing, and operating large-scale distributed data stores and streaming services.Key ResponsibilitiesDesign and implement new...


  • Hyderabad, Telangana, India Apps Associates Full time

    AWS Database Reliability EngineerAbout the Role:We are seeking a highly skilled AWS Database Reliability Engineer to join our team at Apps Associates. As a key member of our technology team, you will be responsible for managing and maintaining database systems in AWS, ensuring high availability, performance, and security.Key Responsibilities:Manage and...


  • Hyderabad, Telangana, India Splunk Inc Full time

    RoleYou will help us run one of the largest and most sophisticated cloud-scale, big data, and microservices platforms in the world. As a Database Reliability Engineer at Splunk, you will be responsible for enabling developers to operate highly available, scalable, and cost-efficient applications with low operational burden by managing and improving the...


  • Hyderabad, Telangana, India Selsoft Full time

    Senior Database Reliability EngineerAbout the Role:We are seeking a highly skilled Senior Database Reliability Engineer to join our team at Selsoft. As a key member of our engineering team, you will be responsible for ensuring the availability, scalability, and performance of our database systems.Key Responsibilities:Design, build, and maintain core database...


  • Hyderabad, Telangana, India Splunk Inc Full time

    RoleYou will help us run one of the largest and most sophisticated cloud-scale, big data, and microservices platforms in the world. You will be responsible for enabling developers to operate highly available, scalable, and cost-efficient applications with low operational burden by managing and improving the reliability and resiliency of SRE-managed services...


  • Hyderabad, Telangana, India LivePerson, Inc Full time

    Job Summary:LivePerson, Inc. is seeking a skilled Database Reliability Engineer II to join our team. As a key member of our database team, you will be responsible for designing and implementing highly-available and fault-tolerant cloud-based databases, as well as collaborating with the team to deliver innovative solutions.Key Responsibilities:Design and...


  • Hyderabad, Telangana, India Apps Associates Full time

    AWS Database Reliability EngineerExperience: – yearsLocation: Remote/HyderabadShift Timings: PM IST to AM ISTManage/maintain/monitor database systems in AWS.Perform routine audits of DBMS systems in AWS.Performance tune DBMS systems in AWS.Maintain and report system performance statistics.Support automation scripting concepts for AWS (CLI, Terraform,...


  • Hyderabad, Telangana, India LivePerson, Inc Full time

    Job Summary:LivePerson, Inc. is seeking a skilled Database Reliability Engineer II to join our team. As a key member of our database team, you will be responsible for designing and implementing highly available and fault-tolerant cloud-based databases.Key Responsibilities:Design and implement new cloud-based databases and redesign existing systems to ensure...


  • Hyderabad, Telangana, India Splunk Inc Full time

    RoleYou will help us run one of the largest and most sophisticated cloud-scale, big data, and microservices platforms in the world.As a highly skilled systems engineer, you will be responsible for enabling developers to operate highly available, scalable, and cost-efficient applications with low operational burden.By managing and improving the reliability...


  • Hyderabad, Telangana, India Splunk Inc Full time

    RoleYou will help us run one of the largest and most sophisticated cloud-scale, big data, and microservices platforms in the world. This requires enabling developers to operate highly available, scalable, and cost-efficient applications with low operational burden by managing and improving the reliability and resiliency of SRE-managed services and...


  • Hyderabad, Telangana, India Apps Associates Full time

    AWS Database Reliability EngineerExperience: – yearsLocation: Remote/HyderabadShift Timings: PM IST to AM ISTManage/maintain/monitor database systems in AWS.Perform routine audits of DBMS systems in AWS.Performance tune DBMS systems in AWS.Maintain and report system performance statistics.Support automation scripting concepts for AWS (CLI, Terraform,...


  • Hyderabad, Telangana, India 2104 Merative Technologies India Private Limited Full time

    Job SummaryAt 2104 Merative Technologies India Private Limited, we are seeking an experienced Database Systems Engineer to join our team. In this role, you will be responsible for designing, implementing, and maintaining Oracle databases to ensure high availability and reliability.Administer and maintain Oracle databases to ensure optimal performance and...


  • Hyderabad, Telangana, India LivePerson, Inc Full time

    Evolve Our Database SystemsLivePerson is a leader in enterprise customer conversations, and we're seeking a skilled Database Engineer II to help us scale our database and data storage systems. As a key member of our team, you'll design and architect highly-available and fault-tolerant cloud-based databases, collaborating with cross-functional teams to...


  • Hyderabad, Telangana, India SINGLE POINT TECHNOLOGIES PRIVATE LIMITED Full time

    Job Title: Site Reliability EngineerAbout the Role:We are seeking a skilled Site Reliability Engineer to join our team at Single Point Technologies Private Limited. As a Site Reliability Engineer, you will play a critical role in ensuring the reliability, performance, and security of our cloud-based product suite.Key Responsibilities:* Design and implement...


  • Hyderabad, Telangana, India Zeta Services Inc. Full time

    About ZetaZeta is a cutting-edge banking technology company that empowers financial institutions to innovate and grow. Our flagship platform, Zeta Tachyon, is a modern, cloud-native, and fully API-enabled stack that streamlines banking operations.Job SummaryWe are seeking a skilled Data Reliability Engineer II to join our team. As a key member of our...


  • Hyderabad, Telangana, India FactSet Full time

    Job Title: Lead Site Reliability EngineerAt FactSet, we're seeking a highly skilled Lead Site Reliability Engineer to join our team. As a key member of our infrastructure team, you will be responsible for designing, implementing, and maintaining highly available and scalable architectures for our applications and infrastructure.Key...


  • Hyderabad, Telangana, India Experian Full time

    Job Title: Site Reliability EngineerJob Summary:Experian is seeking a highly skilled Site Reliability Engineer to join our team. As a Site Reliability Engineer, you will be responsible for ensuring the reliability, performance, and scalability of our AWS platform.Key Responsibilities:Optimize microservice and serverless processes on robust distributed...


  • Hyderabad, Telangana, India Micron Full time

    Transforming Information into IntelligenceMicron Technology is a world leader in innovating memory and storage solutions that accelerate the transformation of information into intelligence.As a Solid State Drive Quality and Reliability Test Development Engineer at Micron Technology, you will be responsible for designing, developing, and debugging complex...


  • Hyderabad, Telangana, India VOZIQ AI Full time

    Role : Manager - Database EngineeringJob Location : Hyderabad - INDIAAbout Company : VOZIQ AI is a leading provider of AI-powered Customer Lifecycle Management Solutions to help recurring revenue businesses maximize customer lifetime value. We are working with leading brands to help them improve customer retention, optimize prices, improve NPS and increase...


  • Hyderabad, Telangana, India Micron Full time

    Transforming Information into IntelligenceAt Micron Technology, we're redefining innovation with the world's most advanced memory and semiconductor technologies. As a Non-Volatile Memory QRA Solid State Drive (SSD) Quality and Reliability Test Development Engineer, you'll play a crucial role in designing, developing, and debugging complex test programs to...

Database Reliability Engineer

3 months ago


Hyderabad, Telangana, India Splunk Full time

Join us as we pursue our ground-breaking vision to make machine data accessible, usable, and valuable to everyone. We are a company filled with people who are passionate about our product and seek to deliver the best experience for our customers. At Splunk, we are committed to our work, customers, having fun, and most significantly to each other's success.

The Splunk Observability Cloud ) provides full-fidelity monitoring and troubleshooting across infrastructure, applications, and user interfaces, in real-time and at any scale, to help our customers keep their services reliable, innovate faster, and deliver great customer experiences. Infrastructure Software Engineers at Splunk are cloud-native systems engineers who use infrastructure-as-code, microservices, automation, and efficient design to build, operate, and scale our products.

Role

You will help us run one of the largest and most sophisticated cloud-scale, big data, and microservices platforms in the world. You will be responsible for enabling developers to operate highly available, scalable, and cost-efficient applications with low operational burden by managing and improving the reliability and resiliency of SRE-managed services and infrastructure. You thrive on automation, infrastructure-as-code, reliability engineering, and getting rid of tedious, manual tasks.

You will:

  • Design new services, tools, and monitoring to be implemented by the entire team.
  • Analyze the tradeoffs of the proposed design and make recommendations based on these tradeoffs.
  • Mentor new engineers to achieve more than they thought possible. You enjoy making other teams successful and are fulfilled through the success of others.

Work on database reliability projects, including:

  • HA, Business Continuity Planning, disaster recovery, backup/restore, RTO, RPO
  • Database uptime and performance
  • Capacity management & planning
  • SLIs, SLOs, error budgets, and monitoring dashboards
  • Responsible for deployment and operations of large-scale distributed data stores and streaming services
  • Establishing design patterns for monitoring and benchmarking
  • Establishing and documenting production run books and guidelines for developers
  • Tooling, toil reduction, runbooks & automation to manage production environments
  • Incident management and improving MTTD/MTTR for services
  • Cloud cost optimization

Qualifications

Mandatory

  • 3+ years of experience with deployment, operations, and performance management of large-scale Cassandra clusters along with Zookeeper.
  • 2+ years of experience in managing large-scale cloud-native microservices platforms.
  • Experience with infrastructure automation and scripting using Python and/or bash scripting.
  • Excellent problem-solving, troubleshooting, and debugging skills in large-scale distributed systems

Preferred

  • Confluent Certified Administrator for Apache Kafka and/or Apache Cassandra Administrator Associate certifications are preferred
  • AWS Solutions Architect certification preferred.
  • Experience with automating deployment, operations, and performance management of one or more of the following large-scale clusters such as Cassandra, Kafka, Elastic/Open Search, MongoDB, ZooKeeper, Redis, etc.
  • Strong hands-on experience deploying, managing, and monitoring large-scale Kubernetes clusters in the public cloud specifically AWS or GCP
  • Experience with Infrastructure-as-Code using Terraform, CloudFormation, Google Deployment Manager, Pulumi, Packer, ARM, etc.
  • Proven skills to effectively work across teams and functions to influence the design, operations, and deployment of highly available software.

Bachelors/Masters in Computer Science, Engineering, or related technical field, or equivalent practical experience.

We value diversity, equity, and inclusion at Splunk and are an equal employment opportunity employer. Qualified applicants receive consideration for employment without regard to race, religion, color, national origin, ancestry, sex, gender, gender identity, gender expression, sexual orientation, marital status, age, physical or mental disability or medical condition, genetic information, veteran status, or any other consideration made unlawful by federal, state, or local laws. We consider qualified applicants with criminal histories, consistent with legal requirements.