Site Reliability Engineer

7 hours ago


Hyderabad, Telangana, India BYLD Group Full time ₹ 12,00,000 - ₹ 36,00,000 per year

Description
Job Title: Site Reliability Engineer (SRE) - DataDog / AWS Lambda / DynamoDB / Serverless

Location: Bangalore / Pune / Hyderabad

Experience: 5-10 Years

About The Role
We are seeking an experienced Site Reliability Engineer (SRE) with strong expertise in DataDog integration, AWS Lambda, DynamoDB, and Serverless architectures. The ideal candidate will be responsible for building, monitoring, and maintaining highly reliable, scalable, and secure cloud-based systems.

Key Responsibilities

  • Design, implement, and maintain monitoring and observability solutions using DataDog (metrics, logs, traces, dashboards, and alerts).
  • Develop and optimize serverless applications using AWS Lambda and related AWS services.
  • Manage and optimize DynamoDB for scalability, reliability, and cost efficiency.
  • Automate deployment and infrastructure provisioning using AWS CDK / CloudFormation / Terraform.
  • Implement reliability engineering practices including performance tuning, auto-scaling, and fault tolerance.
  • Collaborate with development teams to design and implement highly available, resilient, and secure architectures.
  • Troubleshoot production issues and drive root cause analysis (RCA) to ensure long-term stability.
  • Continuously improve CI/CD pipelines and observability frameworks.

Required Skills & Experience

  • 5-10 years of total experience, with at least 3 years in SRE / DevOps roles.
  • Hands-on experience with DataDog setup and integrations (custom metrics, APM, log management).
  • Strong experience with AWS Lambda, DynamoDB, and other Serverless services (API Gateway, Step Functions, SQS, SNS).
  • Proficiency in Python / Bash scripting for automation.
  • Experience with IaC tools like Terraform, CloudFormation, or AWS CDK.
  • Solid understanding of AWS architecture, networking, and security best practices.
  • Working knowledge of CI/CD tools (GitHub Actions, Jenkins, CodePipeline, etc.).
  • Experience with incident management, monitoring dashboards, and alerting automation.

Good To Have

  • Experience with Kubernetes / ECS / EKS for container orchestration.
  • Familiarity with CloudWatch, Prometheus, or Grafana.
  • AWS Certification (Solutions Architect / DevOps Engineer) preferred.




  • Hyderabad, Telangana, India Talent Worx Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    Site Reliability Engineer (SRE)
    At Talent Worx, we are looking for a dedicated Site Reliability Engineer (SRE) to join our team. This role involves maintaining high availability and reliability of our services through the application of software engineering practices and systems administration skills. The ideal candidate will bridge the gap between...


  • Hyderabad, Telangana, India 2a1d0a41-1875-4bbb-b5a8-e4d5620cfd5f Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    Role & responsibilities
    Coordinates cross-product chaos experimentation to proactively test system resilience and uncover reliability gaps. Maintains the centralized incident response playbook for the subdivision, documenting standards for communication, escalation, and recovery during incidents. Aggregates and reports quantifiable availability data to senior...


  • Hyderabad, Telangana, India Assurant Full time ₹ 6,00,000 - ₹ 12,00,000 per year

    Site Reliability Engineer, GCC-Assurant
    The Site Reliability Engineer (SRE) will be part of the Assurant Reliability Team, specifically within the Site Reliability Engineering area. This remote position, based in India, focuses on building and maintaining reliable, scalable systems through a combination of software development and network diagnostics. The...


  • Hyderabad, Telangana, India Assurant Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    Site Reliability Engineer, GCC-Assurant
    The Site Reliability Engineer (SRE) will be part of the Assurant Reliability Team, specifically within the Site Reliability Engineering area. This remote position, based in India, focuses on building and maintaining reliable, scalable systems through a combination of software development and network diagnostics. The...


  • Hyderabad, Telangana, India Technology Next Full time ₹ 20,00,000 - ₹ 30,00,000 per year

    Urgently hiring for Site Reliability Engineer (SRE) / Chaos Engineer
    Location: Hyderabad
    Job Type: Full-time, Permanent
    Job Description: We are looking for an experienced Site Reliability Engineer (SRE) with strong Python automation skills (Boto3 required) and hands-on experience in chaos engineering to improve system reliability and resilience. The ideal...


  • Hyderabad, Telangana, India Elios Talent Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    Site Reliability Engineer
    Key Highlights: Build, automate, and support cloud-native infrastructure powering high-availability platforms. Contribute to automation-first engineering across AWS, Terraform, CI/CD, and observability tooling. Improve reliability, uptime, system health, and performance across production environments. Strengthen DevSecOps...


  • Hyderabad, Telangana, India Oracle Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    Oracle is seeking a motivated Principal Site Reliability Engineer who thrives in a fast-paced, rapidly evolving technology environment. This position requires broad knowledge of Mainframe zLinux, DB2, zVM, and AIX. The Site Reliability Engineer is expected to work with multiple service and product development teams, identifying cross-team issues that...


  • Hyderabad, Telangana, India Oracle Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    Oracle is seeking a motivated Principal Site Reliability Engineer who thrives in a fast-paced, rapidly evolving technology environment. This position requires broad knowledge of Linux administration, AI technologies, software development, cloud computing, networking, cloud security, performance analysis and monitoring to provide the stability,...


  • Hyderabad, Telangana, India Talent Worx Full time ₹ 12,00,000 - ₹ 36,00,000 per year

    SRE (Site Reliability Engineer)
    Talent Worx is seeking a talented SRE (Site Reliability Engineer) to enhance our technology team. In this role, you will be pivotal in ensuring the reliability, performance, and availability of our applications and services. Your work will involve both software engineering and systems operations as you strive to improve...


  • Hyderabad, Telangana, India SID Global Solutions Full time ₹ 9,00,000 - ₹ 12,00,000 per year

    Job Role: Site Reliability Engineer (SRE) – GCP
    Experience: 3+ years
    Location: Hyderabad
    About SIDGS: SIDGS is a premium global systems integrator and global implementation partner of Google, providing digital solutions and services to Fortune 500 companies. Our digital solutions span the following domains: User Experience, CMS, API Management,...