Site Reliability Engineer

3 weeks ago


India CES Full time
Job Description

We are seeking a hands-on SRE with expertise in infrastructure automation, cloud scalability, and performance optimization. You'll design, manage, and monitor large-scale AWS environments, ensuring high availability, security, and reliability for our SaaS platforms.

Key Responsibilities

- Develop and execute UI automation using Cypress with TypeScript.
- Conduct performance testing using K6.
- Perform API testing with Postman.
- Run accessibility testing using Wave, AudioEye, and similar tools.
- Manage and optimize AWS infrastructure at scale (EC2, S3, ELB, Lambda, Route 53, ECS, SQS, CloudWatch).
- Package, deploy, and manage containerized workloads (Docker, Kubernetes).
- Automate workflows using Terraform, CDK, Chef.
- Implement CI/CD pipelines (TeamCity, Octopus Deploy, GitHub, Jenkins, Codefresh).
- Monitor and troubleshoot using ELK stack, Dynatrace, New Relic, Nagios.
- Manage and optimize IIS and web farms in high-traffic SaaS environments.

Key Skills & Experience

- 3+ years with IaC & DSC tools (Terraform, CDK, Chef).
- 3+ years managing containerized workloads on PaaS (Docker, Kubernetes).
- Strong scripting/automation skills (PowerShell, Ruby, Go, Python, Bash).
- Experience with large-scale monitoring & reporting.
- Solid understanding of .NET application architecture.
- Proven problem-solving & troubleshooting skills in DevOps/SRE environments.
