Senior Site Reliability Engineer
3 weeks ago
Job Description We're looking for an experienced Site Reliability Engineer to fill the mission-critical role of ensuring that our complex, web-scale systems are healthy, monitored, automated, and designed to scale. You will use your background as an operations generalist to work closely with our development teams from the early stages of design all the way through identifying and resolving production issues. The ideal candidate will be passionate about an operations role that involves deep knowledge of both the application and the product, and will also believe that automation is a key component to operating large-scale systems. 6-Month Accomplishments - Familiarize with poshmark tech stack and functional requirements. - Get comfortable with automation tools/frameworks used within cloudops organization and deployment processes associated with. - Gain in depth knowledge related to related product functionality and infrastructure required for it. - Start Contributing by working on small to medium scale projects. - Understand and follow on call rotation as a secondary to get familiarized with the on call process. 12+ Month Accomplishments - Execute projects related to comms functionality, independently, with little guidance from lead. - Create meaningful alerts and dashboards for various sub-system involved in targeted infrastructure. - Identify gaps in infrastructure and suggest improvements or work on it. - Get involved in on-call rotation. Responsibilities - Serve as a primary point responsible for the overall health, performance, and capacity of one or more of our Internet-facing services. - Gain deep knowledge of our complex applications. - Assist in the roll-out and deployment of new product features and installations to facilitate our rapid iteration and constant growth. - Develop tools to improve our ability to rapidly deploy and effectively monitor custom applications in a large-scale UNIX environment. - Work closely with development teams to ensure that platforms are designed with operability in mind. - Function well in a fast-paced, rapidly-changing environment. - Participate in a 24x7 on-call rotation Desired Skills - 4+ years of experience in Systems Engineering/Site Reliability Operations role is required, ideally in a startup or fast-growing company. - 4+ years in a UNIX-based large-scale web operations role. - 4+ years of experience in doing 24/7 support for large scale production environments. - Battle-proven, real-life experience in running a large scale production operation. - Experience working on cloud-based infrastructure e.g AWS, GCP, Azure. - Hands-on experience with continuous integration tools such as Jenkins, configuration management with Ansible, systems monitoring and alerting with tools such as Nagios, New Relic, Graphite. - Experience scripting/coding - Ability to use a wide variety of open source technologies and tools. Technologies we use: - Ruby, JavaScript, NodeJs, Tomcat, Nginx, HaProxy - MongoDB, RabbitMQ, Redis, ElasticSearch. - Amazon Web Services (EC2, RDS, CloudFront, S3, etc.) - Terraform, Packer, Jenkins, Datadog, Kubernetes, Docker, Ansible and other DevOps tools.
- 
					
						Cloud Site Reliability Engineer
2 weeks ago
Chennai, Tamil Nadu, India Ford Global Career Site Full time ₹ 15,00,000 - ₹ 25,00,000 per yearBe at the Forefront of Mobility's Future: Join Ford as a Site Reliability EngineerEnterprise Technology is the engine driving the future of transportation, and we're looking for a talented Site Reliability Engineer (SRE) to help us redefine mobility. In this role, you'll leverage cutting-edge technology to enhance customer experiences, improve lives, and...
 - 
					
						Senior site reliability engineer
3 weeks ago
Chennai, India Tata Consultancy Services Full timeDear Candidates,Greetings from TCS!!!TCS is looking for Senior Site Reliability Engineer – AWSExperience: 8-12 yearsLocation: ChennaiMust have skills:- Design, implement, and maintain scalable, secure, and highly available infrastructure on AWS- Develop and improve CI/CD pipelines, Infrastructure as Code (Ia C) using Terraform, Harness- Own and implement...
 - 
					
						Senior Site Reliability Engineer
4 weeks ago
Chennai, Tamil Nadu, India, Tamil Nadu Tata Consultancy Services Full timeDear Candidates,Greetings from TCS!!!TCS is looking for Senior Site Reliability Engineer – AWSExperience: 8-12 yearsLocation: ChennaiMust have skills: Design, implement, and maintain scalable, secure, and highly available infrastructure on AWSDevelop and improve CI/CD pipelines, Infrastructure as Code (IaC) using Terraform, HarnessOwn and implement...
 - 
					
						Site Reliability Engineer
3 weeks ago
, India, IN Sonata Software Full timeWe're Hiring: Senior Site Reliability Engineer Location: Onsite (Office: Hyderabad – Mandatory from Day 1) Employment Type: Full-time Notice Period: Immediate to 15 Days Only Experience: 8+ Years About the RoleWe’re looking for a Senior Site Reliability Engineer (SRE) to lead reliability initiatives across our production systems. This is a high-impact...
 - 
					
						Senior site reliability engineer
4 weeks ago
Chennai, India Tata Consultancy Services Full timeDear Candidates,Greetings from TCS!!!TCS is looking for Senior Site Reliability Engineer – AWSExperience: 8-12 yearsLocation: ChennaiMust have skills:- Design, implement, and maintain scalable, secure, and highly available infrastructure on AWS- Develop and improve CI/CD pipelines, Infrastructure as Code (Ia C) using Terraform, Harness- Own and implement...
 - 
					
						Senior/expert site
3 weeks ago
India IVedha Inc. Full timeSenior Site Reliability Engineer (SRE) – ELK Expert | Platform Engineering Practice Location: India (Remote) -Must be available to work in the EST (US/Canada) Time Zone. Role Summary:Are you a Senior Site Reliability Engineer (SRE) with deep ELK expertise, ready to take ownership of large-scale observability infrastructure?We're looking for an SRE with 7+...
 - 
					
						Senior Site Reliability Engineer
4 weeks ago
Pune, India Barclays Full timeJob Description Step into the role of Senior Site Reliability Engineer. At Barclays, we are more than a bank we are a force for progress. You will be the part of the central SRE (Site Reliability Engineer) core team within our wider Infrastructure team. You will act as a centre of excellence providing hands on consultancy to our different infrastructure...
 - 
					
Senior Site Reliability Engineer
2 weeks ago
India Akamai Full time ₹ 12,00,000 - ₹ 36,00,000 per yearWould you enjoy improving stability and safety of one of the largest global networks?Would you enjoy hands-on network operations work on a global scale to improve our operational efficiency?Join the Platform Cloud Services Engineering TeamThe Platform Cloud Services SRE team supports globally distributed hosting and database systems for Akamai. These systems...
 - 
					
						Senior site reliability engineer
4 weeks ago
India Sapaad Full timeWHO WE ARE Sapaad is a global leader in unified commerce platforms, delivering world-class software solutions for the food and beverage industry. Our flagship product, also named Sapaad, has achieved remarkable success over the past decade, empowering thousands of F& B businesses across 40+ countries —with many more coming onboard each day. Driven by a...
 - 
					
						Senior Site Reliability Engineer
4 weeks ago
India Sapaad Full timeWHO WE ARE Sapaad is a global leader in unified commerce platforms, delivering world-class software solutions for the food and beverage industry. Our flagship product, also named Sapaad, has achieved remarkable success over the past decade, empowering thousands of F&B businesses across 40+ countries —with many more coming onboard each day. Driven by a...