Senior Site Reliability Engineer
3 weeks ago
We’re looking for an experienced Site Reliability Engineer to fill the mission-critical role of ensuring that our complex, web-scale systems are healthy, monitored, automated, and designed to scale. You will use your background as an operations generalist to work closely with our development teams from the early stages of design all the way through identifying and resolving production issues. The ideal candidate will be passionate about an operations role that involves deep knowledge of both the application and the product, and will also believe that automation is a key component to operating large-scale systems.6-Month Accomplishments- Familiarize with poshmark tech stack and functional requirements. - Get comfortable with automation tools/frameworks used within cloudops organization and deployment processes associated with. - Gain in depth knowledge related to related product functionality and infrastructure required for it. - Start Contributing by working on small to medium scale projects. - Understand and follow on call rotation as a secondary to get familiarized with the on call process.12+ Month Accomplishments- Execute projects related to comms functionality, independently, with little guidance from lead. - Create meaningful alerts and dashboards for various sub-system involved in targeted infrastructure. - Identify gaps in infrastructure and suggest improvements or work on it. - Get involved in on-call rotation.Responsibilities- Serve as a primary point responsible for the overall health, performance, and capacity of one or more of our Internet-facing services. - Gain deep knowledge of our complex applications. - Assist in the roll-out and deployment of new product features and installations to facilitate our rapid iteration and constant growth. - Develop tools to improve our ability to rapidly deploy and effectively monitor custom applications in a large-scale UNIX environment. - Work closely with development teams to ensure that platforms are designed with "operability" in mind. - Function well in a fast-paced, rapidly-changing environment. - Participate in a 24x7 on-call rotationDesired Skills- 4+ years of experience in Systems Engineering/Site Reliability Operations role is required, ideally in a startup or fast-growing company. - 4+ years in a UNIX-based large-scale web operations role. - 4+ years of experience in doing 24/7 support for large scale production environments. - Battle-proven, real-life experience in running a large scale production operation. - Experience working on cloud-based infrastructure e.g AWS, GCP, Azure. - Hands-on experience with continuous integration tools such as Jenkins, configuration management with Ansible, systems monitoring and alerting with tools such as Nagios, New Relic, Graphite. - Experience scripting/coding - Ability to use a wide variety of open source technologies and tools.Technologies we use:- Ruby, JavaScript, NodeJs, Tomcat, Nginx, HaProxy - MongoDB, RabbitMQ, Redis, ElasticSearch. - Amazon Web Services (EC2, RDS, CloudFront, S3, etc.) - Terraform, Packer, Jenkins, Datadog, Kubernetes, Docker, Ansible and other DevOps tools.
-
Senior Site Reliability Engineer
3 weeks ago
New Delhi, India Tata Consultancy Services Full timeDear Candidates, Greetings from TCS!!! TCS is looking for Senior Site Reliability Engineer – AWS Experience: 8-12 years Location: ChennaiMust have skills: Design, implement, and maintain scalable, secure, and highly available infrastructure on AWS Develop and improve CI/CD pipelines, Infrastructure as Code (IaC) using Terraform, Harness Own and implement...
-
Site Reliability Engineer
2 weeks ago
New Delhi, India WhiteLotus Talent Partners Full timeWe are looking for a L0 and L1 Site Reliability Engineer (SRE) Support to join our Krutrim Cloud Site Reliability operations team and ensure the smooth functioning of our cloud infrastructure powered by OpenStack and Kubernetes. In this role, you will focus on monitoring, basic troubleshooting, and incident response, helping to maintain high system...
-
Site Reliability Engineer
3 weeks ago
New Delhi, India WhiteLotus Talent Partners Full timeWe are looking for aL0 and L1 Site Reliability Engineer (SRE) Supportto join our Krutrim Cloud Site Reliability operations team and ensure the smooth functioning of our cloud infrastructure powered byOpenStackandKubernetes . In this role, you will focus onmonitoring ,basic troubleshooting , andincident response , helping to maintain high system availability,...
-
Site Reliability Engineer
2 weeks ago
New Delhi, India SID Global Solutions Full timeJob Role: Site Reliability Engineer (SRE) – GCPExperience: 3+ yearsLocation: HyderabadAbout SIDGS:SIDGS is a premium global systems integrator and global implementation partner of Google corporation, providing Digital Solutions & Services to Fortune 500 companies. Our Digital solutions go across following domains: User Experience, CMS, API Management,...
-
Site Reliability Engineer
4 weeks ago
New Delhi, India SID Global Solutions Full timeJob Role: Site Reliability Engineer (SRE) – GCP Experience: 3+ years Location: HyderabadAbout SIDGS: SIDGS is a premium global systems integrator and global implementation partner of Google corporation, providing Digital Solutions & Services to Fortune 500 companies. Our Digital solutions go across following domains: User Experience, CMS, API Management,...
-
Senior Staff Site Reliability Engineer
2 weeks ago
New Delhi, India Movius Full timeSenior Staff Site Reliability EngineerLocation: Bengaluru, KA, 560076 Job Description: We are seeking a highly skilled Senior Staff Site Reliability Engineer with extensive experience in DevOps/SRE roles and large-scale distributed systems. The ideal candidate will have a proven background in cloud operations, automation, and CI/CD, with a preference for...
-
Senior Staff Site Reliability Engineer
1 week ago
New Delhi, India Movius Full timeSenior Staff Site Reliability EngineerLocation: Bengaluru, KA, 560076Job Description:We are seeking a highly skilled Senior Staff Site Reliability Engineer with extensive experience in DevOps/SRE roles and large-scale distributed systems. The ideal candidate will have a proven background in cloud operations, automation, and CI/CD, with a preference for...
-
Senior Staff Site Reliability Engineer
2 weeks ago
New Delhi, India Movius Full timeSenior Staff Site Reliability EngineerLocation: Bengaluru, KA, 560076Job Description:We are seeking a highly skilled Senior Staff Site Reliability Engineer with extensive experience in DevOps/SRE roles and large-scale distributed systems. The ideal candidate will have a proven background in cloud operations, automation, and CI/CD, with a preference for...
-
Site Reliability Engineer
3 weeks ago
New Delhi, India Brillio Full timeHiring: Senior Infrastructure Technical Specialist (SRE Experience)Location: Bengaluru, Pune, ChennaiMode of work: 3 days WFOExperience: Senior LevelWe’re looking for a Senior Infrastructure Technical Specialist with strong Site Reliability Engineering (SRE) expertise to join our dynamic team. The ideal candidate will have hands-on experience with IT...
-
Senior Site Reliability Engineer
2 weeks ago
New Delhi, India Tata Consultancy Services Full timeRole**: Senior Site Reliability Engineer (SRE)Required Technical Skill Set: Senior Site Reliability Engineer (SRE)Desired Experience Range: 7 - 10 yrsNotice Period: Immediate to 90Days onlyLocation of Requirement: BangaloreWe are currently planning to do a Virtual InterviewJob Description:Key ResponsibilitiesInfrastructure & Application Support- Design,...