Senior Site Reliability Engineer
2 weeks ago
We’re looking for an experienced Site Reliability Engineer to fill the mission-critical role of ensuring that our complex, web-scale systems are healthy, monitored, automated, and designed to scale. You will use your background as an operations generalist to work closely with our development teams from the early stages of design all the way through identifying and resolving production issues. The ideal candidate will be passionate about an operations role that involves deep knowledge of both the application and the product, and will also believe that automation is a key component to operating large-scale systems.6-Month AccomplishmentsFamiliarize with poshmark tech stack and functional requirements. Get comfortable with automation tools/frameworks used within cloudops organization and deployment processes associated with. Gain in depth knowledge related to related product functionality and infrastructure required for it.Start Contributing by working on small to medium scale projects.Understand and follow on call rotation as a secondary to get familiarized with the on call process.12+ Month AccomplishmentsExecute projects related to comms functionality, independently, with little guidance from lead.Create meaningful alerts and dashboards for various sub-system involved in targeted infrastructure.Identify gaps in infrastructure and suggest improvements or work on it.Get involved in on-call rotation.ResponsibilitiesServe as a primary point responsible for the overall health, performance, and capacity of one or more of our Internet-facing services.Gain deep knowledge of our complex applications.Assist in the roll-out and deployment of new product features and installations to facilitate our rapid iteration and constant growth.Develop tools to improve our ability to rapidly deploy and effectively monitor custom applications in a large-scale UNIX environment.Work closely with development teams to ensure that platforms are designed with "operability" in mind.Function well in a fast-paced, rapidly-changing environment.Participate in a 24x7 on-call rotationDesired Skills4+ years of experience in Systems Engineering/Site Reliability Operations role is required, ideally in a startup or fast-growing company.4+ years in a UNIX-based large-scale web operations role.4+ years of experience in doing 24/7 support for large scale production environments. Battle-proven, real-life experience in running a large scale production operation.Experience working on cloud-based infrastructure e.g AWS, GCP, Azure.Hands-on experience with continuous integration tools such as Jenkins, configuration management with Ansible, systems monitoring and alerting with tools such as Nagios, New Relic, Graphite.Experience scripting/coding Ability to use a wide variety of open source technologies and tools.Technologies we use: Ruby, JavaScript, NodeJs, Tomcat, Nginx, HaProxy MongoDB, RabbitMQ, Redis, ElasticSearch. Amazon Web Services (EC2, RDS, CloudFront, S3, etc.) Terraform, Packer, Jenkins, Datadog, Kubernetes, Docker, Ansible and other DevOps tools.
-
Site Reliability Engineer
4 days ago
tamil nadu, India Tata Consultancy Services Full timeTCS has been a great pioneer in feeding the fire of young techies like you. We are a global leader in the technology arena and there’s nothing that can stop us from growing together. What we are looking for Role: Site Reliability Engineering (SRE) Experience Range: 5 – 15 Years Location: Chennai/Pune candidates should come to office for Walk in...
-
Site Reliability Engineer
7 days ago
tamil nadu, India Grootan Technologies Full timeAbout the RoleWe are seeking a skilled Site Reliability Engineer (SRE) with 4–5 years of hands-on experience to join our engineering team. In this role, you will be responsible for building and maintaining reliable, scalable, and secure infrastructure to support our applications. You will leverage your expertise in automation, cloud platforms, and...
-
Site Reliability Engineer
3 days ago
tamil nadu, India Datum Technologies Group Full timeJob Title: Site Reliability Engineer (SRE) – AWSExperience: 8+ yearsLocation: Chennai / MumbaiWork Mode: HybridKey Skills: AWS, Terraform, Kubernetes, Docker, Grafana, Prometheus, DatadogJob Summary:We are looking for a skilled Site Reliability Engineer (SRE) with strong AWS experience and a solid background in DevOps, automation, observability, and...
-
Site Reliability Engineer
55 minutes ago
tamil nadu, India Datum Technologies Group Full timeJob Title: Site Reliability Engineer (SRE) – AWS Experience: 8+ years Location: Chennai / Mumbai Work Mode: Hybrid Key Skills: AWS, Terraform, Kubernetes, Docker, Grafana, Prometheus, Datadog Job Summary: We are looking for a skilled Site Reliability Engineer (SRE) with strong AWS experience and a solid background in DevOps, automation, observability, and...
-
Sr. Site Reliability Engineer
2 weeks ago
tamil nadu, India Datum Technologies Group Full timeJob Details: Job Title: Sr. Site Reliability Engineer (SRE) Duration: Contract to Hire (On the Payroll of Datum Technology Group) Location: Chennai || Mumbai || Gurugram Interview Process: Virtual (2 Rounds) + 1 Technical screening. Job Description: We are seeking a highly skilled Senior Site Reliability Engineer (SRE) to enhance reliability, scalability,...
-
Sr. Site Reliability Engineer
2 weeks ago
tamil nadu, India Datum Technologies Group Full timeJob Details:Job Title: Sr. Site Reliability Engineer (SRE) Duration: Contract to Hire (On the Payroll of Datum Technology Group)Location: Chennai || Mumbai || Gurugram Interview Process: Virtual (2 Rounds) + 1 Technical screening.Job Description:We are seeking a highly skilled Senior Site Reliability Engineer (SRE) to enhance reliability, scalability, and...
-
Senior Site Reliability Engineer
1 week ago
tamil nadu, India Poshmark Full timeWe’re looking for an experienced Site Reliability Engineer to fill the mission-critical role of ensuring that our complex, web-scale systems are healthy, monitored, automated, and designed to scale. You will use your background as an operations generalist to work closely with our development teams from the early stages of design all the way through...
-
Senior Site Reliability Engineer
4 weeks ago
Chennai, Tamil Nadu, India Miratech Full timeCompany Description Miratech helps visionaries change the world We are a global IT services and consulting company that brings together enterprise and start-up innovation Today we support digital transformation for some of the world s largest enterprises By partnering with both large and small players we stay at the leading edge of technology remain nimble...
-
Lead Site Reliability Engineer
2 weeks ago
tamil nadu, India Datum Technologies Group Full timeJob Details: Job Title: Lead Site Reliability Engineer (SRE) Duration: Contract to Hire (On the Payroll of Datum Technology Group) Location: Chennai || Mumbai || Gurugram Interview Process: Virtual (2 Rounds) + 1 Technical screening. Job Description: We are seeking a highly skilled and experienced Lead Site Reliability Engineer (SRE) to drive reliability,...
-
Site Reliability Engineer
2 weeks ago
Chennai, Tamil Nadu, India, Tamil Nadu Tata Consultancy Services Full timeRole: Site Reliability EngineerLocation: Chennai/Bangalore/HyderabadExp- 5-11 years1.Exposure to any APM tool like Dynatrace, Appdynamics, Splunk, etc2.DBA or Infra admin 3.Gremlin or Chaos Monkey or Simian Army or Litmus expertise4.Exposure to ITSM tools like Service Now, etc5.Understanding of Automation and Chaos Engineering6.Exposure to Devops tools and...