Site Reliability Engineer
1 month ago
Site Reliability Engineer is one of the critical role in the technology team and the person working in this team will be responsible for application performance, availability, reliability and system uptime. Candidate is responsible to provide consultation and strategic recommendations by quickly assessing and remediating complex platform availability issues. Site Reliability Engineer will dive head-first into creating or applying innovative solutions and techniques that advance the reliability of Digital products.
Experience Criteria
2-5 Years of relevant experience
Key responsibilities:
- Installation/deployment of new releases , environments for applications.
- Build and maintain highly scalable, large scale deployments globally
- Co-Create and maintain architecture for 100% uptime. E.g. creating alternate connectivity.
- Practice sustainable incident response/management and blameless post-mortems.
- Monitor and maintain production environment stability.
- Own entire platforms (prod environments) Deploying, automating, maintaining and
managing production systems, to ensure the availability, performance, scalability and
security of productions systems
- Engage in and improve the whole lifecycle of services from inception and design, through
deployment, operation and refinement.
- Support services before they go live through activities such as system design consulting,
developing software platforms and frameworks, capacity planning and launch reviews.
- Maintain services once they are live by measuring and monitoring availability, latency
and overall system health.
- Scale systems sustainably through mechanisms like automation and evolve systems by
pushing for changes that improve reliability and velocity.
- Collaborate with Agile teams in defining technical requirements and best practices with
containerized and cloud-native applications
- Represent production support and site reliability in stand-ups, planning sessions,
code reviews, and architecture reviews
- Help evolve our configuration management (CM) efforts and our move to containers
- Help the operations head in selecting the enthusiastic and technically knowledgeable
team and guide the existing team members.
Skills Required :
- Should have good knowhow of application, middleware, Databases (posgres, mongo,
mysql etc.), infra, OS.
- Should have good understanding in Docker and Kubernetes.
- Should have an understanding of CI/CD and DevOps tools like Jenkins, Ansible, Shell
scripting etc
- Monitoring and Logging: Experience with monitoring and logging tools (e.g. Nagios /
appdynamics, ELK, Prometheus).
- Good Experience of distributed systems RabbitMQ, Kafka, Redis etc.
- Should have an experience of working on Linux, Weblogic/tomcat, Jboss and middleware
technology.
- Should have worked on high traffic & highly scalable systems in past
- Knowledge on fundamental aspects for release automation (packaging, dependencies,
promotion, deployment, compliance)
-
Site Reliability Engineer
1 month ago
Gurugram, India Airtel Digital Full timeSite Reliability Engineer is one of the critical role in the technology team and the person working in this team will be responsible for application performance, availability, reliability and system uptime. Candidate is responsible to provide consultation and strategic recommendations by quickly assessing and remediating complex platform availability issues....
-
Senior Site Reliability Engineer, Platform
2 weeks ago
gurugram, India GEMINI Full timeDepartment : Platform Our Platform organization’s purpose is to enable Gemini to scale effectively and empower our engineering teams to focus on building innovative financial products and experiences for individuals around the world. Within Platform, the Site Reliability Engineering team is responsible for partnering with Gemini’s other...
-
Senior Site Reliability Engineer, Platform
1 month ago
Gurugram, India GEMINI Full timeDepartment : Platform Our Platform organization’s purpose is to enable Gemini to scale effectively and empower our engineering teams to focus on building innovative financial products and experiences for individuals around the world. Within Platform, the Site Reliability Engineering team is responsible for partnering with Gemini’s other engineering...
-
Site Reliability Engineer
2 weeks ago
gurugram, India StatusNeo Full timeJob Description: We are seeking a highly skilled and experienced Senior Site Reliability Engineer with expertise in Core Tools and DevOps to join our dynamic team. The ideal candidate will have a strong background in Linux administration, cloud infrastructure, Infrastructure as Code (IaC), Python programming, and be a subject matter expert in DevOps tools...
-
Site Reliability Engineer
1 month ago
Gurugram, India StatusNeo Full timeJob Description: We are seeking a highly skilled and experienced Senior Site Reliability Engineer with expertise in Core Tools and DevOps to join our dynamic team. The ideal candidate will have a strong background in Linux administration, cloud infrastructure, Infrastructure as Code (IaC), Python programming, and be a subject matter expert in DevOps tools...
-
Principle Engineer
1 month ago
Gurgaon,Gurugram, India SAR HR Consultancy Full timePrinciple Engineer - SRE What You Need for this Position:- 10+ years of hands-on technical experience within the realm of Site Reliability Engineering- Architect-level understanding of one or more of the major public cloud services (AWS, GCP & Azure), using them to effectively design secure and scalable services.- Strong understanding of SRE concepts and...
-
Site Reliability Engineer
2 weeks ago
Gurugram, India Codersbrain technology pvt ltd Full timeKey Responsibilities :- Provide expert production support for application teams utilizing our platform, ensuring high availability, reliability, and performance.- Diagnose and resolve complex issues in production environments, collaborating closely with development teams and stakeholders.- Implement and maintain monitoring, alerting, and logging solutions to...
-
Senior SRE
1 month ago
Gurugram, India Epam Full timeDescription EPAM is a leading global provider of digital platform engineering and development services. We are committed to having a positive impact on our customers, our employees, and our communities. We embrace a dynamic and inclusive culture. Here you will collaborate with multi-national teams, contribute to a myriad of innovative projects that...
-
Lead Site Reliability Engineer
2 weeks ago
gurugram, India Epam Full timeDescription EPAM is a leading global provider of digital platform engineering and development services. We are committed to having a positive impact on our customers, our employees, and our communities. We embrace a dynamic and inclusive culture. Here you will collaborate with multi-national teams, contribute to a myriad of innovative projects...
-
Senior SRE
2 weeks ago
gurugram, India Epam Full timeDescription EPAM is a leading global provider of digital platform engineering and development services. We are committed to having a positive impact on our customers, our employees, and our communities. We embrace a dynamic and inclusive culture. Here you will collaborate with multi-national teams, contribute to a myriad of innovative projects...
-
Site Reliability Engineer
2 weeks ago
Bangalore/Gurgaon/Gurugram, IN Codersbrain technology pvt ltd Full timeKey Responsibilities :- Provide expert production support for application teams utilizing our platform, ensuring high availability, reliability, and performance.- Diagnose and resolve complex issues in production environments, collaborating closely with development teams and stakeholders.- Implement and maintain monitoring, alerting, and logging solutions to...
-
Site Reliability Engineer
2 weeks ago
Gurgaon/Gurugram, India E-Qube Digital Services Full timeJob Description : - 5 - 7 years' experience in cloud infrastructure engineering roles- 1-3 years' experience as Site Reliability Engineer or similar role, in a global organization.- Bachelor's degree in computer science, information systems or other related field (or equivalent work experience) - Customer service: experience working with...
-
Site Reliability Engineer
1 month ago
Gurugram, India Citadel Securities Full timeJob Description Responsibilities: Candidates who have less than 3 years of experience should possess: Good knowledge of UNIX/Linux command line. Good understanding of the usage of TCP/IP and UDP networking in applications. Basic understanding of network routing and troubleshooting. Basic experience in writing SQL database queries. Basic...
-
Site Reliability Engineer
3 weeks ago
Gurugram, India Acefone Full timeKey Responsibilities:1. Telephony Infrastructure Management:Design, implement, and maintain internet telephony systems to ensure high availability and call quality.Manage and optimize cloud telephony services to scale with our growing user base.Troubleshoot and resolve telephony-related issues to minimize downtime and disruptions. 2. Cloud Expertise:Utilize...
-
Site Reliability Engineer
3 weeks ago
gurugram, India Acefone Full timeKey Responsibilities: 1. Telephony Infrastructure Management: Design, implement, and maintain internet telephony systems to ensure high availability and call quality. Manage and optimize cloud telephony services to scale with our growing user base. Troubleshoot and resolve telephony-related issues to minimize downtime and disruptions. 2. Cloud Expertise:...
-
Site Reliability Engineer
3 weeks ago
Gurugram, India Acefone Full timeKey Responsibilities:1. Telephony Infrastructure Management:Design, implement, and maintain internet telephony systems to ensure high availability and call quality.Manage and optimize cloud telephony services to scale with our growing user base.Troubleshoot and resolve telephony-related issues to minimize downtime and disruptions. 2. Cloud Expertise:Utilize...
-
Site Reliability Engineer
2 weeks ago
gurugram, India Citadel Securities Full timeJob Description Responsibilities: Candidates who have less than 3 years of experience should possess: Good knowledge of UNIX/Linux command line. Good understanding of the usage of TCP/IP and UDP networking in applications. Basic understanding of network routing and troubleshooting. Basic experience in writing SQL database queries. Basic...
-
Lead Site Reliability Engineer
2 weeks ago
gurugram, India Cvent Full timeOverview: Founded in 1999, Cvent has become the global leader in meetings, event, travel, and hospitality technology, with more than 4000+ employees worldwide. As a leading cloud-based technology company, we have over 28,000+ customers, including 80% of the Fortune 100 companies, in more than 100 countries. Cvent’s software solutions optimize the entire...
-
Site Construction Engineer, PEB
2 weeks ago
gurugram, India HITACHI ENERGY INDIA LIMITED Full timeDescription : General Information: We are looking for a Site Construction Engineer for Pre-Engineered Buildings (PEB) to join our HVDC team at Hitachi Energy. The Site Construction Engineer, PEB will be coordinating and inspecting the PEB construction works throughout its scope of delivery. The Site Construction Engineer, PEB will report to...
-
Site Construction Engineer, PEB
1 month ago
Gurugram, India HITACHI ENERGY INDIA LIMITED Full timeDescription : General Information: We are looking for a Site Construction Engineer for Pre-Engineered Buildings (PEB) to join our HVDC team at Hitachi Energy. The Site Construction Engineer, PEB will be coordinating and inspecting the PEB construction works throughout its scope of delivery. The Site Construction Engineer, PEB will report to the...