Reliability Engineer I
3 weeks ago
The Site Reliability Engineer (SRE) bridges the gap between technical customer support and high-level infrastructure management. This role is responsible for supporting a proprietary, Linux-based and AWS-hosted Platform-as-a-Service offering, ensuring both operational excellence and outstanding customer satisfaction. Initially focused on front-line technical product support, the SRE will have opportunities to grow into more advanced positions and responsibilities. The ideal candidate will possess the technical expertise and interpersonal skills necessary to engage directly with end users, resolving complex issues while continuously improving platform reliability and performance.
The Site Reliability Engineer I focuses on providing front-line technical product support, troubleshooting customer issues, and building foundational expertise in platform operations and automation tools. The Site Reliability Engineer II demonstrates the same technical capability while also taking on advanced infrastructure responsibilities, leading infrastructure initiatives, and mentoring junior team members.
Responsibilities:
● Serve as the front-line technical resource for troubleshooting and resolving customer issues related to the Company's Linux-based AWS platform.
● Provide exceptional technical support to internal and external stakeholders, ensuring timely resolution of issues within established SLAs.
● Document and escalate complex issues to senior technical resources as needed while striving to independently resolve more advanced issues over time.
● Monitor and respond to technical incidents, identify root causes, and collaborate with internal teams to implement long-term solutions.
● Write and maintain knowledge base articles and training materials for end users and internal teams.
● Manage and maintain infrastructure via automation tools such as Terraform, Ansible, CloudFormation, and Chef, as responsibilities grow.
● Act as a subject matter expert during client deployment, implementation, and migration projects.
● Collaborate closely with the Product, Quality Assurance, Engineering, and Operations teams to ensure alignment and a seamless user experience.
● Document product use cases, enhancements, and bug fixes; advocate for product improvement based on user feedback.
● Participate in on-call rotations to provide 24/7 operational support.
● Maintain strong relationships with customers and stakeholders, striving for exceptional satisfaction and engagement.
● Become familiar with the secure use and management of the AWS control plane to ensure compliance with security and data privacy standards as expertise develops.
● Participate in the design, deployment, and maintenance of CI/CD pipelines to support seamless application development and deployment.
● Oversee the availability and performance of production and development environments, ensuring alignment with SLAs and industry best practices.
Requirements:
● 3+ years of experience in technical customer support or service desk environments, with a focus on technical product support.
● 5+ years of experience in cloud computing and infrastructure management.
● Strong knowledge of Amazon Web Services (AWS), including containerized applications (EKS, ECS, ECR, Elastic Beanstalk).
● Proficiency in Linux administration, including user management, software installation, and file system management.
● Familiarity with networking concepts and DNS.
● Hands-on experience with CI/CD tools and processes.
● Proficiency with version control tools (Git, SVN).
● Excellent oral and written English communication skills with a customer-centric perspective.
● Strong troubleshooting and critical thinking skills.
● Ability to work both independently and collaboratively in a team environment.
● High attention to detail and organizational skills.
● Proficiency in at least one programming language.
Preferred Qualifications:
● AWS certification (any level).
● 4-year college degree in a technical or quantitative science field, or equivalent work experience.
● Experience supporting end users in a service desk or technical customer support environment.
● Familiarity with virtualized infrastructure management and security best practices.
Benefits
Location: Onsite