Reliability Enginner

3 weeks ago


Uttar Pradesh, India Caminosoft AI Full time

**Key Responsibilities**:

- Identify and resolve performance bottlenecks and infrastructure issues
- Develop and implement monitoring and alerting systems to ensure the timely detection of issues
- Work with teams to establish reliability targets and drive continuous improvement efforts
- Develop and implement disaster recovery plans to ensure business continuity in the event of a system failure
- Stay up to date with the latest advances in reliability engineering tools and methodologies, and identify opportunities to improve the reliability and scalability of our systems

**Requirements**:

- Bachelor's or Master's degree in Computer Science, Engineering, or related field
- 3+ years of experience in systems administration or software development, with a focus on reliability engineering
- Strong understanding of Linux/Unix systems administration, and experience with cloud platforms such as AWS, Azure or Google Cloud
- Experience with monitoring and alerting systems such as Nagios, Zabbix, or Prometheus
- Strong problem-solving and analytical skills
- Excellent communication and collaboration skills

We offer a competitive salary package, including benefits, and the opportunity to work on exciting products with a talented and passionate team. If you have a passion for optimizing system reliability and performance and are looking for an exciting new challenge, we want to hear from you