Reliability Engineering Specialist

3 weeks ago


Pune, Maharashtra, India Eximietas Design Full time
Job Title: Reliability Engineering Specialist

We are seeking a highly skilled Reliability Engineering Specialist to join our Sanity Operations/SRE team. The ideal candidate will have strong background in Infrastructure Management, Automation, Troubleshooting, and System Administration.

Key Responsibilities:
  • Monitor and support critical high-performance, large-scale services running on a farm of 7000+ hosts.
  • E nsure more than 95% availability for our test farm.
  • Participate in triaging and resolution of complex test infra related issues.
  • Collaborate with other engineering teams to expose any defects and constraints.
  • Collaborate with software development and Data center teams to deliver reliable, robust, and high-performance capability of the underlying infra.
  • Mine and analyze data from multiple sources to generate reports for identifying automation and scaling opportunities.
  • Solve complex problems involving multi-site infrastructure scaling, and integrating GPU test suites to infrastructure harness etc.
  • Automation and performance tuning of regression test frameworks, creation of self-healing/automated recovery solutions for multi-geo regression farms.
  • Perform Root Cause Analysis and Implement Corrective Actions for any persistent and user impacting issues.
Requirements:
  • B.Tech or Equivalent degree in CS/CE.
  • 6+ years of proven experience working in a DevOps/SRE environment.
  • Strong TCL/Python, Unix scripting skills.
  • Proficient in configuration management tools like Chef/Puppet/Ansible.
  • Proficiency in using MySQL or equivalent NoSQL databases.
  • Working Experience with Perforce, GIT or any other version control system is vital.
  • Good solid understanding of CI/CD tools like Jenkins, etc.
  • Strong problem solving and analytical skills with keen interest for solving complex problems.
  • Proven Linux and Windows systems administration experience.
  • Experience with monitoring systems such as Zabbix and/or Nagios.
  • Overall understanding of build and packaging systems.
  • Ability to self-manage, show leadership, and communicate well.

The estimated salary for this role is around ₹1200000 - ₹1500000 per annum, depending on experience and qualifications.

About Eximietas Design

Eximietas Design is a technology services and solutions company headquartered in San Jose, CA with a global footprint that extends to Bangalore, Chennai, and Bhubaneswar in India. We specialize in Cloud Computing, Cybersecurity, VLSI, Embedded Software, and Artificial Intelligence, and are dedicated to empowering businesses with cutting-edge innovations, ensuring their digital future is secure, efficient, and poised for growth.



  • Pune, Maharashtra, India Collabera Full time

    Job Overview: As a Reliability Engineering Specialist at Collabera in Pune, India, you will focus on building and maintaining highly reliable, scalable, and efficient systems. The role requires expertise in cloud infrastructure, observability, automation, and performance testing. Responsibilities:Implement SRE best practices, focusing on ensuring system...


  • Pune, Maharashtra, India Hansen Tehcnologies Full time

    About the Job:Hansen Technologies seeks an Infrastructure Reliability Specialist to join our team in Pune. The successful candidate will design and implement reliable systems for our clients' EMEA operations, focusing on scalability, performance, and high availability.This role involves triage of incidents, root cause analysis, and ensuring end-to-end...


  • Pune, Maharashtra, India Capgemini Engineering Full time

    Capgemini Engineering is a leading engineering services company that empowers its clients to achieve greater success. We are currently seeking a highly skilled Teamcenter Engineering Specialist to join our team.The estimated salary for this position is between $120,000 and $180,000 per year, depending on location and experience.Job DescriptionWe are looking...


  • Pune, Maharashtra, India Synechron Full time

    Job DescriptionWe are seeking an experienced Site Reliability Engineer to join our team at Synechron. The successful candidate will be responsible for ensuring the reliability, scalability, and performance of our infrastructure.The ideal candidate will have a strong background in software engineering, with hands-on experience with tools such as coding, CI/CD...


  • Pune, Maharashtra, India Neerinfo Solutions Full time

    Job DescriptionWe are seeking a talented Cloud Reliability Specialist to join our team at Neerinfo Solutions.The ideal candidate will have a strong background in application support, microservices architecture, and automation, with hands-on experience in monitoring and dashboarding tools like Splunk and AppDynamics. We offer a competitive salary of $120,000...


  • Pune, Maharashtra, India Alp Consulting Ltd. Full time

    Job OverviewWe are seeking an experienced Database Reliability Engineer to join our team at Alp Consulting Ltd. in [location]. This role will involve designing and supporting database environments for high-volume e-commerce and back-office applications.About the RoleThis is a full-time position that requires strong problem-solving skills, experience with...


  • Pune, Maharashtra, India HyrEzy Talent Solutions Full time

    About the RoleAs a Site Reliability Engineer, you will play a pivotal role in ensuring the health and deployment of servers and devices. With a strong background in Linux, networking, virtualization, and Microsoft tools, you will be responsible for troubleshooting complex technical issues, implementing break fixes, and analyzing applications to identify...


  • Pune, Maharashtra, India Eximietas Design Full time

    At Eximietas, a global technology services and solutions company, we specialize in empowering businesses with cutting-edge innovations. Our teams thrive on innovation and inclusivity, fostering a vibrant and collaborative work culture rooted in the spirit of Silicon Valley's tech dynamism.Job Overview: We are seeking an elite Infrastructure Reliability...


  • Pune, Maharashtra, India Synechron Full time

    About SynechronWe are a leading global digital consulting firm providing innovative technology solutions for business.Our StoryWe began our journey in 2001 as a small team of technology specialists and have since grown to 14,500+ people across 58 offices in 21 countries.Customized SolutionsWe offer end-to-end solutions that drive business value and growth...


  • Pune, Maharashtra, India Lifelancer Full time

    About the Role:We are seeking a Reliable Biotechnology Support Specialist to join our team in Pune, India. In this role, you will be part of the India Equipment services organization and work remotely with customers across various biopharmaceutical industries.Key Responsibilities:You will accompany service engineers on pre-installation work, equipment demos,...


  • Pune, Maharashtra, India Avant Garde Corporate Services Full time

    About Avant Garde Corporate ServicesWe are a dynamic organization that values expertise and teamwork. We are looking for a talented System Reliability Specialist to join our team.Job OverviewThe successful candidate can expect a salary of $125,000 per year, along with a comprehensive benefits package and opportunities for professional growth.Key...


  • Pune, Maharashtra, India Juniper Consultancy Services Full time

    Job OverviewJuniper Consultancy Services is seeking a highly skilled Cloud Reliability Engineer to join our team. As a Cloud Reliability Engineer, you will be responsible for driving reliability and resilience assessments, defining/building strategies, and creating innovative solutions to meet business needs.ResponsibilitiesEnable adoption and maturity of...


  • Pune, Maharashtra, India Synechron Full time

    About the Role:Synechron is a global consulting firm that combines creativity and innovative technology to deliver industry-leading digital solutions. We are seeking an experienced System Engineer - SRE who can take ownership of initiatives and assets and provide high-quality customer service.As a key member of our infrastructure team, you will be...


  • Pune, Maharashtra, India Synechron Full time

    Job OverviewWe are seeking a highly skilled Cloud Engineering and Automation Specialist to join our team at Synechron. As an SRE, you will play a critical role in ensuring the reliability, scalability, and performance of our cloud-based infrastructure.


  • Pune, Maharashtra, India HealthEdge Full time

    OverviewWelcome to HealthEdge, a leading provider of innovative healthcare solutions. We are seeking an experienced Senior Application Support Engineer to join our team and play a critical role in ensuring the reliability and performance of our applications.About the RoleThe successful candidate will be responsible for providing expert-level technical...


  • Pune, Maharashtra, India Avant Garde Corporate Services Full time

    Job OverviewAvant Garde Corporate Services is seeking an experienced Systems Reliability Engineer to join our team. As a key member of our infrastructure team, you will play a critical role in ensuring the reliability and performance of our production environment.About YouWe are looking for someone with a strong background in Linux, Shell Scripting,...


  • Pune, Maharashtra, India Hansen Tehcnologies Full time

    About the Role:We are seeking an experienced Site Reliability Engineer to join Hansen Technologies' Pune team. As a key player, you will ensure the reliability, performance, and scalability of our clients' systems across EMEA regions.As an SRE, you will combine technical expertise with creative problem-solving and exceptional customer relationship skills....


  • Pune, Maharashtra, India Capgemini Engineering Full time

    Capgemini Engineering is a leader in engineering and R&D services. We are seeking a skilled Digital Transformation Specialist to join our team.The estimated salary for this position is $120,000 - $180,000 per year, depending on location and experience.We are looking for an experienced professional with 8+ years of industry experience in mobile application...


  • Pune, Maharashtra, India Capgemini Engineering Full time

    About the RoleAs a Senior SAP PP Integration Specialist at Capgemini Engineering, you will play a crucial role in delivering large-scale, complex projects that combine processes with technology to help our clients from Steel / Metals industry achieve their business objectives.This position requires strong analytical and problem-solving skills, as well as...


  • Pune, Maharashtra, India Avant Garde Corporate Services Full time

    Job Description:We are looking for a highly skilled Reliability Engineering Lead to join our team. The successful candidate will be responsible for designing, developing, and standardizing Monitoring and Alerting mechanism for the supported applications.The ideal candidate will have experience in Incident Response, CI/CD pipeline, ITSM activities, and DevOps...