Site Reliability Engineer

3 weeks ago


Gurugram, India Gemini Solutions Pvt Ltd Full time

Position Summary

In this role, you will play a crucial part in shaping the firm's infrastructure reliability and efficiency by implementing robust Site Reliability Engineering practices. Your contribution will be pivotal in ensuring the availability, scalability, and performance of our systems and applications. Leveraging your strong technical skills and expertise in DevOps principles, you will work towards enhancing the reliability of our infrastructure and minimizing downtime, thus enabling the organization to deliver high-quality software with maximum efficiency.

Responsibilities

  • Ensure 24/7 uptime and stability of production systems
  • Investigate and troubleshoot production issues
  • Collaborate with developers to optimize system performance
  • Participate in on-call rotation to provide 24/7 support for critical systems
  • Build automation and enhancements that reduce manual processes and intervention

Requirements

  • 5+ years of relevant experience in an SRE, Production Support, or Product Support role, with a track record of implementing SRE practices
  • Basic understanding of cloud solutions from providers such as AWS or Azure
  • Basic to intermediate scripting knowledge in Bash, Python, or PowerShell
  • Good presentation, communication and interpersonal skills with the ability to collaborate effectively with cross-functional teams and stakeholders across different countries and cultures.
  • Good problem solving and troubleshooting skills
  • Continuous learning mindset and willingness to adapt to new technologies and industry trends.
  • Good understanding of Linux operating system commands and SQL (ability to write and analyze queries and to derive or build the required information)
  • In-depth knowledge of the trading life cycle: The candidate should possess a comprehensive understanding of the trading life cycle, including order management, trade execution, settlement, and post-trade processes. Familiarity with financial products such as Equities, Derivatives, Currencies, Commodities, and FX is a plus.
  • Incident and problem management expertise: The candidate must demonstrate strong problem-solving skills and the ability to manage incidents frequently and efficiently within a fast-paced trading environment. This includes identifying, analyzing, and resolving issues related to trading systems and processes, as well as collaborating with cross-functional teams to implement long-term solutions and improve operational efficiency.
  • Good understanding of the following tools:

    (a) Orchestration – Autosys, Airflow, or Cron

    (b) Monitoring & Logging – PagerDuty, Prometheus & Grafana or Datadog, Splunk

    (c) Project Management / ITSM – ServiceNow (basic ability to navigate and to create change tickets and incidents), Jira (basic ability to create tickets and filter your work)

Qualifications

Bachelor's or master's degree in Computer Science, Engineering, Software Engineering, or a related field



  • Gurugram, India Freecharge Full time

    Job Title: Site Reliability Engineer (SRE), 3 Years Experience. About the Role: We are looking for a Site Reliability Engineer (SRE) with 3 years of experience to join our team. You will be responsible for ensuring the reliability, scalability, and efficiency of our production systems. This role requires a balance of software engineering, system administration,...


  • Gurugram, Pune, India Prerna Malhotra (Proprietor Of Praxis Hr Solutions) Full time

    Job Description: We are seeking a skilled Site Reliability Engineer (SRE) to join our dynamic team in India. The SRE will be responsible for ensuring the reliability, availability, and performance of our applications and services. This role requires a combination of software engineering and systems engineering to build and maintain scalable and...


  • Gurugram, Gurugram, India Impronics Technologies Full time

    Job Description: We are seeking a seasoned Site Reliability Engineer (SRE) with a solid background in payment systems and high-availability architectures. The ideal candidate will have hands-on experience managing large-scale, distributed systems in production, with a deep understanding of reliability, scalability, and performance tuning in the financial...


  • Gurugram, Hyderabad, India Talent Hired-the Job Store Full time ₹ 15,00,000 - ₹ 25,00,000 per year

    9+ years of experience in a Site Reliability Engineering or DevOps role. Hands-on experience with Dynatrace and Splunk for monitoring, logging, and alerting. Strong proficiency in Terraform for infrastructure provisioning (AWS, Azure, or GCP). Experience with CI/CD pipelines and automation tools (e.g., Jenkins, GitLab CI, Azure DevOps). Deep understanding of...


  • Gurugram, India Leapwork Full time

    At Leapwork, our vision is to break down the barriers between humans and computers through the world's most accessible automation platform. We are the leading global AI-powered visual test automation solution, enabling some of the world's largest enterprises to adopt, scale, and maintain automation – in under 30 days. In today's environment, where...


  • Gurugram, India S&P Global Market Intelligence Full time

    About the Role: OSTTRA India. The Role: Site Reliability Engineer. The Team: SRE is a global team that provides technical support across the suite of OSTTRA products. The SRE team works closely with a highly competent Technical Operation Centre (TOC), Development and Infrastructure teams to deliver proactive tasks to improve the supportability of our...


  • Gurugram, Haryana, India Realign LLC Full time

    Job Type: Full Time. Job Category: IT. Job Title: Site Reliability Engineer. Responsibilities and Duties: Implement and maintain automated monitoring and alerting systems to proactively identify and mitigate issues. Collaborate with development teams to design and implement scalable and reliable services. Troubleshoot and resolve...


  • Bengaluru, Chennai, Gurugram, India Natwest Digitalx Full time ₹ 1,04,000 - ₹ 1,30,878 per year

    Join us as a Site Reliability Engineer. In this key role, you'll improve, drive, and embed non-functional and operational characteristics such as availability, performance, efficiency, change management, monitoring, security, incident response, and capacity planning of our products and services. You'll enjoy significant stakeholder interaction, working in...


  • Gurugram, India GSPANN Full time

    Hiring for SRE. Experience: 6+ years. Notice period: immediate to 15 days. About the Role: We are seeking a skilled and passionate Observability Engineer (SRE) to join our team and drive reliability, performance, and visibility across our infrastructure and applications. You will play a key role in designing and implementing observability solutions, improving system...


  • Gurugram, India TheThreeAcross Full time

    Role: SRE/DevOps Support Engineer Trade. Experience: 4-9 Years. Location: Gurugram. Shift Timings: 9 to 5 and 12:00 PM to 8:00 PM. Job Description: As a key team member, you will combine the responsibilities of an Application Support Engineer and Site Reliability Engineer (SRE) to ensure the stability, reliability, and performance of...