Site Reliability Engineer

2 weeks ago


Gurugram, India Gemini Solutions Pvt Ltd Full time

We are looking for a candidate with 3-9 years of experience in DevOps/SRE. In this role, you will play a crucial part in shaping the firm's infrastructure reliability and efficiency by implementing robust Site Reliability Engineering practices. Your contribution will be pivotal in ensuring the availability, scalability, and performance of our systems and applications. Leveraging your strong technical skills and expertise in DevOps principles, you will work to enhance the reliability of our infrastructure and minimize downtime, enabling the organization to deliver high-quality software efficiently.



EXPERIENCE AND REQUIRED SKILL SETS


  • Experience in Data Engineering Operations: Proven experience in monitoring, maintaining, and troubleshooting data engineering systems and data pipelines, including data ingestion processes.
  • Data Pipelines & ETL Systems: Experience working with data pipelines and ETL systems (e.g., Informatica, Apache Airflow, AWS Glue). Understanding of the complete data lifecycle from ingestion to transformation and storage.
  • Monitoring & Alerting: Strong experience in setting up monitoring and alerting for data pipelines, using tools such as Prometheus, Grafana, AWS CloudWatch, Datadog, or Splunk.
  • Data Quality Frameworks: Experience in implementing and maintaining data quality checks, ensuring data integrity across ingestion pipelines and downstream systems.
  • Development Mindset: A strong background in Python development, with the ability to write scripts for automation, monitoring, and troubleshooting. Familiarity with Python libraries such as Pandas, NumPy, or PySpark for processing and validating data is a plus.
  • Incident Management & Troubleshooting: Expertise in incident response, root cause analysis (RCA), and managing production issues. Ability to diagnose and resolve issues in a timely manner.
  • Cloud Platforms (AWS): Familiarity with AWS-based data platforms and services (e.g., AWS S3, Redshift, Lambda, RDS). Experience with cloud monitoring tools is a plus.
  • Automation & Scripting: Proficiency in automating operational processes using Python or other scripting languages.
  • Version Control & CI/CD: Experience with Git for version control and CI/CD pipelines for automated testing and deployment of monitoring and operational scripts.


EDUCATION

Bachelor’s or master’s degree in Computer Science, Engineering, Software Engineering, or a relevant field.



  • Gurugram, India Bijak Full time

    As a Site Reliability Engineer I at Bijak, you will play a crucial role in ensuring the reliability, scalability, and performance of our infrastructure. You will collaborate with cross-functional teams to support and monitor applications in Production. This role offers an exciting opportunity to contribute to a cutting-edge technology environment and drive...


  • Gurugram, India Recro Full time

    Designation - SRE Engineer Location - Gurgaon Key Responsibilities: Design, develop, and maintain scalable, reliable, and secure infrastructure to support applications and services. Implement Site Reliability Engineering (SRE) best practices to ensure high availability and performance of systems. Collaborate with cross-functional teams to enhance system...


  • Gurugram, India Haachi Full time

    Job Title: Site Reliability Engineer (SRE) Location: Gurgaon, India Client: World-Leading High-Frequency Trading Firm About the Role: Be a part of a world-class trading operation! Our client, a global leader in high-frequency trading, is seeking a highly skilled and motivated Site Reliability Engineer (SRE) to join their dynamic team in Gurgaon. In this...


  • Gurugram, India Haachi Full time

    Job Title: Site Reliability Engineer (SRE) Location: Gurgaon, India Client: World-Leading High-Frequency Trading Firm About the Role: Be a part of a world-class trading operation! Our client, a global leader in high-frequency trading, is seeking a highly skilled and motivated Site Reliability Engineer (SRE) to join their dynamic team in Gurgaon. In this role,...


  • Gurugram, India Recro Full time

    Key Responsibilities: Design, develop, and maintain scalable, reliable, and secure infrastructure to support applications and services. Implement Site Reliability Engineering (SRE) best practices to ensure high availability and performance of systems. Collaborate with cross-functional teams to enhance system reliability, observability, and scalability. Automate...


  • Gurugram, India UnitedHealth Group Full time

    Optum is a global organization that delivers care, aided by technology to help millions of people live healthier lives. The work you do with our team will directly improve health outcomes by connecting people with the care, pharmacy benefits, data and resources they need to feel their best. Here, you will find a culture guided by diversity and inclusion,...


  • Gurugram, Haryana, India Realign LLC Full time

    Job Type: Full Time. Job Category: IT. Job Title: Site Reliability Engineer. Responsibilities and Duties: Implement and maintain automated monitoring and alerting systems to proactively identify and mitigate issues. Collaborate with development teams to design and implement scalable and reliable services. Troubleshoot and resolve...


  • Gurugram, India AMEX Full time

    You Lead the Way. We've Got Your Back. With the right backing, people and businesses have the power to progress in incredible ways. When you join Team Amex, you become part of a global and diverse community of colleagues with an unwavering commitment to back our customers, communities and each other. Here, you'll learn and grow as we help you create a career...

  • Senior Engineer

    2 months ago


    Gurugram, India Callisto Talent Solutions Private limited Full time

    AM - Site Reliability Engineer - F2F Interviews only. Our client is a leading global investment banking firm specializing in financial services such as advisory and capital raising, financing, investing, leasing, research, trading and hedging, banking, advice and intermediary services, and funds management. They are setting up a new SRE team based in Gurugram...


  • Gurugram, India Cvent Full time

    Site Reliability is about combining development and operations knowledge and skills to help make the organization better. Whether you have a development background and are interested in learning more about operations or are a DevOps/Systems Engineer who is interested in developing internal tools – Cvent SRE can benefit from your skillsets. Ultimately, we...

