Associate Site Reliability Engineer

1 day ago


Gurgaon, Haryana, India Acquia Full time US$ 90,000 - US$ 1,20,000 per year

Job Title:  Associate Site Reliability Engineer 

Acquia is the open source digital experience company. We provide the world's most ambitious brands with technology that allows them to embrace innovation and create customer moments that matter. At Acquia we believe in the power of community and collaboration – giving our customers the freedom to build tomorrow on their terms.

Headquartered in Boston, we have been named as one of North America's fastest growing software companies as reported by Deloitte and Inc. Magazine, and have been rated a leader by the analyst community and named one of the Best Places to Work by the Boston Business Journal. We are Acquia. We are building for the future of the web, and we want you to be a part of it.

Site Reliability Engineering (SRE) is what you get when you treat operations as if it's a software problem. Our mission is to improve, maintain, and provide for the software and systems behind all of Acquia's services – with an ever-watchful eye on their availability, latency, performance, and capacity.

As an Associate SRE, you will be working on ensuring the reliability and performance of our data infrastructure, including Snowflake data warehouses, data pipelines on Kubernetes, and analytics systems, while coding in Python and exploring AI-powered automation to enhance data platform reliability.

As an Associate Site Reliability Engineer, you will…

  • Work in an Agile team designing, writing and delivering software to improve the availability, scalability, performance, and efficiency of Acquia's data infrastructure and pipelines
  • Develop and maintain monitoring, alerting, and observability solutions for Snowflake data warehouses, Dagster pipelines, and data processing workflows
  • Build automation tools using Python and infrastructure-as-code technologies to ensure reliable data pipeline operations and reduce manual overhead
  • Explore and implement AI-driven solutions for data quality monitoring, pipeline anomaly detection, and automated data infrastructure scaling
  • Collaborate with Data Engineering teams to implement reliability best practices for data pipelines including SLI/SLO definition for data freshness and quality
  • Monitor and optimize Snowflake performance, cost efficiency, and resource utilization across multiple environments
  • Participate in on-call rotations for data infrastructure incidents, contributing to post-incident reviews and reliability improvements
  • Build and enhance CI/CD pipelines for data pipeline deployments with automated testing and rollback capabilities
  • Ensure data infrastructure security, access controls, and compliance monitoring across all data systems
  • Contribute to the evolution of our data platform SRE practices and tooling standards

What you'll need to be successful:

  • 1-3 years of experience in SRE or DevOps Engineer and data engineering roles
  • Experience with data warehouse technologies (Snowflake preferred) and understanding of data pipeline architectures
  • Familiar with container based products like docker and kubernetes
  • Programming or Scripting exp in Python for automation, data pipeline tooling, and system integration
  • Experience with Infrastructure as Code tools (Terraform, Ansible) for data infrastructure management
  • Hands-on experience with data monitoring and observability tools (DataDog, Snowflake monitoring, or similar data-focused tools)
  • Knowledge of cloud platforms (AWS, GCP, Azure) with focus on their data services (S3, BigQuery, etc.)
  • Understanding of CI/CD principles for data pipeline deployments and data platform operations
  • Strong problem-solving skills and systematic approach to troubleshooting data systems and pipeline issues
  • Ability to work collaboratively with Data Engineering teams and communicate effectively about data reliability
  • BS degree in Computer Science or related technical field, or equivalent practical experience

Extra credit if you:

  • Have experience implementing SLIs (Service Level Indicators) and SLOs (Service Level Objectives and error budgets for data pipelines and data freshness/quality metrics
  • Interest in AI/ML applications for data quality monitoring, pipeline automation, and intelligent data operations
  • Interest in exploring MLOps infrastructure and AI-powered data platform reliability challenges
  • Knowledge of data workflow orchestration tools (Dagster, Airflow, Prefect) and their operational challenges
  • Experience with data quality frameworks and automated data validation systems
  • Familiarity with Snowflake administration, performance tuning, and cost optimization
  • Contributions to open-source data infrastructure or data reliability tools and communities
  • Understanding of data governance, lineage tracking, and compliance automation for data systems
  • Experience with incident management for data pipeline failures and data quality issues

We are an organization that embraces innovation and the potential of AI to enhance our processes and improve our work. We are always looking for individuals who are open to learning new technologies and collaborating with AI tools to achieve our goals.

Acquia is proud to provide best-in-class benefits to help our employees and their families maintain a healthy body and mind. Core Benefits include: competitive healthcare coverage, wellness programs, take it when you need it time off, parental leave, recognition programs, and much more

Individuals seeking employment at Acquia are considered without regard to race, color, religion, caste, creed, national origin, age, sex, marital status, ancestry, physical or mental disability, veteran status, gender identity, or sexual orientation. Whatever you answer will not be considered in the hiring process or thereafter.



  • Gurgaon, Haryana, India RBS Full time US$ 1,50,000 - US$ 2,00,000 per year

    Join us as a Site Reliability EngineerIn this key role, you'll improve, drive, and embed non-functional and operational characteristics such as availability, performance, efficiency, change management, monitoring, security, incident response, and capacity planning of our products and servicesYou'll enjoy significant stakeholder interaction, working in...


  • Gurgaon, Haryana, India RBS Full time ₹ 9,00,000 - ₹ 12,00,000 per year

    Join us as a Site Reliability EngineerIn this key role, you'll support the improvement of non-functional and operational characteristics such as availability, performance, efficiency, change management, monitoring, security, incident response, and capacity planning of our products and servicesYou'll enjoy significant stakeholder interaction, working in...


  • Gurgaon, Haryana, India FxConsulting Full time

    Job Title : Site Reliability EngineerLocation : Gurgaon, IndiaExperience : 6 to 9 yearsEmployment Type : the Role :We are seeking an experienced Site Reliability Engineer (SRE) to join our high-performance infrastructure and operations team. As an SRE, you will be responsible for ensuring the availability, scalability, performance, and reliability of our...


  • Gurgaon, Haryana, India Impronics Technologies Full time

    Job DescriptionRequired Skills & Experience:- 8+ years of overall experience in infrastructure engineering or SRE roles, with at least 3+ years in thepayments/fintech domain.- Strong understanding ofpayment protocols(UPI, IMPS, RTGS, NEFT, SWIFT, etc.) and transaction processing systems.- Proven expertise inLinux systems administration, cloud platforms (AWS,...


  • Gurgaon, Haryana, India RBS Full time ₹ 15,00,000 - ₹ 20,00,000 per year

    Join us as a Site Reliability EngineerIn this key role, you'll improve, drive, and embed non-functional and operational characteristics such as availability, performance, efficiency, change management, monitoring, security, incident response, and capacity planning of our products and servicesYou'll enjoy significant stakeholder interaction, working in...


  • Gurgaon, Haryana, India Edge Executive Search Full time

    Job DescriptionWe are seeking a Site Reliability Engineer to lead capacity management, operational support, and incident resolution for our OTC Derivative and FX platforms. This role requires a professional with a background in both SRE and application support, who can collaborate with development and infrastructure teams to ensure the reliability and...


  • Gurgaon, Haryana, India TheThreeAcross Full time

    Job Description : Role : SRE/ Devops Support Engineer TradeExperience : 4-9 YearsLocation : GurugramShift Timings : 9 to 5 and 12 : 00PM to 8 : 00PMJob Description : As a key team member, you will combine the responsibilities of an Application Support Engineer and Site Reliability Engineer (SRE) to ensure the stability, reliability, and performance of...


  • Gurgaon, Haryana, India BT Group Full time ₹ 5,00,000 - ₹ 10,00,000 per year

    Why this job matters The Site Reliability Engineering Associate 3 assists with a range of routine activities in the service performance, reliability and availability that internal and external customers expect. What you'll be doing 1. Assists with routine activities in the implementation of new software development life cycle automation tools, frameworks,...


  • Gurgaon, Haryana, India beBeeReliability Full time ₹ 15,00,000 - ₹ 20,00,000

    Job Title: System Reliability EngineerWe are seeking a highly skilled System Reliability Engineer to lead capacity management, operational support, and incident resolution for our platforms. This role requires a professional with a background in both SRE and application support, who can collaborate with development and infrastructure teams to ensure the...


  • Gurgaon, Haryana, India Siemens Full time

    Looking for a challenging role If you really want to make a difference - make it with us Siemens Energy is focused on helping customers navigate the worlds most pressing energy problems As a world leader in developing and producing the most advanced engineering technologies we improve lives and further human achievements worldwide while also protecting...