Site Reliability Engineer- Logging Metrics

4 weeks ago


bangalore, India Athenahealth Full time

Join us as we work to create a thriving ecosystem that delivers accessible, high-quality, and sustainable healthcare for all.

Our services are highly visible and used every day by teams all across Athena to develop, monitor, troubleshoot and scale their web services. The team is responsible for collecting and hosting large volumes of metrics and log data; we do this by running large scale distributed, fault tolerant systems to collect and host all this data. Our team has a big impact on productivity of hundreds of developers all across athena

In a typical week, our engineers work on problems ranging from tuning performance, scaling services to debugging hard problems. They will introduce new features and partner with development teams to solve their pressing monitoring and logging issues. We work in an agile, sprint-based schedule running daily standups and work in both the private and public cloud

Job Responsibilities

Automate deployment of Logging and Metrics services using configuration management with puppet Work on production incidents and resolve them using your Linux administration and engineering skills Develop metrics dashboards, alert criteria to monitor and scale services Work on weeklong on call in rotation alongside other team members Support development teams to refine their logging and metrics collection

Typical Qualifications

Hands on experience with configuration management using Puppet, Chef or Ansible Sysadmin, devops skills for running services in Linux environment Experience operating production services in Linux environment and serving on call rotations Intermediate level or greater experience with multiples of: Bash scripting, Ruby, Python, Ruby, Perl, C++, Java, Golang Develop deployment templates for services in the public cloud using cloudformation, terraform Ability to be flexible and change with environment and business demands

Additional Qualifications

Solid understanding of Linux operating system and commands Experience managing large server fleets in production Experience with debugging linux issues Experience with performance analysis of services Experience with relevant technologies: fluentd, kafka, elasticsearch, graphite, clickhouse, terraform, prometheus, grafana, graylog, AWS cloudformation, docker containers, jenkins, load balancers, git. Experience with tcpdump, wireshark, or other protocol analyzers

About athenahealth

Here’s ourvision:  To create a thriving ecosystem that delivers accessible, high-quality, and sustainable healthcare for all.

What’s unique about our locations?
From an historic, 19thcentury arsenal to a converted, landmark power plant,allofathenahealth’s offices were carefully chosen to represent our innovative spirit and promote the most positive and productive work environment for our teams. Our10offices across the United States and India —plus numerous remote employees —all work to modernize the healthcare experience, together.
Our company culture might be our best feature.
We don't take ourselves too seriously. But our work? That’s another story.athenahealth develops andimplements products and services that support US healthcare: It’sour chance to create healthier futures for ourselves, for our family and friends, for everyone.

Our vibrant and talented employees — orathenistas, as we call ourselves — spark the innovation and passion needed to accomplishour goal. We continue to expand our workforce with amazing people who bring diverse backgrounds, experiences, and perspectives at every level, and foster an environment where every athenista feels comfortable bringing theirbestselves to work.

Our size makes a difference, too: We are small enoughthatyourindividual contributionswill stand out— butlarge enoughto grow your career with ourresources and established business stability.
Giving back is integral to our culture. OurathenaGivesplatform strives tosupport food security, expand access to high-quality healthcare for all, and support STEM education to develop providers and technologists who will provide access to high-quality healthcare for all in the future. As part of the evolution of athenahealth’sCorporate Social Responsibility(CSR)program, we’ve selected nonprofit partners that align with our purpose and let us foster long-term partnerships for charitable giving, employee volunteerism, insight sharing, collaboration, and cross-team engagement.

What can we do for you?
Along with health and financial benefits,athenistasenjoy perks specific to eachlocation, including commuter support, employee assistance programs, tuition assistance,employeeresource groups, and collaborative workspaces — some offices even welcome dogs.

In addition to our traditional benefits and perks, we sponsor events throughout the year, includingbook clubs, external speakers, and hackathons. And weprovideathenistaswithacompany culturebased onlearning,the support of anengaged team,andan inclusive environment where all employees are valued. 

We alsoencourage a better work-life balance forathenistaswith our flexibility. Whilewe know in-office collaboration is critical to our vision, we recognize that not all work needs to be done within an office environment, full-time. With consistent communication and digital collaboration tools, athenahealth enables employees to find a balance that feels fulfilling and productive for each individual situation. 



  • bangalore, India ViewSonic Full time

    Job Requirements:Bachelor's degree in Computer Science, Engineering, or a related field.1+ year of experience in a relevant role, such as Site Reliability Engineer, DevOps Engineer, or similar, is preferred but not mandatory.Basic understanding of AWS solutions including EC2, S3, CloudWatch, Lambda, and RDS.Interest and understanding of Platform Engineering...


  • bangalore, India TalentOla Full time

    Monitoring and Automation: Proactively monitor software systems to prevent incidents and automate routine tasks. 2. Effective Monitoring: Build monitoring systems that alert based on symptoms rather than outages. 3. Application Performance Monitoring (APM): Implement and utilize APM tools such as New Relic or Dynatrace to monitor application...


  • bangalore, India Cyitechsearch Full time

    We are hiring for Site Reliability Engineer Skills : - Develop and provide operational support for full-stack software applications.- Relevant industry certifications, such as through the Site Reliability Engineering (SRE) Foundation.- Five years' experience as a site reliability engineer or similar role.- Collaborate with development operations staff to...


  • Bangalore, India Cyitechsearch Full time

    We are hiring for Site Reliability Engineer Skills : - Develop and provide operational support for full-stack software applications.- Relevant industry certifications, such as through the Site Reliability Engineering (SRE) Foundation.- Five years' experience as a site reliability engineer or similar role.- Collaborate with development operations staff...


  • Bangalore, Karnataka, India Cyitechsearch Full time

    We are hiring for Site Reliability Engineer Skills : - Develop and provide operational support for full-stack software applications.- Relevant industry certifications, such as through the Site Reliability Engineering (SRE) Foundation.- Five years' experience as a site reliability engineer or similar role.- Collaborate with development operations staff to...


  • bangalore, India Kunato Full time

    Site Reliability Engineer (SRE) - Python/GolangJob Description:We are seeking a highly skilled and passionate Site Reliability Engineer (SRE) to join our technology team. The ideal candidate will possess strong programming skills with expertise in Python, Golang, or both. This role is pivotal in ensuring the high availability, performance, and security of...


  • bangalore, India Ensono Full time

    About RoleEnsono is continuing its growth and building a cloud-native managed service offering for our clients. We are looking for energetic and skilled remote Site Reliability Engineers to join us on this exciting new journey. As a Site Reliability Engineer, you and your team will be responsible for between four and ten of Ensono cloud-native managed...


  • bangalore, India Qure.ai Full time

    About the jobJob Title: Site Reliability EngineerDepartment: EngineeringLocation: BangaloreYears of experience: 2-5 yearsType: Full Time EmploymentAbout Qure.ai:Qure.ai is one of the fastest-growing startups in India, which develops Artificial Intelligence enabled products and platforms for healthcare diagnostics. We create cutting-edge solutions that...


  • bangalore, India Kunato Full time

    Site Reliability Engineer (SRE) - Python/Golang Job Description: We are seeking a highly skilled and passionate Site Reliability Engineer (SRE) to join our technology team. The ideal candidate will possess strong programming skills with expertise in Python, Golang, or both. This role is pivotal in ensuring the high availability, performance, and security...


  • bangalore, India Ensono Full time

    About Role Ensono is continuing its growth and building a cloud-native managed service offering for our clients. We are looking for energetic and skilled remote Site Reliability Engineers to join us on this exciting new journey. As a Site Reliability Engineer, you and your team will be responsible for between four and ten of Ensono cloud-native managed...


  • bangalore, India Encora Inc. Full time

    Position: Site Reliability Engineer Location: Bangalore Experience: 4+ Years  Job Mode: Full-time Work Mode: Remote Responsibilities and Duties Collaborate with cross-functional teams to design, implement, and maintain reliable and scalable infrastructure solutions on the Azure cloud platform. Implement and maintain monitoring and...


  • Bangalore, India TERRAGIG LLP Full time

    Role : Site Reliability EngineerExperience : 5+ Years Work Model : Remote / Contract 3 years Skills :- Develop and provide operational support for full-stack software applications.- Relevant industry certifications, such as through the Site Reliability Engineering (SRE) Foundation.- Five years' experience as a site reliability engineer or similar role.-...


  • Bangalore, Karnataka, India TERRAGIG LLP Full time

    Role : Site Reliability EngineerExperience : 5+ Years Work Model : Remote / Contract 3 years Skills :- Develop and provide operational support for full-stack software applications.- Relevant industry certifications, such as through the Site Reliability Engineering (SRE) Foundation.- Five years' experience as a site reliability engineer or similar role.-...


  • bangalore, India Flipkart Full time

    At Flipkart, Site reliability engineers (SREs) combine engineering experience and an innate drive to improve existing systems and processes, with the creativity to develop novel solutions to evolving challenges that can hamper reliability, performance and availability of critical platform services and applications. SRE builds solutions (Process + tools) to...


  • bangalore, India Meesho Full time

    Site Reliability Engineer II Bangalore, Karnataka Tech Infrastructure /Full Time Employee /On-Site About the Team : When 5% of Indian households shop with us, its important to build resilient systems to manage millions of orders every day. Weve done this with zero downtime! Sounds impossible? Well, thats the kind of Engineering muscle that has helped...


  • bangalore, India First American (India) Full time

    The Role:A SRE Manager is ultimately responsible for system reliability, developer productivity and reducing time to market by striving to reduce technical debt of the services your SRE team supports. We seek managers who are passionate about site reliability to influence and drive the strategic SRE mission.As a Site Reliability Engineering Manager working...


  • bangalore, India Qure.ai Full time

    About the job Job Title: Site Reliability Engineer Department: Engineering Location: Bangalore Years of experience: 2-5 years Type: Full Time Employment About Qure.ai: Qure.ai is one of the fastest-growing startups in India, which develops Artificial Intelligence enabled products and platforms for healthcare diagnostics. We create cutting-edge...

  • Engineering Director

    4 weeks ago


    Bangalore, India CareerNet Technologies Full time

    Job Description : Site Reliability Engineers (SREs) at Coupang is a mission-critical role that combines software and system engineering to build, run, and scale our complex, large-scale ecommerce systems. As part of the Site Reliability Engineering team, you will be responsible for ensuring all our customer-facing services are healthy, monitored, automated,...

  • Engineering Director

    4 weeks ago


    bangalore, India CareerNet Technologies Full time

    Job Description : Site Reliability Engineers (SREs) at Coupang is a mission-critical role that combines software and system engineering to build, run, and scale our complex, large-scale ecommerce systems. As part of the Site Reliability Engineering team, you will be responsible for ensuring all our customer-facing services are healthy, monitored, automated,...

  • Engineering Director

    1 month ago


    Bangalore, Karnataka, India CareerNet Technologies Full time

    Job Description :Site Reliability Engineers (SREs) at Coupang is a mission-critical role that combines software and system engineering to build, run, and scale our complex, large-scale ecommerce systems. As part of the Site Reliability Engineering team, you will be responsible for ensuring all our customer-facing services are healthy, monitored, automated,...