
SRE, Observability System Administrator
3 weeks ago
The Observability System Administrator role at Toast sits within the Observability Enablement & Administration team, part of Site Reliability Engineering, which is responsible for overseeing Toast production services with a commitment to quality, reliability, and low latency. The Observability Enablement & Administration team sets the overall observability strategy, chooses the right tools and technologies, develops best practices, and provides guidance to other teams, while maintaining and administering the observability platform and log pipelines and governing their cost.
About this roll* (Responsibilities)
You will be responsible for the administration, maintenance, and enhancement of our observability platforms, ensuring optimal performance and availability for our critical security and business operations. In this role you will:
- Participate in observability architecture design, support, and platform management
- Gather and analyze metrics from operating systems and applications to give development teams observability insights
- Manage users and roles, monitor platform performance, and ensure security and high availability
- Automate operational toil for observability-focused administrative tasks
- Build and support automation for legal and compliance requirements
- Support end users with training and technical guidance on observability tools and capabilities
- Maintain accurate documentation of configurations, workflows, and procedures
- Manage data ingestion and parsing to ensure data integrity and availability
- Design and manage dashboards, reports, alerts, and visualizations
- Implement strategies to increase the reliability and performance of observability systems through on-call rotations and process optimization
- Use observability tools to diagnose application and infrastructure issues and incidents
Do you have the right ingredients*? (Requirements)
- Polyglot technologist/generalist with a thirst for learning
- Understanding of cloud and microservice architecture
- Experience with tools such as APM, RUM, Synthetics, Splunk, OTEL, log pipelines, SIEM, Terraform, etc.
- Automation/scripting experience with Go, Python, etc.
- Splunk power user/administrator experience preferred
- At least 2 years of industry experience in observability, with a focus on SRE or observability platform management
AI at Toast
At Toast we’re Hungry to Build and Learn. We believe learning new AI tools empowers us to build for our customers faster, more independently, and with higher quality. We provide these tools across all disciplines, from Engineering and Product to Sales and Support, and are inspired by how our Toasters are already driving real value with them. The people who thrive here are those who embrace changes that let us build more for our customers; it’s a core part of our culture.
Our Spread* of Total Rewards
We strive to provide competitive compensation and benefits programs that help to attract, retain, and motivate the best and brightest people in our industry. Our total rewards package goes beyond great earnings potential and provides the means to a healthy lifestyle with the flexibility to meet Toasters’ changing needs. Learn more about our benefits at .
*Bread puns encouraged but not required
Diversity, Equity, and Inclusion is Baked into our Recipe for Success
At Toast, our employees are our secret ingredient—when they thrive, we thrive. The restaurant industry is one of the most diverse, and we embrace that diversity with authenticity, inclusivity, respect, and humility. By embedding these principles into our culture and design, we create equitable opportunities for all and raise the bar in delivering exceptional experiences.
We Thrive Together
We embrace a hybrid work model that fosters in-person collaboration while valuing individual needs. Our goal is to build a strong culture of connection as we work together to empower the restaurant community. To learn more about how we work globally and regionally, check out: .
Apply today
Toast is committed to creating an accessible and inclusive hiring process. As part of this commitment, we strive to provide reasonable accommodations for persons with disabilities to enable them to access the hiring process. If you need an accommodation to access the job application or interview process, please contact .
------
For roles in the United States, it is unlawful in Massachusetts to require or administer a lie detector test as a condition of employment or continued employment. An employer who violates this law shall be subject to criminal penalties and civil liability.