Site Reliability Engineer II
12 hours ago
Backblaze is the object storage leader in the open cloud movement, fueling customer success with cloud storage built purposefully to unlock budgets, unburden administrators, and unleash innovators. Together with our partners, we're helping customers break free from the restrictive, overpriced legacy solutions that hold them back, and blaze forward with the full power of the open cloud in their hands.
Founded in 2007, we scaled the business with less than $3 million in outside funding until 2021, when we did a traditional IPO on the Nasdaq stock exchange. Today, Backblaze generates over $100m in revenue and is the leading specialized storage cloud - managing over three billion gigabytes of data storage for 500K+ customers in 175+ countries, including businesses, developers, IT professionals, and individuals.
About the RoleWe are seeking a Site Reliability Engineer II (SRE II) to help ensure the stability, scalability, and reliability of our services and infrastructure. This role focuses on building automation, maintaining observability, and supporting incident response to keep customer-facing systems performing at their best. The SRE will collaborate with engineering, product, and operations teams to embed reliability practices into day-to-day development and operations while contributing to tools and processes that improve efficiency and reduce manual effort.
Key ResponsibilitiesService Reliability & Operations
Support the availability and durability of critical services across production environments.Monitor service health using SLIs, SLOs, and error budgets, and escalate issues when thresholds are at risk.Participate in on-call rotations, incident response, and post-incident reviews to drive service improvements.Follow established ITIL/OSS processes (incident, change, problem, and capacity management).Automation & Tooling
Develop automation for common operational tasks, reducing manual intervention and toil.Contribute to monitoring, logging, and alerting frameworks (e.g., Prometheus, Grafana, Catchpoint,ELK).Work with CI/CD pipelines, configuration management, and infrastructure as code tools (Terraform, Ansible, Jenkins).Write scripts (Bash, Python, Go, etc.) to improve system reliability and efficiency.Collaboration
Partner with engineering, product, and operations teams to support resilient system design and operations.Assist in capacity planning and disaster recovery exercises.Work with vendors and service providers to troubleshoot service issues and track SLA performance.Document systems, share learnings, and help grow a reliability-minded engineering culture.Continuous Improvement
Contribute to playbooks, runbooks, and operational documentation.Identify recurring issues and propose long-term improvements.Promote reliability-focused practices within development and operations teams.Qualifications
Education & Experience
Bachelor's degree in Computer Science, Engineering, or related field (or equivalent experience).2–4 years of experience in site reliability, systems engineering, or operations.Exposure to large-scale, production-grade systems.Technical Skills
Solid Linux systems administration and troubleshooting skills.Familiarity with service reliability concepts - monitoring, alerting, incident response, and root cause analysis.Proficiency in at least one scripting language (Python, Bash, or Go).Understanding of containers (Kubernetes, Docker) and microservices concepts.Knowledge of incident response and operational best practices.Preferred Attributes
Experience in a SaaS, service provider, or distributed systems environment.Familiarity with ITIL/OSS practices and SLO/SLA'sStrong problem-solving skills and willingness to learn new technologies.Experience with cloud platforms (AWS, GCP, or Azure).Ability to work independently, take ownership, and drive projects from problem discovery through resolution.
At this point, we hope you're feeling excited about the job description you're reading. Even if you don't meet every requirement, we still encourage you to apply. Learning, developing, and growing are key parts of our culture. We're eager to meet people who believe in our mission and can contribute to our team in various ways. We want people to feel comfortable expressing their true selves and to come, stay, and do their best work here.
At Backblaze, we value being fair and good to our customers, partners, and employees. That's why diversity, equity, and inclusion are at the core of our values. We are committed to fostering a workforce where all employees feel a sense of belonging regardless of race, ethnicity, nationality, gender, sexual orientation, age, religion, socio-economic status, ability, veteran status, and education. We believe that our dedication to cultivating a diverse workspace not only allows us to better serve our customers in over 175 countries, but further reinforces our commitment to doing the right thing. We are proud to be an Equal Opportunity Employer.
-
Site Reliability Engineer II
1 week ago
Bengaluru, Karnataka, India Microsoft Full time ₹ 8,00,000 - ₹ 24,00,000 per yearThe Production Engineering and Artificial Intelligence (AI) Group, part of the Linux Systems Group within Microsoft, plays a critical role in powering Azure Cloud. This team ensures that Azure operates with the latest version of Linux software at the highest levels of quality and performance, serving as the gatekeeper for production software. The team...
-
Site Reliability Engineer II
2 weeks ago
Bengaluru, Karnataka, India JPMorganChase Full time US$ 80,000 - US$ 1,20,000 per yearDescriptionPlay a key role in ensuring system reliability at one of the world's most iconic and largest financial institutions.As a Site Reliability Engineer II at JPMorgan Chase within the Corporate Technology, Finance Last Mile Reporting team, you will use technology to solve business problems and leverage software engineering best practices as we strive...
-
Site Reliability Engineer
1 week ago
Bengaluru, Karnataka, India d416f97b-2589-437a-8e64-3348cfe4008b Full time ₹ 12,00,000 - ₹ 36,00,000 per yearHiring Site Reliability EngineersExp : 2.5 +years [Excluding internship]Location : BangaloreApply Here : The engineer will work in the Reliability and Productivity Engineering team and is responsible for building industry standard large scale platforms to be utilised across FK that helps to significantly improve the reliability of systems and bring...
-
Site Reliability Engineer II
2 weeks ago
Bengaluru, Karnataka, India Microsoft Full time ₹ 12,00,000 - ₹ 24,00,000 per yearThe Production Engineering and Artificial Intelligence (AI) Group, part of the Linux Systems Group within Microsoft, plays a critical role in powering Azure Cloud. This team ensures that Azure operates with the latest version of Linux software at the highest levels of quality and performance, serving as the gatekeeper for production software. The team...
-
Site Reliability Engineer II
1 week ago
Bengaluru, Karnataka, India CME Group Full timeCME Group is the world's leading and most diverse derivatives marketplace, offering futures and options across a wide range of industries. We are seeking a passionate SRE to join our dynamic team.The Application Site Reliability Engineer II will help ensure the reliability and performance of our Markets trading and real-time post-trade systems; systems where...
-
Site Reliability Engineer II
6 days ago
Bengaluru, Karnataka, India CME Group Full time ₹ 8,00,000 - ₹ 12,00,000 per yearDescription:CME Group is seeking a SRE II to help, build, operate and scale systems in our Markets portfolio. Markets SREs work on products and applications related to CME's Globex trading platform. Our systems deliver an exceptional combination of low-latency performance and rock-solid reliability to seamlessly handle the world's busiest trading days.The...
-
Site Reliability Engineer
2 weeks ago
Bengaluru, Karnataka, India Ivanti Full time ₹ 8,00,000 - ₹ 24,00,000 per yearAre you ready to help elevate the reliability and performance of cloud services for global enterprise clients? Join Ivanti's growing Site Reliability Engineering (SRE) team and play a vital role in deploying, automating, and securing SaaS solutions trusted by organizations worldwide. If you thrive in a collaborative, fast-paced environment and love solving...
-
Site Reliability Engineering
1 week ago
Bengaluru, Karnataka, India Thakral One Full time US$ 60,000 - US$ 1,20,000 per yearCompany DescriptionThakral One, headquartered in Singapore, is a technology consulting and services company with a strong presence across Asia. The company specializes in technology-driven consulting, custom solution development, data analytics, and leveraging cloud capabilities to deliver enhanced decision support and practical outcomes. Collaborating...
-
Site Reliability Engineering
6 days ago
Bengaluru, Karnataka, India Viraaj HR Solutions Private Limited Full time ₹ 12,00,000 - ₹ 36,00,000 per yearSite Reliability Engineer (SRE)About The OpportunityA fast-growing organization in the Enterprise Cloud Infrastructure & SaaS sector delivering highly available, mission-critical services to enterprise customers. We are hiring an on-site Site Reliability Engineer in India to own reliability, automation, and operational excellence across cloud-native...
-
Site Reliability Engineer II
6 days ago
Bengaluru, Karnataka, India Sabre Full time ₹ 8,00,000 - ₹ 12,00,000 per yearSabre is a technology company that powers the global travel industry. By leveraging next-generation technology, we create global technology solutions that take on the biggest opportunities and solve the most complex challenges in travel.Positioned at the center of the travel, we shape the future by offering innovative advancements that pave the way for a...