Senior Site Reliability Engineer
4 weeks ago
This role will own the health and uptime of our mission-critical application , Cloud infrastructure , database system , and monitoring infrastructure .
About Us At BQE, our mission is to transform the operational landscape of professional services firms, empowering them to achieve more and serve their customers better.
These firms play a crucial role in building infrastructure that significantly impacts global progress.
BQE CORE serves as the operational backbone for these firms, providing an all-in-one Saa S solution.
Our platform enables them to efficiently manage projects, improve budget tracking and profitability, and streamline processes through automation.
With a robust customer base, we are on a trajectory of continuous growth, constantly innovating to meet the evolving needs of our customers and the industries they influence.
Why Join Us Work with a modern tech stack in a high-impact reliability role.
Be a key part of our Cloud Ops and App Reliability strategy .
A collaborative and supportive engineering culture.
Responsibilities: Ensure application uptime , performance, and scalability.
Own incident management , including on-call rotations, root cause analysis, and incident reviews.
Manage and monitor MS SQL Server clusters and high-availability configurations.
Set up and improve monitoring, alerting, and observability using New Relic, Logz.io, Cloud Watch , and other tools.
Proactively identify system bottlenecks and improve system reliability and automation.
Define and improve SLOs/SLAs across services.
Drive disaster recovery testing and availability simulations.
Collaborate with Cloud Ops and Dev Ops for infrastructure automation and enhancements.
Work with Jira and JSM to manage operational tasks, incidents, and changes.
Qualifications & Experience: Bachelor's degree in computer science, Engineering, or related field (or equivalent experience).
5-8 years of experience in Site Reliability Engineering, Cloud Ops, Dev Ops or related roles.
Must Have Skills : Certifications in AWS, Microsoft, Windows, SQL Server, or SRE disciplines .
Exposure to New Relic APM, Ia C automation is a plus.
Experience working in a 24x7 on-call rotation .
Strong knowledge of Windows OS eco-system , IIS , MS SQL Server administration, clustering, performance tuning, and failover.
Deep experience with monitoring/logging tools like New Relic, Logz.io, AWS Cloud Watch .
Experience with AWS (EC2, ASG, Cloud Watch, Cloud Trail, VPC) and infrastructure management.
Good understanding of networking , DNS , load balancing , and security principles .
Proficient in scripting languages such as Power Shell, Python .
Strong understanding of incident response, change management, postmortem culture .
Experience using Jira and Jira Service Management for operational workflows.
Ability to work independently and drive technical initiatives.
-
Site Reliability Engineering Manager
4 weeks ago
India CloudHire Full timeJob SummaryThe Technical Manager for Site Reliability Engineering (SRE) will lead a remote team of Site Reliability Engineers, ensuring operational excellence and fostering a high-performing team culture. Reporting to the US-based Director of Systems and Security, this role is responsible for overseeing day-to-day operations, technical mentorship, and...
-
Senior Site Reliability Engineer- ELK Expert
4 weeks ago
India iVedha Inc. Full timeSenior Site Reliability Engineer (SRE) – ELK Expert | Platform Engineering PracticeLocation: India (Remote) - Must be available to work in the EST (US/Canada) Time Zone.Role Summary:Are you a Senior Site Reliability Engineer (SRE) with deep ELK expertise, ready to take ownership of large-scale observability infrastructure?We're looking for an SRE with 7+...
-
Senior Site Reliability Engineer
1 day ago
India Cimpress Full timeSenior Site Reliability EngineerWho We Are:Cimpress Technology develops cutting-edge, best-in-world software that our mass customization businesses use to create personalized products for over 17 million global customers. Our Mass Customization Platform consists of modular, multi-tenant services. Our businesses can choose the solutions that work for them, or...
-
Site Reliability Engineer
2 days ago
Remote, India Rackspace Technology Full timeJob DescriptionSite Reliability Engineer / Observability EngineerPublic Cloud - Offerings and Delivery - Workforce Mgmt & Delivery Ops /Full - Time / RemoteRackspace is building up its Professional Services Center of Excellence on Application Performance Monitoring Suites.If you enjoy solving complex business problems and can contribute to building next...
-
Senior Site Reliability Engineer
2 weeks ago
India MindBrain Full timePosition SITE Reliability Engineer Budget- 1.7 LPMExp- 10 yrsDuration- 6 monthsTechnical Skills:Programming: Proficiency in languages like Python.Operating Systems: Deep understanding of Linux/Windows operating systems and networking concepts. Cloud Technologies: Experience with Azure including services, architecture, and best practices. Containerization and...
-
Site Reliability Engineer
4 weeks ago
India CES Full timeWe're looking for a highly skilled Site Reliability Engineer to help us build, manage, and scale modern infrastructure systems for high-availability applications. If you're passionate about automation, cloud platforms, and solving tough operational challenges, we would love to hear from you.Key Skills and Competencies3+ years of extensive experience with...
-
Junior Site Reliability Engineer
3 weeks ago
India JoVE Full timeJo VE is the world-leading producer and provider of science video solutions with the mission to improve scientific research and education.Millions of scientists, educators and students use Jo VE for their research, teaching and learning.Our institutional clients comprise over 1,000 universities, colleges, and biopharma companies, including such leaders as...
-
Junior Site Reliability Engineer
4 weeks ago
India JoVE Full timeJoVE is the world- leading producer and provider of video solutions with the mission to improve scientific research and education. Millions of scientists, educators and students use JoVE for their research, teaching and learning. Our institutional clients comprise over 1,000 universities, colleges, and biopharma companies, including such leaders as Harvard,...
-
Urgent Search Site Reliability Engineer
3 weeks ago
India pythian Full timeRemote Site Reliability Engineering - Site Reliability Engineering Full Time Remote Site Reliability Engineer India Multiple Timezones Remote Work from Home Why Pythian At Pythian we are experts in strategic database and analytics services driving digital transformation and operational excellence Pythian a multinational company was...
-
Site Reliability Engineering Manager
3 weeks ago
India Coinbase Full timeJob DescriptionReady to be pushed beyond what you think youre capable ofAt Coinbase, our mission is to increase economic freedom in the world. Its a massive, ambitious opportunity that demands the best of us, every day, as we build the emerging onchain platform and with it, the future global financial system.To achieve our mission, were seeking a very...