Site Reliability Engineer
8 hours ago
We are looking for an experienced Site Reliability Engineer (SRE) to ensure the availability, scalability, and reliability of our systems and services. You will design, implement, and maintain infrastructure solutions that enable seamless operations and high system performance. In this role, you will collaborate with development, operations, and product teams to integrate best practices, automate workflows, and drive continuous improvement.
At Axelerant, we are committed to fostering an environment where innovation and operational excellence thrive. As an SRE, you'll have the opportunity to work on challenging, large-scale problems using cutting-edge tools and technologies. You will also solve problems that benefit large numbers of people, collaborating with talented professionals to make a meaningful impact on our systems and services.
Your Job Responsibilities
- Design and implement reliable and scalable infrastructure to support business-critical applications and services.
- Collaborate with cross-functional teams to define and implement service level objectives (SLOs) and monitor key performance indicators (KPIs).
- Develop and manage Infrastructure as Code (IaC) solutions using tools like Terraform and Ansible.
- Automate repetitive operational tasks to enhance efficiency and reduce manual intervention.
- Troubleshoot and resolve system performance issues to minimize downtime and ensure high availability.
- Drive the adoption of cloud-native technologies and best practices.
- Participate in an on-call rotation to ensure prompt resolution of critical incidents and maintain system availability.
- Keep documentation and runbooks up to date to ensure effective incident response and operational continuity.
- Implement robust monitoring, logging, and alerting systems to proactively identify and resolve issues, and set up and leverage observability tools to ensure the platform operates as expected.
- Deploy and manage workloads on container orchestration systems like Kubernetes.
- Ensure security and compliance standards are integrated into the infrastructure.
- 3-4 years of proven experience as a Site Reliability Engineer, with a strong track record of designing and implementing large-scale data solutions.
- Proficiency in Infrastructure as Code (IaC) tools like Terraform and Ansible.
- Experience with container orchestration platforms such as Kubernetes, including deployment and management.
- Strong knowledge of Linux operating systems, including administration and optimization.
- Experience setting up and implementing workload management and deployment using GitOps tools like ArgoCD.
- Familiarity with monitoring and observability tools like Prometheus, Grafana, or Datadog.
- Solid understanding of networking concepts, load balancers, and distributed systems.
- Experience with scripting and automation using languages like Python, Bash, or Go.
- Knowledge of CI/CD pipelines and tools like Jenkins, GitLab CI, or CircleCI.
- Strong problem-solving and troubleshooting skills with a proactive mindset.
- Excellent communication skills to collaborate with technical and non-technical stakeholders.
- Certification in AWS or a similar cloud provider, with hands-on experience managing cloud infrastructure.
Good To Have
- Experience with multi-cloud architectures.
- Understanding of serverless architectures and tools.
- Experience with disaster recovery planning and implementation.
- Knowledge of machine learning workflows and data pipelines.
- Be part of an AI-first, remote-first digital agency that's shaping the future of customer experiences.
- Collaborate with global teams and leading platform partners to solve meaningful challenges.
- Enjoy a culture that supports autonomy, continuous learning, and work-life harmony.
As a global company that puts care into employee happiness, engineering excellence, and customer success, we are in striking contrast to the typical outsourcing option. We are a diverse team working remotely across many time zones, with success stories that back up our capabilities and a reputation for an unconventional, empowering work environment. We are the individuals directly challenging what it means to do global delivery differently for employees and partners.
Success management, our service framework, is part of who we are at Axelerant. All of our processes and practices are driven by this core, continuously iterated method. In practice, this means success management teams and success journey mapping for our partners.
-
Site Reliability Engineer
10 hours ago
Remote, India Luma Financial Technologies Full time
About Luma Financial Technologies
Founded in 2018, Luma Financial Technologies ("Luma") has pioneered a cutting-edge fintech software platform that has been adopted by broker/dealer firms, RIA offices, and private banks around the world. By using Luma, institutional and retail investors have a fully customizable, independent, buy-side technology platform that...
-
Site Reliability Engineer
2 days ago
Remote (India) Luma Financial Technologies Full time ₹ 30,000 - ₹ 60,000 per year
About the role
At Luma, our Site Reliability Engineer (SRE) team keeps our platform reliable, secure, and lightning fast. They own everything from AWS infrastructure and Kubernetes clusters to CI/CD pipelines, monitoring, and alerting. If you're passionate about tackling big challenges, automating at scale, and making systems more resilient, we'd love to have...
-
Software Engineer, Site Reliability Engineering
2 weeks ago
Remote, India ECOH Full time ₹ 8,00,000 - ₹ 16,00,000 per year
Minimum qualifications:
- Bachelor's degree in Computer Science, a related field, or equivalent practical experience.
- 1 year of experience with software development in one or more programming languages during coursework/projects, research, internships, or practical experience in school, work, or Open Source projects.
- Strong problem-solving and analytical...
-
Site Reliability Engineer 3
1 week ago
Remote, India Granicus Full time ₹ 15,00,000 - ₹ 25,00,000 per year
Job Summary: Opening from Default - All locations
The Company
Serving the People Who Serve the People
Granicus is driven by the excitement of building, implementing, and maintaining technology that is transforming the Govtech industry by bringing governments and their constituents together. We are on a mission to support our customers by meeting the needs of...
-
Site Reliability Engineer III
2 weeks ago
Remote, India HighLevel Full time ₹ 1,00,000 - ₹ 2,50,000 per year
About HighLevel:
HighLevel is an AI-powered, all-in-one white-label sales & marketing platform that empowers agencies, entrepreneurs, and businesses to elevate their digital presence and drive growth. We are proud to support a global and growing community of over 2 million businesses, comprised of agencies, consultants, and businesses of all sizes and...
-
Site Reliability Engineer, Contract
4 days ago
Remote, India 66degrees Full time ₹ 12,00,000 - ₹ 36,00,000 per year
Overview of 66degrees
66degrees is a leading consulting and professional services company specializing in developing AI-focused, data-led solutions leveraging the latest advancements in cloud technology. With our unmatched engineering capabilities and vast industry experience, we help the world's leading brands transform their business challenges into...
-
Sr. Site Reliability Engineer
4 days ago
Remote, India Veza Technologies Full time ₹ 12,00,000 - ₹ 36,00,000 per year
We are seeking a highly motivated Site Reliability Engineer (SRE) with a strong operational focus to join our growing team. In this role, you will play a vital role in ensuring the smooth operation and performance of our critical infrastructure and services. You'll work cross-functionally to create alignment and deliver results alongside builders who have...
-
Site Reliability Engineer
4 days ago
India - Remote Newfold Digital Full time ₹ 12,00,000 - ₹ 36,00,000 per year
Who we are.
Newfold Digital is a leading web technology company serving millions of customers globally. Our customers know us through our robust portfolio of brands. We have some of the industry's most prominent and storied go-to-market brands, including Bluehost, HostGator, and Network Solutions. We help customers of all...
-
Site Engineer
1 week ago
Remote, India Ray Enterprise Full time ₹ 1,80,000 - ₹ 2,40,000 per year
This is a full-time on-site role for a Civil Engineer located in Kolkata. The Civil Engineer will be responsible for the planning, design, and management of various civil engineering projects. Day-to-day tasks may include creating detailed engineering designs and managing the loading and unloading of medical equipment. For project purposes, you may have to travel...
-
Engineer II
2 weeks ago
Remote, India CrowdStrike Full time
Job Description
As a global leader in cybersecurity, CrowdStrike protects the people, processes and technologies that drive modern organizations. Since 2011, our mission hasn't changed - we're here to stop breaches, and we've redefined modern security with the world's most advanced AI-native platform. We work on large scale distributed systems, processing...