
Reliable Systems Engineer
1 week ago
We treat infrastructure and operations as software engineering problems.
Our mission is to build and evolve software platforms that enable services to be provisioned and managed in safe, reliable, and scalable ways.
We challenge the status quo and use new technologies to build platforms and tooling for engineering teams.
In this role, you will make significant decisions with a huge impact on building banking technology.
You will be part of a team responsible for designing and architecting solutions and for finding creative ways to optimize existing ones, improving agility in managing microservices infrastructure.
- A strong believer in automating DevOps and SRE work such as infrastructure provisioning, deployment, observability, incident lifecycle management, and uptime SLAs.
- Bold enough to challenge, open to being challenged, and curious to learn and grow.
The day-to-day activities include working with Kubernetes clusters hosted in cloud environments and using Infrastructure as Code tooling like Terraform and Ansible to manage resources.
Engage with development teams throughout the life cycle to help develop software for reliability and scale. Coach teams on SRE best practices.
Troubleshoot priority incidents, facilitate post-mortems, and ensure permanent closure of incidents.
Perform analytics on previous incidents and usage patterns to predict issues and take proactive actions.
Build self-healing and resiliency patterns and drive their adoption.
Design automated software and product upgrades, change management, and release management solutions.
Design, code, test, and deliver software to automate manual operational work. Own your tools and services end-to-end.
Optimize infrastructure performance and cost.
Be part of an on-call rotation for 24x7 support coverage as needed.
Succeed, fail, and learn together with other talented people.
Key Qualifications
- Bachelor's degree in information systems, information technology, computer science, or similar.
- 3+ years of professional experience.
- Experience with administering Kubernetes clusters.
- Experience with managing Infrastructure as Code using Terraform.
- Direct production operations experience in a cloud environment.
- Experience contributing to technology and product strategy.
- Experience leading capability-building initiatives across diverse areas such as infrastructure and operations automation, observability, incident management, architecting HA systems, and other core engineering.
- Demonstrated experience in driving operational efficiency and transparency of a growing organization.
-
Reliable Systems Expert
1 week ago
Salem, Tamil Nadu, India beBeeEngineering Full time ₹ 20,00,000 - ₹ 25,00,000
As a Site Reliability Engineer, you'll play a crucial role in maintaining our digital foundation's uptime for millions of users. Your mission is to minimize incidents, automate processes, and help us scale efficiently. About the Role: Ensure System Stability: Identify potential system issues early, implement preventive measures, and boost system...
-
Site Reliability Engineer
1 week ago
Salem, Tamil Nadu, India beBeeSiteReliabilityEngineer Full time ₹ 1,10,00,000 - ₹ 1,70,00,000
The role of a Site Reliability Engineer is to ensure the stability and scalability of financial platforms. This position requires building automation, implementing monitoring, improving incident response, and championing DevOps practices to enable Finance and Accounting systems to operate with consistency and trustworthiness. Key Responsibilities...
-
Site Reliability Engineering Executive
1 week ago
Salem, Tamil Nadu, India beBeeSre Full time ₹ 1,80,00,000 - ₹ 2,00,00,000
Reliability Engineer Leader
Job Description: This is an exciting opportunity to shape the SRE function within our organisation and be a founding member of the Group SRE team. We are seeking a highly skilled and experienced engineer to join our team at Natobotics. As a system reliability leader, you will define, drive, and implement the SRE strategy across...
-
Cloud Reliability Engineer Leader
1 week ago
Salem, Tamil Nadu, India beBeeReliability Full time US$ 15,00,000 - US$ 20,00,000
Job Overview: The Cloud Reliability Engineer Leader will play a critical role in ensuring the stability, scalability, and operational excellence of Accounting and Finance platforms. This role is focused on leading the operational health of these platforms, ensuring the delivery of highly reliable financial applications and data services that meet the demanding...
-
Senior DevOps System Reliability Engineer
1 week ago
Salem, Tamil Nadu, India beBeeDevops Full time US$ 15,00,000 - US$ 20,00,000
Job Overview: We are seeking a skilled Senior DevOps Engineer to join our production group. This is an exciting opportunity for an experienced professional to lead the development of scalable systems and guide best practices across DevOps, SRE, and security. This role requires a strong background in infrastructure automation, incident management, and on-call...
-
Reliable Platform Engineer
1 week ago
Salem, Tamil Nadu, India beBeeSite Full time ₹ 1,50,00,000 - ₹ 2,00,00,000
Job Opportunity: We are currently seeking a skilled Observability Engineer Site Reliability to join our team. This role will involve building and fine-tuning platform components for the Observability product, working closely with the Lead engineer, performance team, data ingestion, platform DevOps and data visualization teams under Observability product. This...
-
Reliability Expert
1 week ago
Salem, Tamil Nadu, India beBeePerformance Full time ₹ 2,00,00,000 - ₹ 2,50,00,000
Job Description: Seeking an experienced professional to join our team as a Site Reliability Engineer. The ideal candidate will have a strong understanding of distributed systems, cloud platforms, and microservices architecture. Responsibilities include monitoring, observability, and performance optimization for web and mobile applications. This is a challenging...
-
Site Reliability Professional
1 week ago
Salem, Tamil Nadu, India beBeeReliability Full time US$ 1,50,000 - US$ 2,00,000
System Reliability Expert
As a System Reliability Engineer, you will play a pivotal role in ensuring the stability and efficiency of our systems. Your primary responsibilities will include identifying potential system issues early, implementing preventive measures, and enhancing system resilience. Key Responsibilities: Implement reliability engineering:...
-
Engineer, Site Reliability T500-20169
1 week ago
Salem, Tamil Nadu, India ANSR Full time
ANSR is hiring for one of its clients. About T-Mobile: T-Mobile US, Inc. (NASDAQ: TMUS), headquartered in Bellevue, Washington, is America's supercharged Un-carrier, connecting millions through its strong nationwide network and flagship brands, T-Mobile and Metro by T-Mobile. Customers benefit from an unmatched combination of value, quality, and exceptional...
-
Senior Site Reliability Engineer
4 weeks ago
Salem, Tamil Nadu, India MindBrain Full time
Position: Site Reliability Engineer. Budget: 1.7 LPM. Experience: 10 yrs. Duration: 6 months. Technical Skills: Programming: Proficiency in languages like Python. Operating Systems: Deep understanding of Linux/Windows operating systems and networking concepts. Cloud Technologies: Experience with Azure including services, architecture, and best practices. ...