
Reliable Systems Innovator
2 days ago
Solution Focused Reliability Expert
Job DescriptionOverview:
We are seeking a skilled reliability professional to join our team. As a seasoned Site Reliability Engineer, you will leverage your technical expertise to enhance service/product reliability by incorporating Observability solutions.
In this role, you will collaborate with cross-functional teams to achieve reliability objectives. The ideal candidate will possess the drive, ownership, and technical acumen to tackle complex problems in a dynamic environment.
You will be responsible for working closely with product development teams, Cloud Infrastructure, and other SRE teams to ensure effective observability and reliability of our products, SLDC, and Infrastructure.
Key Responsibilities:
- Enable and empower fast-growing multi-disciplinary teams across applications and locations.
- Tackle complex development, automation, and business process challenges while championing Cvent standards and best practices.
- Ensure scalability, performance, and resilience of Cvent products and processes.
- Collaborate with product development teams, Cloud Automation, and other SRE teams to identify and resolve observability gaps.
- Identify recurring problems and anti-patterns in development, operational, and security processes, and help respective teams build observability.
- Develop automation that targets multiple regions seamlessly.
- Pursue Open-Source projects and contribute to their development.
Requirements:
- Excellent communication skills and experience working in distributed teams.
- A passion for making things better for peers.
- Experience managing AWS services or operational knowledge of managing applications in AWS via automation.
- Fluency in at least one scripting language: Typescript, Javascript, Python, Ruby, or Bash.
- Experience with SDLC methodologies (preferably Agile).
- Knowledge of Observability (Logging, Metrics, Tracing) and SLI/SLO.
- Experience with APM, monitoring, and logging tools (Datadog, New Relic, Splunk).
- Understanding of containerization concepts: docker, ECS, EKS, Kubernetes.
- Self-motivation and minimal supervision requirements.
- Troubleshooting and responding to incidents, setting standards for issue prevention.
- Good to have: Experience with Infrastructure as Code tools (CloudFormation, CDK, Terraform).
- Managing 3-tier application stacks.
- Basic networking understanding.
- Server configuration through Chef, Puppet, Ansible, or equivalent.
- NoSQL database experience (MongoDB, Couchbase, Postgres).
- APM data analysis for performance bottleneck identification.
-
System Reliability Specialist
1 week ago
Kanpur, Uttar Pradesh, India beBeeEngineer Full time ₹ 20,00,000 - ₹ 25,00,000We are looking for a seasoned reliability expert to join our news team. The ideal candidate will be responsible for ensuring the stability, performance, and scalability of our systems.The successful candidate will have 8+ years of experience in Site Reliability Engineering and will possess expertise in Kubernetes, Docker, and container orchestration.Key...
-
System Reliability Specialist
4 days ago
Kanpur, Uttar Pradesh, India beBeeReliabilityEngineer Full time ₹ 1,80,00,000 - ₹ 2,50,00,000Job Title: Site Reliability EngineerWe are seeking an experienced reliability engineer to play a critical role in ensuring the scalability and reliability of our internal platform.As a key member of our engineering team, you will work closely with development teams to ensure they have the tools, practices, and expertise needed to deliver high-quality...
-
Chief System Reliability Architect
1 week ago
Kanpur, Uttar Pradesh, India beBeePerformance Full time ₹ 1,80,00,000 - ₹ 2,20,00,000Reliability Engineering LeadAs a pivotal figure in building and scaling robust systems, the Reliability Engineering Lead oversees the reliability, scalability, and performance of our critical systems. This position combines software engineering and systems engineering expertise to build and maintain high-performing, reliable systems.Key...
-
Building Reliable Systems
2 days ago
Kanpur, Uttar Pradesh, India beBeeSite Full time ₹ 23,04,000 - ₹ 2,59,20,000Reliable System EngineerTalent500 offers a leading platform for job opportunities at Global Capability Centres or GCCs in India. We help the best tech and non-tech talent find their dream jobs at renowned companies, resulting in transformative career experiences.This is an exciting opportunity to play a crucial role in keeping our digital backbone running...
-
AI/ML System Reliability Engineer
1 week ago
Kanpur, Uttar Pradesh, India beBeeSiteReliability Full time ₹ 13,04,000 - ₹ 26,12,000Transform Your Career with AI/ML Site ReliabilityWe seek an experienced professional to ensure the reliability and scalability of cloud-based AI/ML systems.Key Responsibilities:Design, implement, and maintain scalable and reliable Azure infrastructure (storage, networking, security, IAM)Collaborate with cross-functional teams to develop and deploy Databricks...
-
Reliable Financial System Specialist
2 days ago
Kanpur, Uttar Pradesh, India beBeeReliability Full time ₹ 20,00,000 - ₹ 25,00,000About our ideal candidate, we are looking for a skilled professional with expertise in reliability engineering. The Site Reliability Engineer will be responsible for ensuring the stability and scalability of Accounting and Finance platforms.Key responsibilities include:Ensuring Accounting and Finance platforms meet defined SLAs, SLOs, and SLIs for...
-
Maintenance and Reliability Expert
1 week ago
Kanpur, Uttar Pradesh, India beBeeMaintenance Full time ₹ 1,50,00,000 - ₹ 2,50,00,000Job Title: Maintenance and Reliability ExpertWe are seeking a skilled and experienced professional to fill the role of Maintenance and Reliability Expert. As a key member of our team, you will be responsible for ensuring the smooth operation of all plant utilities, mechanical and electrical systems, and equipment across the pharmaceutical facility.The...
-
Reliable Systems Expert
1 week ago
Kanpur, Uttar Pradesh, India beBeeResponsibility Full time ₹ 18,00,000 - ₹ 26,40,000Job OverviewThis is a key position for a skilled Site Reliability Engineer to join our team.Experience working with microservices on a Kubernetes background and possessing a strong understanding of observability tools and metrics.
-
Senior System Reliability Engineer
1 week ago
Kanpur, Uttar Pradesh, India beBeeMonitoring Full time ₹ 18,00,000 - ₹ 20,00,000System Health MonitorThe Insight Global team is hiring a full-time Monitoring Engineer to join the LLM Proxy Team. This role involves monitoring system health via Grafana dashboards, managing incident communications, and ensuring high reliability of globally deployed web applications.Key Responsibilities:Monitor Grafana dashboards and observability tools to...
-
Senior Financial Systems Reliability Specialist
12 hours ago
Kanpur, Uttar Pradesh, India beBeeSiteReliabilityEngineer Full time ₹ 1,50,00,000 - ₹ 2,00,00,000Reliability Engineer Job SummaryThis is a critical role for ensuring the stability, scalability, and operational excellence of accounting and finance platforms. The ideal candidate will have experience in Site Reliability Engineering (SRE), DevOps, or Production Engineering, with a strong background in monitoring/observability tools and automation...