Cloud Engineering Ops Lead
4 days ago
We are seeking a Cloud Engineering Ops Lead responsible for ensuring the stability, observability, security, and cost-efficiency of our AWS environments and customer-facing applications. This role is critical in maintaining production operations that are reliable, predictable, and optimized for performance and resilience.
Key Responsibilities:
1. AWS Platform Operations
- Manage and maintain AWS core services including EC2, EKS, RDS, ALB/CloudFront, IAM/OIDC, VPC, Transit Gateways, and Security Groups.
- Ensure system hygiene, patching, and infrastructure health.
- Automate operational workflows using Terraform, Ansible, or Python.
2. Application Support
- Ensure production readiness through runbooks, pre-deployment validations, performance baselines, and rollback mechanisms.
- Support releases with deployment assistance, smoke testing, and incident troubleshooting.
- Drive continuous improvement in application stability and availability.
3. Observability & Monitoring
- Build and maintain dashboards, logs, metrics, traces, and synthetic monitoring.
- Ensure alert accuracy: eliminate noise and keep notifications targeted.
- Track SLOs, error budgets, and system performance.
- Lead incident response, RCA, and implement corrective actions.
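For candidates less familiar with the term, the error-budget tracking mentioned above can be illustrated with a small calculation. This is only a sketch under assumptions: the function name, the request-based SLO definition, and the sample numbers are hypothetical and are not taken from the team's actual tooling.

```python
def error_budget_remaining(slo_target: float, total_requests: int, failed_requests: int) -> float:
    """Return the unspent fraction of the error budget (negative if overspent).

    Assumes a request-based availability SLO, e.g. slo_target=0.999 means
    99.9% of requests in the window must succeed.
    """
    if not 0.0 < slo_target < 1.0:
        raise ValueError("slo_target must be strictly between 0 and 1")
    if total_requests <= 0:
        raise ValueError("total_requests must be positive")
    # Number of failures the SLO tolerates over this window.
    allowed_failures = (1.0 - slo_target) * total_requests
    return 1.0 - failed_requests / allowed_failures


if __name__ == "__main__":
    # Hypothetical window: 1,000,000 requests, 250 failures, 99.9% SLO.
    remaining = error_budget_remaining(0.999, 1_000_000, 250)
    print(f"Error budget remaining: {remaining:.1%}")
```

A negative result means the budget is overspent for the window, which is typically the signal to prioritize reliability work over new releases.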
4. Backup & Disaster Recovery
- Define and manage backup and restore operations with schedules, retention rules, replication, and validation.
- Conduct regular DR drills to ensure RPO/RTO targets are consistently met.
- Maintain up-to-date documentation on disaster recovery processes.
5. Cost Optimization
- Enforce cost governance through tagging, right-sizing, reservation planning, and lifecycle management (EBS, EIP, AMIs).
- Generate cost analysis reports with actionable recommendations to improve efficiency.
6. Team Leadership & Enablement
- Lead high-severity incident bridges (Sev-1/Sev-2) with clear communication.
- Mentor team members in operational excellence and preventive practices.
- Develop reusable runbooks and automation to eliminate repetitive tasks.
- Promote a culture of reliability, transparency, and proactive improvement.
Success Metrics:
- Visibility: Dashboards and alerts are reliable, actionable, and service-specific.
- Backup Health: 100% backup success rate with monthly restore testing.
- Reliability: Reduced MTTR, increased deployment success rate, and runbook-driven resolutions.
- Change Management: Stable release cycles with tested rollback strategies.
- Cost Control: Optimized AWS expenditure with over 95% tagging compliance.
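As an illustration of how the 95% tagging compliance target could be measured, here is a minimal sketch. The required tag keys and the sample inventory are hypothetical; in practice the inventory would come from an AWS resource export rather than a hard-coded dictionary.

```python
# Hypothetical set of mandatory tag keys; the real policy may differ.
REQUIRED_TAGS = {"owner", "environment", "cost-center"}


def tagging_compliance(resources: dict[str, dict[str, str]]) -> float:
    """Return the percentage of resources that carry every required tag key."""
    if not resources:
        return 100.0  # nothing to tag, so nothing is out of compliance
    compliant = sum(1 for tags in resources.values() if REQUIRED_TAGS <= set(tags))
    return 100.0 * compliant / len(resources)


if __name__ == "__main__":
    # Hypothetical inventory keyed by resource ID.
    inventory = {
        "i-0001": {"owner": "ops", "environment": "prod", "cost-center": "101"},
        "i-0002": {"owner": "ops", "environment": "prod"},
        "vol-0003": {"owner": "data", "environment": "dev", "cost-center": "202"},
        "eip-0004": {"owner": "net", "environment": "prod", "cost-center": "101"},
    }
    pct = tagging_compliance(inventory)
    print(f"Tagging compliance: {pct:.1f}% (target: over 95%)")
```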
Required Skills & Experience:
- 10+ years in cloud and application operations with deep expertise in AWS.
- Proven leadership in managing production incidents and driving operational excellence.
- Strong knowledge of observability tools: CloudWatch, Prometheus, Grafana, Datadog, etc.
- Hands-on experience with Terraform, Ansible, and/or Python for automation (IaC).
- Expertise in backup strategies and disaster recovery practices with real-world restore testing.
- Solid understanding of AWS cloud networking including VPCs, routing, security groups, and transit gateways.
- Excellent communication, mentoring ability, and problem-solving mindset.