Cloud System Debug Engineer
14 hours ago
Job Description

Job Title: Cloud System Debug Engineer

Position Overview

We are seeking an experienced Cloud System Debug Engineer with deep expertise in cloud infrastructure, Kubernetes, OpenStack, Linux systems, and Ceph storage. This role focuses on diagnosing, analyzing, and resolving complex issues across large-scale cloud and distributed environments. You will work across multi-cloud, hybrid, and private cloud platforms to ensure high availability, performance, and reliability of mission-critical systems.

Key Responsibilities

- Debug complex issues across large-scale public, private, and hybrid cloud environments.
- Knowledge of microservices debugging and cloud-native application behavior.
- Investigate failures in cloud infrastructure components such as networking, storage, virtualization, and orchestration layers.
- Diagnose and resolve system issues in Kubernetes clusters, including nodes, pods, networking (CNI), and storage (CSI).
- Troubleshoot problems with container runtimes such as Docker, containerd, and CRI-O.
- Debug OpenStack components including Nova, Neutron, Cinder, Keystone, Glance, Horizon, and related APIs.
- Debug and optimize Ceph storage clusters, including OSD issues, MON behavior, CRUSH map analysis, and performance bottlenecks.
- Perform deep Linux system debugging, including kernel-level issues, network stack debugging, storage subsystem issues, and performance anomalies.
- Conduct thorough Root Cause Analysis (RCA) and implement long-term corrective actions.
- Improve system observability by enhancing monitoring, logging, and tracing using tools like Prometheus, Grafana, ELK/EFK, and Jaeger.
- Develop and refine internal tools and automation for diagnostics, system debugging, and infrastructure monitoring.
- Support production operations through an on-call rotation, addressing high-impact incidents quickly and effectively.
- Optimize cloud and on-premise infrastructure for performance, scalability, and reliability.
- Collaborate with DevOps, SRE, platform engineering, and development teams to resolve infrastructure and cloud platform issues.
- Produce high-quality technical documentation, runbooks, and troubleshooting guides for system and cloud operations.

Required Skills & Qualifications

- 4+ years of experience in cloud infrastructure, distributed systems, Linux administration, or systems engineering.
- Good expertise with cloud platforms (AWS, GCP, Azure) or large-scale private cloud environments.
- Strong proficiency with Kubernetes cluster debugging, scaling, and cloud-native architectures.
- Hands-on experience with OpenStack cloud components and troubleshooting.
- Good knowledge of Ceph distributed storage systems and cluster tuning.
- In-depth understanding of Linux internals, including networking, kernel behavior, process management, and storage subsystems.
- Strong scripting/automation experience (Bash, Python, Ansible, Terraform, Helm).
- Experience analyzing system logs, traces, crashes, and performance metrics in distributed systems.
- Proficiency with observability stacks such as Prometheus, Grafana, OpenTelemetry.
- Ability to debug complex interactions between cloud services, orchestration tools, and infrastructure layers.
- Strong analytical, communication, and documentation skills.

Preferred Qualifications

- Certifications in AWS/Azure/GCP, CKA/CKAD/CKS, OpenStack, or Ceph.
- Experience with cloud networking (VXLAN, BGP, SDN, overlay networks).
- Experience designing, analyzing, or operating high-availability, multi-region distributed architectures.

Education

- Bachelor's or Master's degree in Computer Science, Engineering, or a related field (or equivalent experience).
-
Cloud System Debug Engineer
3 days ago
Bengaluru, Karnataka, India Ola Full time ₹ 12,00,000 - ₹ 24,00,000 per year
Job Title: Cloud System Debug Engineer. Position Overview: We are seeking an experienced Cloud System Debug Engineer with deep expertise in cloud infrastructure, Kubernetes, OpenStack, Linux systems, and Ceph storage. This role focuses on diagnosing, analyzing, and resolving complex issues across large-scale cloud and distributed environments. You will work across...
-
Hardware Engineer – Debug
4 days ago
Bengaluru, India Microsoft Full time
Overview: Microsoft Silicon, Cloud Hardware, and Infrastructure Engineering (SCHIE) is the team behind Microsoft’s expanding Cloud Infrastructure and responsible for powering Microsoft’s “Intelligent Cloud” mission. SCHIE delivers the core infrastructure and foundational technologies for Microsoft's over 200 online businesses including Bing, MSN,...
-
Hardware Engineer – Debug
1 week ago
Bengaluru, Karnataka, India Microsoft Full time ₹ 12,00,000 - ₹ 24,00,000 per year
Microsoft Silicon, Cloud Hardware, and Infrastructure Engineering (SCHIE) is the team behind Microsoft's expanding Cloud Infrastructure and responsible for powering Microsoft's "Intelligent Cloud" mission. SCHIE delivers the core infrastructure and foundational technologies for Microsoft's over 200 online businesses including Bing, MSN, Office 365, Xbox...
-
System Debug
4 days ago
Bengaluru, Karnataka, India Capgemini Engineering Full time
Good experience and knowledge of Windows and Linux internals. Good experience and knowledge of Windows and Linux device drivers. Extensive and real-time experience with protocols such as USB, I2C, PCI, and PCIe. Proficient in testing and debugging firmware, BIOS, and BIOS stitching using various tools (Intel tool knowledge...
-
Sr. Hardware Engineer – Debug
1 week ago
Bengaluru, India Microsoft Full time
Overview: Microsoft Silicon, Cloud Hardware, and Infrastructure Engineering (SCHIE) is the team behind Microsoft’s expanding Cloud Infrastructure and responsible for powering Microsoft’s “Intelligent Cloud” mission. SCHIE delivers the core infrastructure and foundational technologies for Microsoft's over 200 online businesses including Bing, MSN,...
-
Sr. Hardware Engineer – Debug
2 weeks ago
Bengaluru, Karnataka, India Microsoft Full time ₹ 1,00,00,000 - ₹ 2,00,00,000 per year
Microsoft Silicon, Cloud Hardware, and Infrastructure Engineering (SCHIE) is the team behind Microsoft's expanding Cloud Infrastructure and responsible for powering Microsoft's "Intelligent Cloud" mission. SCHIE delivers the core infrastructure and foundational technologies for Microsoft's over 200 online businesses including Bing, MSN, Office 365, Xbox...
-
Software Debug Engineer
1 week ago
Chennai, India Zoho Full time
Job Role: Software Debug Engineer. Experience: 0 - 3 Years. Work Location: Chennai. Interview Location: Coimbatore. We are looking for a hands-on Software Debug Engineer who is passionate about troubleshooting, debugging, and resolving production issues across multiple layers - infrastructure, deployment pipelines, application and runtime...
-
Hardware Engineer – Debug
2 days ago
Bengaluru, Karnataka, India Microsoft Full time ₹ 15,00,000 - ₹ 25,00,000 per year
Microsoft Silicon, Cloud Hardware, and Infrastructure Engineering (SCHIE) is the team behind Microsoft's expanding Cloud Infrastructure and responsible for powering Microsoft's "Intelligent Cloud" mission. SCHIE delivers the core infrastructure and foundational technologies for Microsoft's over 200 online businesses including Bing, MSN, Office 365, Xbox...
-
Senior Cloud Development Engineer
1 week ago
Bengaluru, Karnataka, India Cloud Software Group Full time
Job Description: As a Senior Software Engineer, you will design and implement enterprise-grade web applications and REST API services in large public clouds or on-premises setups. The major technology stack includes .NET and C#, Azure services, RDBMS, and advanced knowledge of CI/CD (TeamCity/Jenkins). Engineering a solution that can withstand failure and...
-
IT Cloud Systems Engineer
4 weeks ago
Bengaluru, India Hotwire Asia Pacific Full time
Who we are: Hotwire is a global communications and marketing consultancy that helps the world's most innovative technology brands ignite their possibilities. With over 20 years of experience and a presence in 11 countries, we specialize in making the technical irresistible through data-led insights and integrated campaigns. Our approach...