Openstack Engineer
1 week ago
We are looking for a L0 and L1 Site Reliability Engineer (SRE) Support to join our Krutrim Cloud Site Reliability operations team and ensure the smooth functioning of our cloud infrastructure powered by OpenStack and Kubernetes. In this role, you will focus on monitoring, basic troubleshooting, and incident response, helping to maintain high system availability, reliability, and performance. You will be responsible for identifying and addressing simple issues, as well as escalating more complex problems to senior SREs when needed.
The ideal candidate should have a basic understanding of cloud infrastructure (especially OpenStack and Kubernetes), containerized environments, and system monitoring. This position offers an excellent opportunity for someone looking to grow into a more advanced SRE or DevOps role.
Key Responsibilities:
For L0 Support (Level 0):
- Incident Monitoring & Triage:
- Respond to system alerts, monitor infrastructure health using tools like Prometheus, Grafana, and Observability for both OpenStack and Kubernetes.
- Identify low-level issues and follow runbooks or predefined scripts to perform first-level triage.
- Document and escalate unresolved incidents to L1 or L2 based on established escalation protocols.
- System Health Checks:
- Perform daily health checks for Kubernetes pods, nodes, and OpenStack instances.
- Verify basic functionality of VMs, containers, and network services within the environment.
- Basic Troubleshooting:
- Resolve simple issues such as VM reboots, pod failures, and network connectivity issues within OpenStack or Kubernetes environments.
- Follow the predefined steps for basic troubleshooting tasks like restarting services or clearing logs.
- Ticket Management:
- Log incidents and issues into a ticketing system (e.g., JIRA, ServiceNow) for tracking and escalation.
- Update incident tickets and provide relevant information for ongoing resolution efforts.
=========================================================================================================
For L1 Support (Level 1):
- Incident Resolution:
- Investigate and resolve more complex issues compared to L0, such as Kubernetes pod crashes, network misconfigurations in OpenStack, and minor service disruptions.
- Work with tools like kubectl to troubleshoot Kubernetes pods and nodes, and OpenStack CLI to diagnose problems with VMs, storage, and networks.
- Automation & Scripting:
- Automate routine tasks, such as VM provisioning, pod deployments, or status checks, using basic scripting languages (Python, Bash).
- Improve automation workflows based on feedback and frequently encountered issues.
- Log Aggregation & Monitoring:
- Review logs and metrics collected from ELK Stack, Prometheus, Grafana, or other logging tools to detect trends and potential issues.
- Analyze logs and metrics from OpenStack and Kubernetes clusters to pinpoint underlying problems (e.g., high CPU usage, memory leaks).
- Basic Network & Storage Management:
- Investigate networking issues related to Neutron (for OpenStack) and CNI configurations (for Kubernetes).
- Manage storage resources within OpenStack and Kubernetes (e.g., creating persistent volumes, debugging storage access issues).
- Collaboration & Escalation:
- Work closely with L2 and L3 engineers for complex troubleshooting or advanced system issues that require in-depth knowledge.
- Share knowledge with the team and assist in creating new documentation or updating existing troubleshooting guides.
- User and Permissions Management:
- Perform basic user management tasks within OpenStack (e.g., creating and managing tenants, security groups).
- Review and modify Kubernetes RBAC (Role-Based Access Control) settings based on user access needs.
Skills & Qualifications:
Required Skills:
- Basic Cloud & Kubernetes Knowledge:
- Familiarity with OpenStack architecture (e.g., Nova, Neutron, Cinder).
- Basic understanding of Kubernetes components, including pods, services, deployments, and namespaces.
- Systems & Networking:
- Knowledge of Linux/Unix-based operating systems (e.g., Ubuntu, CentOS, Red Hat).
- Understanding of networking concepts like DNS, IP routing, and VLANs in cloud environments.
- Monitoring & Alerting Tools:
- Familiarity with monitoring tools like Prometheus, Grafana, Zabbix, or CloudWatch for alert management and system health monitoring.
- Troubleshooting & Incident Response:
- Experience in using log aggregation tools (ELK stack, Splunk) and interpreting logs for incident detection.
- Ability to perform basic troubleshooting steps (e.g., restarting services, running basic shell commands) to resolve issues.
- Communication Skills:
- Strong communication skills to collaborate effectively with senior SREs, developers, and other teams.
- Ability to document incidents, solutions, and troubleshooting steps clearly.
Preferred Skills:
- Basic Scripting & Automation:
- Exposure to scripting languages such as Bash, Python, or Go to automate basic administrative tasks.
- Cloud Platform Experience:
- Familiarity with other cloud technologies such as AWS, Azure, or Google Cloud Platform.
- Certifications:
- Basic certifications such as CompTIA Linux+, AWS Certified Solutions Architect, Kubernetes Fundamentals (CKA), or OpenStack COA are a plus.
-
OpenStack Engineer
5 days ago
Bengaluru, Karnataka, India Talent Worx Full time ₹ 12,00,000 - ₹ 36,00,000 per yearTitle: OpenStack EngineerExp : 6- 10 yearsJob Description:SummaryWe are seeking a highly skilled Red Hat OpenStack Subject Matter Expert (SME) to join our team. This individual will be responsible for designing, deploying, managing, and troubleshooting complex OpenStack environments. As a key technical expert, you will provide leadership in architecture,...
-
openstack Engineer
1 week ago
Bengaluru, Karnataka, India UJS Consultancy Pvt. Ltd. Full time ₹ 8,00,000 - ₹ 15,00,000 per yearJob Title: OpenStack EngineerExperience: 3+ YearsLocation: Koramangala, Bengaluru/PuneWork Mode: 5 Days – Work From Office (WFO), Rotational Shifts & On-call supportMandatory Skills:-Strong hands-on experience with OpenStack core components (Atleast 2 out of Nova, Neutron, Keystone, Glance, Cinder, Horizon, etc.)-Familiarity with OpenStack CLI and APIs...
-
Openstack Engineer
2 weeks ago
Bengaluru, Karnataka, India WhiteLotus Talent Partners Full time ₹ 5,00,000 - ₹ 12,00,000 per yearWe are looking for aL0 and L1 Site Reliability Engineer (SRE) Supportto join our Krutrim Cloud Site Reliability operations team and ensure the smooth functioning of our cloud infrastructure powered byOpenStackandKubernetes. In this role, you will focus onmonitoring,basic troubleshooting, andincident response, helping to maintain high system availability,...
-
OpenStack Principal Engineer
2 days ago
Bengaluru, Karnataka, India Shavish Hr And Digital Marketing Full time ₹ 20,00,000 - ₹ 25,00,000 per yearExp in OpenStack architecture, with substantial expertise in deploying and managing OpenStack in cloud environmentsAdvanced expertise in OpenStack components such as Nova, Neutron, Cinder, and Swift.Exp in designing and managing (SDI) architecture Required Candidate profileLead the design and implementation of OpenStack environments for highly scalable...
-
Lead Engineer-OpenStack
2 days ago
Bengaluru, Karnataka, India Capgemini Full time ₹ 15,00,000 - ₹ 20,00,000 per yearBelow is the complete JDTotal Experience is Required- 7 to 14 YearsLead the design, deployment, and management of OpenStack-based private cloud infrastructure.Oversee and guide the team in configuring and optimizing OpenStack components (e.g., Nova, Neutron, Keystone, Cinder, Glance, Horizon).Deep understanding of OpenStack architecture and...
-
OpenStack Administrator
6 days ago
Bengaluru, Karnataka, India Capgemini Full time ₹ 15,00,000 - ₹ 25,00,000 per yearChoosing Capgemini means choosing a company where you will be empowered to shape your career in the way you'd like, where you'll be supported and inspired by a collaborative community of colleagues around the world, and where you'll be able to reimagine what's possible. Join us and help the world's leading organizations unlock the value of technology and...
-
Site Reliability Engineer
6 days ago
Bengaluru, Karnataka, India, Karnataka WhiteLotus Talent Partners Full timeWe are looking for a L0 and L1 Site Reliability Engineer (SRE) Support to join our Krutrim Cloud Site Reliability operations team and ensure the smooth functioning of our cloud infrastructure powered by OpenStack and Kubernetes. In this role, you will focus on monitoring, basic troubleshooting, and incident response, helping to maintain high system...
-
Private Cloud Security Engineer
2 weeks ago
Bengaluru, Karnataka, India, Karnataka Objectways Full timeJob Title: Private Cloud Security EngineerLocation: Bangalore (Hybrid – 3 days in office)Experience Required: 5+ yearsRole OverviewAs a Private Cloud Security Engineer, you will play a vital role in safeguarding our on-premise or privately hosted cloud environments. You will be responsible for designing, implementing, and monitoring robust security...
-
Site Reliability Engineer
7 days ago
Bengaluru, Karnataka, India WhiteLotus Talent Partners Full time ₹ 9,00,000 - ₹ 12,00,000 per yearWe are looking for aL0 and L1 Site Reliability Engineer (SRE) Supportto join our Krutrim Cloud Site Reliability operations team and ensure the smooth functioning of our cloud infrastructure powered byOpenStackandKubernetes. In this role, you will focus onmonitoring,basic troubleshooting, andincident response, helping to maintain high system availability,...
-
Network Principal Engineer
2 days ago
Bengaluru, Karnataka, India Shavish Hr And Digital Marketing Full time ₹ 15,00,000 - ₹ 25,00,000 per yearExp in network architecture, with substantial expertise in deploying and managing networks in cloud environments utilizing OpenStack and KuberneteExp in using network monitoring and management tools tailored to OpenStack and Kubernetes environment