OpenStack Engineer
1 week ago
We are looking for an L0 and L1 Site Reliability Engineer (SRE) Support engineer to join our Krutrim Cloud Site Reliability operations team and ensure the smooth functioning of our cloud infrastructure powered by OpenStack and Kubernetes. In this role, you will focus on monitoring, basic troubleshooting, and incident response, helping to maintain high system availability, reliability, and performance. You will be responsible for identifying and addressing simple issues, as well as escalating more complex problems to senior SREs when needed.
The ideal candidate should have a basic understanding of cloud infrastructure (especially OpenStack and Kubernetes), containerized environments, and system monitoring. This position offers an excellent opportunity for someone looking to grow into a more advanced SRE or DevOps role.
Key Responsibilities:
For L0 Support (Level 0):
- Incident Monitoring & Triage:
  - Respond to system alerts and monitor infrastructure health for both OpenStack and Kubernetes using tools like Prometheus, Grafana, and other observability tooling.
  - Identify low-level issues and follow runbooks or predefined scripts to perform first-level triage.
  - Document and escalate unresolved incidents to L1 or L2 based on established escalation protocols.
- System Health Checks:
  - Perform daily health checks for Kubernetes pods, nodes, and OpenStack instances.
  - Verify basic functionality of VMs, containers, and network services within the environment.
- Basic Troubleshooting:
  - Resolve simple issues such as VM reboots, pod failures, and network connectivity issues within OpenStack or Kubernetes environments.
  - Follow the predefined steps for basic troubleshooting tasks like restarting services or clearing logs.
- Ticket Management:
  - Log incidents and issues into a ticketing system (e.g., JIRA, ServiceNow) for tracking and escalation.
  - Update incident tickets and provide relevant information for ongoing resolution efforts.
=========================================================================================================
For L1 Support (Level 1):
- Incident Resolution:
  - Investigate and resolve issues more complex than those handled at L0, such as Kubernetes pod crashes, network misconfigurations in OpenStack, and minor service disruptions.
  - Use kubectl to troubleshoot Kubernetes pods and nodes, and the OpenStack CLI to diagnose problems with VMs, storage, and networks.
- Automation & Scripting:
  - Automate routine tasks, such as VM provisioning, pod deployments, or status checks, using basic scripting languages (Python, Bash).
  - Improve automation workflows based on feedback and frequently encountered issues.
- Log Aggregation & Monitoring:
  - Review logs and metrics collected from ELK Stack, Prometheus, Grafana, or other logging tools to detect trends and potential issues.
  - Analyze logs and metrics from OpenStack and Kubernetes clusters to pinpoint underlying problems (e.g., high CPU usage, memory leaks).
- Basic Network & Storage Management:
  - Investigate networking issues related to Neutron (for OpenStack) and CNI configurations (for Kubernetes).
  - Manage storage resources within OpenStack and Kubernetes (e.g., creating persistent volumes, debugging storage access issues).
- Collaboration & Escalation:
  - Work closely with L2 and L3 engineers on complex troubleshooting or advanced system issues that require in-depth knowledge.
  - Share knowledge with the team and assist in creating new documentation or updating existing troubleshooting guides.
- User and Permissions Management:
  - Perform basic user management tasks within OpenStack (e.g., creating and managing tenants, security groups).
  - Review and modify Kubernetes RBAC (Role-Based Access Control) settings based on user access needs.
Skills & Qualifications:
Required Skills:
- Basic Cloud & Kubernetes Knowledge:
  - Familiarity with OpenStack architecture (e.g., Nova, Neutron, Cinder).
  - Basic understanding of Kubernetes components, including pods, services, deployments, and namespaces.
- Systems & Networking:
  - Knowledge of Linux/Unix-based operating systems (e.g., Ubuntu, CentOS, Red Hat).
  - Understanding of networking concepts like DNS, IP routing, and VLANs in cloud environments.
- Monitoring & Alerting Tools:
  - Familiarity with monitoring tools like Prometheus, Grafana, Zabbix, or CloudWatch for alert management and system health monitoring.
- Troubleshooting & Incident Response:
  - Experience in using log aggregation tools (ELK Stack, Splunk) and interpreting logs for incident detection.
  - Ability to perform basic troubleshooting steps (e.g., restarting services, running basic shell commands) to resolve issues.
- Communication Skills:
  - Strong communication skills to collaborate effectively with senior SREs, developers, and other teams.
  - Ability to document incidents, solutions, and troubleshooting steps clearly.
Preferred Skills:
- Basic Scripting & Automation:
  - Exposure to scripting languages such as Bash, Python, or Go to automate basic administrative tasks.
- Cloud Platform Experience:
  - Familiarity with other cloud technologies such as AWS, Azure, or Google Cloud Platform.
- Certifications:
  - Certifications such as CompTIA Linux+, AWS Certified Solutions Architect, Certified Kubernetes Administrator (CKA), or Certified OpenStack Administrator (COA) are a plus.