
System Reliability Engineer and AI Specialist
2 weeks ago
We are seeking an experienced System Reliability Engineer + AI Specialist to design, implement, and maintain scalable and reliable systems that integrate AI technologies.
Key Responsibilities:
- Linux System Administration: Manage, configure, and optimize Linux servers (RHEL, Ubuntu, or similar), including applying patches, performing security hardening, and tuning for performance.
- Kubernetes Management: Deploy, maintain, and troubleshoot Kubernetes clusters to ensure high availability and scalability.
- Data Center & Hardware Operations: Oversee physical infrastructure including servers, storage systems, and networking hardware in on-premises data centers.
- Security & Compliance: Apply updates and security patches to Linux-based Kubernetes environments and ensure alignment with internal security and compliance policies.
- Collaboration & Support: Work closely with global SRE and Platform teams to support enterprise systems and Kubernetes clusters.
- Incident & Case Management: Manage service tickets effectively using tools such as ServiceNow or Salesforce.
Required Skills and Qualifications:
- Hands-on experience in Linux system administration (RHEL, Ubuntu, or equivalent) with certifications like RHCSA or RHCE being a plus.
- Solid knowledge and practical experience in Kubernetes administration with certifications like CKA or CKS being an advantage.
- Experience working with bare-metal infrastructure, including servers, storage arrays, and networking components.
- Good understanding of networking protocols and technologies such as TCP/IP, DNS, firewalls, and load balancers with familiarity with Juniper OS being a plus.
- Strong troubleshooting skills across infrastructure layers including hardware, OS, and Kubernetes.
- Experience with automation tools like Ansible, Bash, or Python being beneficial.
- Knowledge of monitoring and observability tools such as Prometheus, Grafana, and ELK Stack being a plus.
Soft Skills:
- Excellent communication and collaboration abilities to work with global teams.
- Strong problem-solving skills with a focus on reliability, scalability, and automation.
- Adaptability to fast-paced, evolving environments—especially those involving AI/ML technologies.
This role requires a unique blend of technical expertise and soft skills. As a System Reliability Engineer and AI Specialist, you will be responsible for designing, implementing, and maintaining scalable and reliable systems that integrate AI technologies.
With your hands-on experience in Linux system administration and solid knowledge of Kubernetes administration, you will be able to tackle complex challenges and drive results. Your strong troubleshooting skills and experience with automation tools will also enable you to streamline processes and improve efficiency.
As a collaborative team player, you will work closely with global SRE and Platform teams to support enterprise systems and Kubernetes clusters. Your excellent communication and problem-solving skills will also help you navigate complex situations and build strong relationships with stakeholders.
Ultimately, this role is ideal for someone who is passionate about AI, has a keen eye for detail, and is not afraid to learn and adapt quickly. If you're excited about the prospect of joining a dynamic team and contributing to cutting-edge projects, we encourage you to apply today.
-
System Reliability Specialist
1 week ago
Palakkad, Kerala, India beBeeReliability Full time US$ 12,00,000 - US$ 18,00,000System Reliability Position">Key responsibilities for this role include ensuring the scalability and reliability of our systems. As a key member of the engineering group, you will be responsible for implementing scalable and highly available systems.">Design and implement systems that meet scalability and availability requirements.Collaborate with...
-
Job Posting: AI
2 weeks ago
Palakkad, Kerala, India Ai PRACTICE MANAGEMENT LLC Full timeCompany Description At Ai PRACTICE MANAGEMENT LLC, our mission is to revolutionize medical billing by delivering seamless, efficient, and automated solutions. With over a decade of expertise across various specialties, we are committed to simplifying financial operations for healthcare providers. By prioritizing quality and accuracy, we ensure faster...
-
AI Engineering Specialist
2 weeks ago
Palakkad, Kerala, India beBeeArtificialintelligence Full time ₹ 2,00,00,000 - ₹ 2,50,00,000Ai Engineering SpecialistWe are seeking an experienced AI/ML engineer to design and implement cutting-edge solutions in the field of Generative AI and Large Language Models.This role involves leading the development of intelligent agents, prompt optimization, and scalable AI pipelines, while mentoring team members and driving innovation in applied AI.Key...
-
AI Model Deployment Specialist
2 weeks ago
Palakkad, Kerala, India beBeeArtificial Full time ₹ 20,00,000 - ₹ 25,00,000Job Title: AI Integration SpecialistWe are seeking a skilled and detail-oriented AI Integration Engineer to join our organization in the region. In this role, you will be responsible for deploying and integrating artificial intelligence and machine learning models into production environments, ensuring their scalability, reliability, and security.Key...
-
AI Systems Lead
2 weeks ago
Palakkad, Kerala, India beBeeMachineLearning Full time ₹ 1,50,00,000 - ₹ 2,00,00,000">Lead AI/ML System Developer">We are seeking a highly skilled Lead AI/ML System Developer to drive the development and deployment of cutting-edge AI/ML systems, focusing on Large Language Models (LLMs), Retrieval-Augmented Generation (RAG), AI agents, and intelligent automation.">This role involves working closely with cross-functional teams to translate...
-
Site Reliability Infrastructure Specialist
1 week ago
Palakkad, Kerala, India beBeeInfrastructure Full time ₹ 15,00,000 - ₹ 25,00,000Site Reliability EngineerWe are seeking a seasoned Site Reliability Engineer who can craft scalable infrastructure on cloud platforms using Kubernetes, Docker, and Terraform.This role is ideal for an experienced engineer who thrives at the intersection of software development and systems operations, and wants to build high-performance infrastructure that...
-
Reliable Systems Engineer
2 weeks ago
Palakkad, Kerala, India beBeeSystem Full time ₹ 12,00,000 - ₹ 24,00,000About This RoleAs a Site Reliability Engineer, you will be responsible for ensuring the smooth operation of our distributed systems. You will work closely with developers to establish and uphold quality and performance benchmarks, ensuring that applications meet necessary criteria before they are deployed to production.Key Responsibilities:Oversee and...
-
Advanced System Reliability Specialist
2 weeks ago
Palakkad, Kerala, India beBeeReliability Full time ₹ 1,50,00,000 - ₹ 2,50,00,000System Reliability Engineer OpportunityDelta Tech Hub is a hub of innovation and technology, contributing to the company's objectives by delivering niche solutions that support various teams and functions across the company.We execute incident, change management, and problem management processes to ensure seamless operations.We build and support reliable...
-
Reliable System Specialist
1 week ago
Palakkad, Kerala, India beBeeResiliency Full time ₹ 1,00,00,000 - ₹ 2,00,00,000Key ResponsibilitiesAs a Resiliency Tester, you will be responsible for designing and implementing resiliency-focused testing strategies for enterprise-grade applications and infrastructure.Develop, execute, and maintain test plans, scripts, and scenarios that validate fault tolerance, system recovery, and high availability. Perform stress, load, chaos, and...
-
AI Data Specialist
2 weeks ago
Palakkad, Kerala, India beBeeData Full time ₹ 18,00,000 - ₹ 27,00,000AI Data SpecialistWe are seeking a highly skilled AI Data Specialist to join our team of experts in developing and implementing data-driven solutions for education and research.The ideal candidate will have a strong background in data engineering, with expertise in designing, building, and operating scalable, fault-tolerant data infrastructure to support...