TechOps Expert

1 day ago


hubballi, India beBeeTechOps Full time

Agentic & AI Tech Ops ExpertWe are seeking an experienced professional to oversee the reliability, scalability, and efficiency of our AI and Agentic AI systems in production.Deployment & InfrastructureDeploy and manage AI models, agentic systems, and infrastructure across cloud platforms.Implement continuous integration and deployment pipelines for AI/ML and agentic applications.Optimize cloud resources for cost and scalability.Monitoring & Incident ManagementDevelop monitoring, logging, and alerting solutions for AI systems.Handle incident response, root cause analysis, and maintain runbooks.Automation & Operational ExcellenceAutomate deployments and maintenance using Python/Bash and infrastructure as code tools.Enforce security, compliance, and operational best practices.Collaboration & DocumentationWork with AI developers, data scientists, and architects for smooth production transitions.Maintain clear documentation and provide feedback on system performance.Required Skills and QualificationsBachelor's degree in Computer Science, Information Technology, Engineering, or related field.2–4+ years of experience in Tech Ops, DevOps, Site Reliability Engineering, or MLOps roles.Hands-on experience with cloud platforms (Google Cloud Platform—Vertex AI, Amazon Web Services, Microsoft Azure).Proficiency in Python/Bash scripting, continuous integration and deployment tools.Experience with Docker, Kubernetes, monitoring tools.Knowledge of networking, security, and infrastructure as code principles.Strong troubleshooting and communication skills.