AI Platform
1 week ago
Role Description:
We are looking for a Cloud/Site Reliability Engineer (SRE) to join our AI Platform team, focused on building and maintaining highly available, scalable, and secure infrastructure for AI/ML workloads. This role is critical to ensure the reliability and performance of our AI services and platform components across cloud environments.
You will work closely with software engineers, ML engineers, and platform architects to design and implement robust monitoring, alerting, and incident response systems. Youll also contribute to automation of infrastructure provisioning, deployment pipelines, and performance tuning of AI workloads in production.
Roles & Responsibilities:
- Design and implement scalable, resilient cloud infrastructure to support AI/ML workloads.
- Develop and maintain observability tools including monitoring, logging, and alerting systems for AI platform services.
- Automate infrastructure provisioning and deployment using Infrastructure-as-Code (IaC) tools.
- Collaborate with engineering teams to ensure high availability and performance of AI services.
- Lead incident response and root cause analysis for platform outages or performance degradation.
- Implement security best practices and compliance controls across cloud environments.
- Optimize resource usage and cost efficiency of AI workloads in cloud environments.
- Participate in sprint planning and contribute to platform architecture and reliability strategy.
Must-Have Skills:
- Strong experience with cloud platforms (AWS, GCP, Azure) and cloud-native services.
- Proficiency in scripting and automation (Python, Bash, Terraform, etc.).
- Experience with containerization and orchestration (Docker, Kubernetes).
- Familiarity with monitoring and logging tools (Prometheus, Grafana, ELK, Datadog).
- Understanding of CI/CD pipelines and DevOps practices.
- Experience with incident management, root cause analysis, and reliability engineering.
- Knowledge of security principles and cloud compliance frameworks.
- Ability to learn quickly, be organized and detail oriented.
Good-to-Have Skills:
- Exposure to AI/ML workloads and performance tuning for model inference and training.
- Experience with MLOps tools (MLflow, Kubeflow, Airflow).
- Familiarity with service mesh technologies (Istio, Linkerd).
- Experience with cost optimization strategies in cloud environments.
- Knowledge of distributed systems and fault-tolerant architecture.
Education and Professional Certifications:
- Bachelors degree in computer science, Engineering, or related field.
- 5 to 9 years of experience in cloud infrastructure, DevOps, or SRE roles.
- Certifications in cloud platforms (AWS Solutions Architect, Azure Administrator, Google Cloud SRE) are a plus.
Soft Skills:
- Excellent analytical and troubleshooting skills.
- Strong verbal and written communication skills.
- Ability to work effectively with global, virtual teams.
- High degree of initiative and self-motivation.
- Ability to manage multiple priorities successfully.
- Team-oriented, with a focus on achieving team goals.
- Strong presentation and public speaking skills.
-
Platform Engineer
2 weeks ago
Hyderabad, Telangana, India Soul Ai Full time ₹ 8,00,000 - ₹ 24,00,000 per yearStep into the world of AI innovation with the Experts Community of Soul AI (By Deccan AI). We are looking for Indias top 1% Platform Engineers for a unique job opportunity to work with the industry leaders.- Who can be a part of the community. We are looking for Platform Engineers focusing on building scalable and high-performance AI/ML platforms. ...
-
Ai engineer- Genrative Ai
1 day ago
Hyderabad, Telangana, India Weekday AI Full time ₹ 24,00,000 - ₹ 42,00,000 per yearThis role is for one of the Weekday's clientsSalary range: Rs Rs ie INR 20-35 LPA)Min Experience: 5 yearsLocation: Hyderabad, ChennaiJobType: full-timeAs an AI/ML Engineer, you will be responsible for creating end-to-end machine learning solutions—from data exploration to model deployment. You will work closely with cross-functional teams to understand...
-
AI Trainer
3 days ago
Hyderabad, Telangana, India Soul AI Full time ₹ 5,00,000 - ₹ 15,00,000 per yearAbout UsSoul AI is a pioneering company founded by IIT Bombay and IIM Ahmedabad alumni, with a strong founding team from IITs, NITs, and BITS. We specialize in delivering high-quality human-curated data, AI-first scaled operations services, and more. Based in SF and Hyderabad, we are a young, fast-moving team on a mission to build AI for Good, driving...
-
Enterprise AI Analyst
3 days ago
Hyderabad, Telangana, India Turium Ai Full time ₹ 12,00,000 - ₹ 36,00,000 per yearAre you ready to build the future? At the intersection of AI, automation, and quantum-secure communications, Turium AI is looking for India's brightest minds to solve problems once thought impossible.Turium AI builds enterprise-grade AI systems that are deployed globally to tackle high-stakes challenges for industries and governments. Our new Global...
-
Lead – AI Platform
6 days ago
Hyderabad, Telangana, India G1 GLOBAL Full time ₹ 12,00,000 - ₹ 24,00,000 per yearLocation: Hyderabad (On-site) Full TimeDepartment: R& D EngineeringReports To: Head of Engineering / CTOOffice Timings: 12PM - 9PMPosition OverviewWe are hiring a technically hands-on Senior Architect to lead the design, development, and deployment of a modular AI-driven automation platform, integrating AI Agent framework, Rag Model solution, Large Language...
-
Lead AI Engineer
6 days ago
Hyderabad, Telangana, India Weekday AI Full time ₹ 12,00,000 - ₹ 36,00,000 per yearThis role is for Weekday's client.Role OverviewAs the Lead AI Engineer, you will be responsible for spearheading the design, development, and deployment of AI solutions. You will work with various large language models (LLMs)—both open-source and proprietary—optimizing them through fine-tuning, prompt engineering, agentic frameworks, and...
-
AI Functional Specialist
6 days ago
Hyderabad, Telangana, India WeXL AI Full time ₹ 8,00,000 - ₹ 20,00,000 per yearAbout CompanyWeXL aims to be a Global leader in AI and provide Innovative Technology Solutions for Next Gen Education ecosystem. We deliver AI Powered, "Make in India", Patented products/solutions that transform how K12 (Schools), Colleges, Universities, and Corporates empower their learners (students and employees).Our Innovative products, with a strong...
-
Senior AI Platform Engineer
20 hours ago
Hyderabad, Telangana, India Medtronic Full time ₹ 12,00,000 - ₹ 36,00,000 per yearAt Medtronic you can begin a life-long career of exploration and innovation, while helping champion healthcare access and equity for all. You'll lead with purpose, breaking down barriers to innovation in a more connected, compassionate world.A Day in the LifeOur Global Diabetes Capability Center in Pune is expanding to serve more people living with diabetes...
-
Hyderabad, Telangana, India Genpact Full time ₹ 2,00,00,000 - ₹ 2,50,00,000 per yearReady to build the future with AI? At Genpact, we don't just keep up with technology—we set the pace. AI and digital innovation are redefining industries, and we're leading the charge. Genpact's AI Gigafactory, our industry-first accelerator, is an example of how we're scaling advanced technology solutions to help global enterprises work smarter, grow...
-
Platform Engineer-Agentic AI, Python
2 weeks ago
Hyderabad, Telangana, India Nexifyr Full time ₹ 15,00,000 - ₹ 25,00,000 per yearJob DescriptionWe are looking for a Platform Engineer with 3–4 years of experience to join our team. In this role, you will work closely with our product and engineering teams to deliver high-quality, scalable solutions. You will leverage your technical expertise in Python and cloud platforms to design, build, and maintain the backend systems that power...