Algotale-Senior AI/ML Solution Architect
7 days ago
Senior AI/ML Solution Architect - Generative AI & Agentic Systems
Algotale is a premier IT staffing and software solutions provider, delivering top-tier talent and custom-built technology to drive business success. With a strong network of skilled professionals across software development, cloud solutions, and project management, we help companies scale efficiently and execute projects seamlessly. Our flexible engagement models cater to both short-term and long-term needs, ensuring precision-matched expertise for every requirement. From IT staffing to full-cycle software development, Algotale empowers businesses with innovative, high-impact solutions.
Position Overview
We are looking for a Senior AI/ML Solution Architect with deep expertise in Generative AI and agentic systems to lead the design and implementation of enterprise-scale AI solutions. This role requires a unique blend of hands-on technical expertise in both Large Language Models (LLMs) and Small Language Models (SLMs), combined with the architectural vision to deploy these solutions across diverse computing environments.
The ideal candidate will architect scalable agentic solutions, implement advanced fine-tuning strategies, and design comprehensive integration systems that connect AI capabilities with enterprise applications. You will be at the forefront of our AI transformation initiatives, working with cutting-edge technologies while maintaining a practical approach to deployment and optimization.
Experience Requirements
Overall Experience: 8+ years in technology and software development
Generative AI Experience: 2+ years of hands-on experience with LLMs and generative AI systems
Solution Architecture Experience: 4+ years architecting enterprise-scale solutions
Key Responsibilities
Architecture & Design
Design and architect scalable agentic solutions using advanced LLM capabilities
Implement Model Context Protocol (MCP) integrations to connect applications with diverse external
services and APIs
Develop multi-agent orchestration systems for complex workflow automation
Design context and memory management systems for persistent agent interactions
Technical Implementation
Build and optimize Retrieval-Augmented Generation (RAG) systems for efficient knowledge retrieval
Implement agent frameworks (LangChain, LangGraph, Semantic Kernel, Agno) for various deployment environments
Design and deploy model inference pipelines optimized for different computing environments (cloud, edge, on-premises)
Develop comprehensive fine-tuning strategies for both Large Language Models (LLMs) and Small Language Models (SLMs)
Architect SLM deployment strategies for resource-constrained environments
Implement model compression and quantization techniques for efficient inference
Integration & Connectivity
Architect REST/gRPC/GraphQL APIs and SDK integrations for seamless service connectivity
Implement event-driven architectures using webhooks and message buses
Design secure authentication and authorization systems (SSO/OIDC)
Build connectors for popular platforms (Slack, Jira, Salesforce, CRM/ERP systems)
Data & Model Management
Design comprehensive data preprocessing pipelines including cleaning, deduplication, and PII reduction
Implement embedding creation and re-embedding strategies for optimal retrieval
Develop chunking and windowing strategies for mobile-optimized content processing
Establish model selection criteria and evaluation frameworks
Required Technical Skills
Core AI/ML Expertise
Foundation Models: Deep experience with GPT-4, Claude, LLaMA, and other state-of-the-art LLMs
Small Language Models (SLMs): Expertise in deploying and optimizing SLMs (Phi-3, Gemma, TinyLlama) for mobile environments
Agent Frameworks: Proficiency in LangChain, LangGraph, Microsoft Semantic Kernel, Agno, and custom agent development
RAG Systems: Advanced knowledge of retrieval-augmented generation, vector databases, and semantic search
Fine-tuning & Adaptation
Advanced fine-tuning techniques: LoRA/QLoRA, DoRA, AdaLoRA for parameter-efficient training
Model compression: Pruning, quantization (INT8/INT4), knowledge distillation
Prompt-tuning, adapters, prefix tuning, and P-tuning v2 methodologies
RLHF/RLAIF techniques for alignment and preference learning
Domain-specific fine-tuning for mobile use cases and vertical applications
Deployment & Optimization
SLM Deployment: Expertise in deploying Small Language Models across various computing environments
Multi-Platform Optimization: Experience optimizing both LLMs and SLMs for cloud, edge, and on- premises deployment
Efficient Inference: Knowledge of quantization (GPTQ, AWQ, GGML), pruning, and distillation techniques
Model Compression: Advanced techniques for reducing model size while maintaining performance
Real-time Processing: Expertise in streaming inference and adaptive reasoning depth control
Performance Optimization: Proficiency in autoscaling, rate limiting, and resource management
Adaptive Fine-tuning
Environment-specific model adaptation and optimization
Federated learning approaches for distributed fine-tuning
Few-shot and zero-shot learning techniques for resource-efficient adaptation
Integration Technologies
MCP Implementation: Deep understanding of Model Context Protocol for service integration
API Development: Expertise in designing and implementing REST, gRPC, and GraphQL APIs
Event Systems: Experience with event buses, webhooks, and real-time communication
Security: Knowledge of secure storage, caching, and access control systems
Development Frameworks
Libraries: TensorFlow, PyTorch, Hugging Face Transformers, LlamaIndex
Application Development: Web frameworks, desktop applications, API development
Cloud Platforms: AWS, GCP, Azure with focus on AI/ML services
DevOps: CI/CD pipelines, containerization (Docker/Kubernetes), monitoring
Preferred Qualifications
Master's or PhD in Computer Science, AI, Machine Learning, or related field
Published research or contributions to open-source AI/ML projects
Experience with multi-modal models and cross-modal applications
Knowledge of MLOps best practices and model lifecycle management
Experience with regulatory compliance in AI systems (GDPR, AI Act, etc.)
Track record of leading AI transformation initiatives in enterprise environments
Certifications in cloud platforms (AWS, GCP, Azure) with focus on AI/ML services
Technical Competencies to Be Assessed
System design and architecture for distributed AI systems
Code review and optimization for production AI deployments
Performance benchmarking and model evaluation methodologies
Cost optimization strategies for large-scale AI deployments
Security and privacy considerations in AI systems
Scalability patterns for AI applications
-
Ai/ml Solutions Architect
2 weeks ago
Remote, India North Hires Full time1. **Machine Learning and AI** - **Machine Learning Algorithms**: - Deep understanding of various machine learning algorithms, model evaluation, and feature engineering. - **AI Solution Development**: - Expertise in developing and prototyping advanced AI solutions, including generative AI systems (e.g., working with models similar to GPT-3, T5). - **AI/ML...
-
Ai / Ml Soultion Architect
2 days ago
Remote, India vega consulting Full time**Role & responsibilities** He/she should be able to handle AI/ML workload across India. He/she will work with Sales team and OEMs in architecting the solution and running POCs/Pilots for the customers. Develop comprehensive AI/ML solution architectures, considering scalability, performance, and security. Work closely with the engineering teams to ensure...
-
AI Solutions Architect
1 week ago
Remote, India Aloola Full time ₹ 12,00,000 - ₹ 36,00,000 per yearWe are seeking a highly skilled and experienced AI Solutions Architect with expertise in Generative AI, Large Language Models (LLMs), and cloud-based deployments. This role is ideal for someone with a strong technical background, hands-on experience in building AI-driven solutions, and the ability to translate cutting-edge research into scalable, real-world...
-
Senior Software Engineer, ML
2 weeks ago
India, Karnataka-Bengaluru(Remote) Relyance AI Full time ₹ 12,00,000 - ₹ 36,00,000 per yearJob Description - Senior Software Engineer, MLAs Relyance AI's ML/NLP Engineer, you will strategize, drive, and execute on the initiatives in NLP for information extraction from legal documents, ML/NLP for information extraction from code and general ML in code analysis, as well as overall AI backend initiatives. You will partner with cross-functional...
-
Algotale-Lead Angular UI Developer
2 weeks ago
Remote, India Nexthire Full time ₹ 1,20,000 - ₹ 3,00,000 per yearLead Angular UI DeveloperExperience: 8+yrsSubmit only if:Have 5+ years of hands-on coding experience, strong expertise in Angular 16+ versions (this is a migration project from an older version to a new Angular version) and leading Angular web application development projects for clients. (so strong communication is skills is must to drive client...
-
Ai & Ml Intern
2 weeks ago
Remote, India Ultracures Full time**AI/ML Intern (Artificial Intelligence / Machine Learning)** **Location**: Remote **Duration**: 2 months **Type**: Internship (Unpaid) **About Us**: Ultracures is a growing organization working on innovative solutions using Artificial Intelligence and Machine Learning. We are looking for motivated interns who are passionate about AI/ML to join our...
-
GCP AI Architect
1 week ago
Remote, India TriDevSofts Full timeWe're Hiring: GCP AI Architect (Contractual Role) (Remote)Are you passionate about building intelligent solutions on Google Cloud Platform? We're looking for a skilled GCP AI Architect to join our team and lead the design and deployment of cutting-edge AI/ML systems.What You'll Do:Architect scalable AI/ML solutions using GCP services like Vertex AI,...
-
AI Architect
1 week ago
Remote, India Excellanto Ventures Pvt Ltd Full timeAI Architect (Remote)Experience: 10+ YearsJob Type: Full-Time, Remote (India)About the RoleWe are looking for an experienced AI Architect to lead the design, architecture, and implementation of Generative AI solutions on AWS and Azure cloud platforms.This role is ideal for someone who thrives on innovation, enjoys solving complex problems, and wants to be at...
-
Ml Solution Architect
4 days ago
Remote, India Ingenworks Full time**ML Solution Architect with Digital ADV** **Location: Remote** **Responsibilities**: - Collaborate with stakeholders to understand business objectives and requirements related to digital advertising campaigns - Design and implement machine learning solutions to optimize ad targeting, bidding strategies, and campaign performance - Analyze large datasets...
-
Senior AI/ML Engineer
2 weeks ago
Remote, India Techmora Full time ₹ 12,00,000 - ₹ 24,00,000 per yearDescription : Job Description : Senior Engineer - AI/ML (Databricks Certified) We are seeking a highly skilled AI Engineer with proven experience in developing, deploying, and optimizing AI models using the Databricks AI platform. The ideal candidate will have a strong background in machine learning, distributed data processing, and will be a champion...