
Senior CloudOps
19 hours ago
⚙️ Senior CloudOps, MLOps Infrastructure Engineer Job Description ⚙️

Join Rapid7: Secure the Future with AI

Overview

We are seeking an experienced and highly specialised Senior MLOps Infrastructure Engineer to manage, automate, and secure our production cloud infrastructure and Machine Learning (ML)/Large Language Model (LLM) operational pipelines. This role is strictly focused on the operations and infrastructure that support our data science and engineering teams; it is not a data science or core LLM development position.

Key Responsibilities and Required Expertise

The successful candidate will be an expert in all of the following areas, driving high availability, scalability, and security.

I. Cloud Infrastructure & Automation
- Infrastructure as Code (IaC): Deep expertise in managing and provisioning infrastructure using Terraform.
- Containerization & Orchestration: Advanced deployment, scaling, and management of services using Docker/Kubernetes.
- Networking & Services: Architecting and maintaining high-performance API Layers & Microservices.
- AWS CloudOps: Expert proficiency in AWS operational services, including EventBridge and Step Functions, for building robust automation flows.
- Data Storage: Managing and optimizing critical AWS data services, including S3, DynamoDB, Redshift, and Kinesis.

II. MLOps Tooling & Monitoring
- ML/LLM Tooling Support: Provide and maintain the operational infrastructure for ML/LLM systems, including Model Registry/Versioning tools like MLflow/SageMaker.
- Pipeline Automation (CI/CD): Designing and implementing robust CI/CD pipelines for ML/LLM deployments using tools like GitHub Actions/Jenkins.
- Model Operations: Building the infrastructure to support Drift Detection & Retraining capabilities.
- Monitoring & Alerting: Implementing comprehensive observability stacks using Prometheus/Grafana/CloudWatch.
- Incident Management: Leading resolution efforts for production issues, including expertise with PagerDuty and on-call responsibilities.

III. Security & Compliance (FinOps)
- Cloud Security: Establishing and enforcing strong security policies and best practices across the cloud environment (IAM, VPC, Secrets).
- AWS Security Services: Expert knowledge and application of specific AWS security tools like IAM, KMS, and Secrets Manager.
- Cost Optimization: Leading initiatives for Cost Optimization (FinOps), balancing performance and efficiency across all cloud resources.
-
Lead DBA cum Ops Engineer
3 days ago
New Delhi, India Black Duck Full time
Lead / Staff Software Engineer (PostgreSQL DB Ops: PostgreSQL Admin + CloudOps):
Black Duck Software, Inc. helps organizations build secure, high-quality software, minimizing risks while maximizing speed and productivity. Black Duck, a recognized pioneer in application security, provides SAST, SCA, and DAST solutions that enable teams to quickly find and fix...
-
Sr MLOPS Infra Engineer
4 days ago
New Delhi, India Rapid7 Full time
⚙️ Senior MLOps Infrastructure Engineer Job Description ⚙️
Join Rapid7: Secure the Future with AI
Overview
We are seeking an experienced and highly specialised Senior MLOps Infrastructure Engineer to manage, automate, and secure our production cloud infrastructure and Machine Learning (ML)/Large Language Model (LLM) operational pipelines. This role is...
-
Senior Site Reliability Engineer
4 days ago
New Delhi, India Poshmark Full time
We’re looking for an experienced Site Reliability Engineer to fill the mission-critical role of ensuring that our complex, web-scale systems are healthy, monitored, automated, and designed to scale. You will use your background as an operations generalist to work closely with our development teams from the early stages of design all the way through...
-
Senior Site Reliability Engineer
19 hours ago
New Delhi, India Poshmark Full time
We’re looking for an experienced Site Reliability Engineer to fill the mission-critical role of ensuring that our complex, web-scale systems are healthy, monitored, automated, and designed to scale. You will use your background as an operations generalist to work closely with our development teams from the early stages of design all the way through...
-
Lead Engineer- Cloud Ops
4 weeks ago
Delhi, India Pentair Full time
Job Description
Position Title: CloudOps Engineer/Senior CloudOps Engineer – L2
Reports to (Title): Service Delivery Manager – Managed Services
Experience: 2-4 Years
Location: Noida
Position Summary
Pentair is currently seeking Managed Services CloudOps for IoT projects in the Smart Products & IoT Strategic Innovation Centre in India team. This role is...
-
Senior Site Reliability Engineer
3 weeks ago
Delhi, India Poshmark Full time
We’re looking for an experienced Site Reliability Engineer to fill the mission-critical role of ensuring that our complex, web-scale systems are healthy, monitored, automated, and designed to scale. You will use your background as an operations generalist to work closely with our development teams from the early stages of design all the way through...