GCP Infrastructure Engineer
20 hours ago
Explore your next opportunity at a Fortune Global 500 organization. Envision innovative possibilities, experience our rewarding culture, and work with talented teams that help you become better every day. We know what it takes to lead UPS into tomorrow: people with a unique combination of skill + passion. If you have the qualities and drive to lead yourself or teams, there are roles ready to cultivate your skills and take you to the next level.

Job Description:

Job Summary:
We are seeking a highly skilled GCP Infrastructure Engineer to design, build, and manage the cloud infrastructure that powers Generative AI (GenAI) applications at scale. In this role, you will leverage Google Cloud Platform (GCP) Vertex AI, IBM Watsonx, and containerization technologies such as Docker and Kubernetes (GKE) to deliver secure, scalable, and high-performance AI solutions. You will own the end-to-end infrastructure lifecycle, from design and provisioning to automation, monitoring, and optimization, while enabling data scientists and ML engineers to seamlessly deploy and operate GenAI workloads.

Key Responsibilities:

Cloud Infrastructure & Platform Engineering
Design, provision, and maintain scalable, secure, and cost-efficient infrastructure for GenAI applications on GCP.
Deploy and manage containerized workloads using Docker and Kubernetes (GKE).
Configure and optimize Vertex AI and IBM Watsonx platforms for training, fine-tuning, and serving LLMs and other generative models.
Implement high-performance GPU/TPU clusters to support distributed training and large-scale inference.
Ensure business continuity through backup, disaster recovery, and multi-region deployments.

Automation & Reliability
Develop and maintain Infrastructure as Code (IaC) templates with Terraform or Cloud Deployment Manager.
Adopt GitOps practices (Flux) for infrastructure lifecycle management.
Build and optimize CI/CD pipelines for data pipelines, model workflows, and GenAI applications.
Apply SRE principles (SLIs, SLOs, SLAs) to guarantee platform reliability and uptime.

Security, Governance & Compliance
Embed DevSecOps best practices across the infrastructure lifecycle, including policy-as-code, vulnerability scanning, and secrets management.
Enforce identity and access management (IAM), network segmentation, and data encryption in compliance with standards (HIPAA, SOX, GDPR, FedRAMP).
Collaborate with enterprise security and compliance teams to implement governance frameworks for GenAI platforms.

Monitoring, Observability & Cost Optimization
Implement observability stacks (Prometheus, Grafana, Cloud Monitoring, Datadog) for both infra health and ML-specific metrics (model drift, data anomalies).
Define KPIs to monitor system health, performance, and adoption across AI workloads.
Optimize cloud cost efficiency for GPU/TPU-intensive workloads using autoscaling, preemptible instances, and utilization monitoring.

Collaboration & Enablement
Partner with data scientists, ML engineers, and software teams to streamline GenAI application development and deployment.
Provide onboarding, documentation, and reusable templates to enable faster adoption of AI infrastructure.
Stay current with the latest advancements in GenAI, cloud-native infrastructure, and container orchestration.

Required Qualifications

Education
Bachelor's or master's degree in Computer Science, Software Engineering, or a related field.

Experience
5+ years of experience in cloud infrastructure engineering, DevOps, or platform engineering.
Experience with GenAI use cases (chatbots, content generation, code assistants, etc.).
Strong hands-on expertise with Google Cloud Platform (GCP), especially Vertex AI.
Experience with IBM Watsonx for AI application deployment and management.
Proven skills in Docker, Kubernetes (GKE), and container orchestration at scale.
Proficiency in Python, Bash, or other relevant scripting languages.
Strong understanding of cloud networking, IAM, and security best practices.
Experience with CI/CD tools (GitHub Actions, GitLab CI, Jenkins) and IaC tools (Terraform, Pulumi, Ansible, Deployment Manager).
Familiarity with data pipelines and integration tools (Dataflow, Apache Beam, Pub/Sub, Kafka).
Excellent problem-solving, debugging, and communication skills.

Preferred Experience
Experience in MLOps practices for model deployment, monitoring, and retraining.
Exposure to multi-cloud or hybrid cloud environments (GCP, AWS, Azure, on-prem).
Hands-on experience with feature stores (Vertex AI Feature Store, Feast) and ML observability tools (EvidentlyAI, Fiddler).
Knowledge of distributed training frameworks (Horovod, DeepSpeed, PyTorch Distributed).
Contributions to open-source projects in infrastructure, MLOps, or GenAI.
Experience managing infrastructure in regulated industries.

Preferred Certifications:
Google Cloud Certified - Professional Cloud Architect
Google Cloud Certified - Machine Learning Engineer
Certified Kubernetes Administrator (CKA) or Certified Kubernetes Application Developer (CKAD)
IBM Certified Watsonx Generative AI Engineer – Associate
IBM Certified Solution Architect - Cloud Pak for Data
Other relevant certifications in AI, Machine Learning, or Cloud-Native technologies.

Employee Type: Permanent

UPS is committed to providing a workplace free of discrimination, harassment, and retaliation.
-
India Cortex Consultants Full time
Job Title: Site Reliability Engineer (SRE) – GCP Infrastructure
Experience: 5+ years
Location: Bangalore (Work from Office)
Shift Timing: 2:00 PM - 10:00 PM
About the Role: We're on the lookout for an experienced and motivated Site Reliability Engineer (SRE) to join our team in Bangalore. As an SRE, you'll be responsible for ensuring the reliability and...
-
India Cortex Consultants Full time ₹ 15,00,000 - ₹ 25,00,000 per year
Job Title: Site Reliability Engineer (SRE) – GCP Infrastructure
Experience: 5+ years
Location: Bangalore (Work from Office)
Shift Timing: 2:00 PM - 10:00 PM
About the Role: We're on the lookout for an experienced and motivated Site Reliability Engineer (SRE) to join our team in Bangalore. As an SRE, you'll be responsible for ensuring the reliability and...
-
GCP Data Platform Engineer
16 hours ago
Bengaluru, India Innova ESI Full time
Job Description
Role: GCP Data Platform Engineer
Experience: 6+ Years
Location: Bangalore | Hyderabad | Gurugram | Noida | Pune
Notice: Immediate Joiners Only
Job description for the Data Platform Engineer role in cloud services:
- Hands-on project experience with GCP core products and services, GCP Networking, VPCs, VPC-SC, Google Artifact Registry, etc.
-...
-
GCP Network Engineer
4 weeks ago
India Aviato Consulting Full time
Job Description
Aviato Consulting is searching for an experienced Google Cloud Network Engineer. You'll be instrumental in designing and securing cutting-edge Google Cloud network infrastructure for international businesses, directly impacting clients across APAC. We are ex-Googlers and believe in true partnership, understanding client challenges deeply to...
-
GCP Platform Engineer
1 day ago
India PamTen Inc Full time
Remote within India, Full Time role
The IT Cloud Platform Services team is a dynamic mix of tech enthusiasts, problem solvers, and creative thinkers, united by our passion for leveraging cutting-edge technology to transform healthcare and empower our customers to take real-time control of their health.
Where you come in: As a Cloud Platform Engineer 2, you...
-
GCP Platform Engineer
22 hours ago
India PamTen Inc Full time
Remote within India, Full Time role
The IT Cloud Platform Services team is a dynamic mix of tech enthusiasts, problem solvers, and creative thinkers, united by our passion for leveraging cutting-edge technology to transform healthcare and empower our customers to take real-time control of their health.
Where you come in: As a Cloud Platform Engineer 2, you will...
-
GCP Platform Engineer
14 hours ago
India PamTen Inc Full time
Remote within India, Full Time role
The IT Cloud Platform Services team is a dynamic mix of tech enthusiasts, problem solvers, and creative thinkers, united by our passion for leveraging cutting-edge technology to transform healthcare and empower our customers to take real-time control of their health.
Where you come in: As a Cloud Platform Engineer 2, you...
-
Infrastructure Engineer
2 weeks ago
India Orbit Core Tech Full time
Orbit Core Tech is a technology and innovation company specializing in AI-driven products and enterprise IT services. Our mission is to simplify complex challenges in healthcare, education, HR, and communication through secure, scalable, and intelligent solutions.
Our Product Suite: Orbit Care | Orbit Learn | Orbit Hire | Orbit Connect
Our Services: Software...
-
Data Engineer
14 hours ago
India HISH IT SERVICES Full time
We have a new urgent GCP Data Engineer opportunity open to support a migration initiative from Teradata to Cerebro (BigQuery). This role requires a hands-on developer who can collaborate closely with our data and reporting teams to ensure smooth repointing and validation of Power BI reports.
Request Details:
Title: GCP Data Engineer
Seniority: Mid to Senior...
-
GCP Data Engineer
1 week ago
India Xsell Resources Full time
#offshorejobs #india #remotework #GCPDataengineer
Seeking a GCP Certified Data Engineer to work remotely from India for our Fortune 5 healthcare client in the US.
Remote work from India.
2nd shift work hours.
Must be an immediate joiner. No notice periods of more than 15 days.
Requirements: 5 years of proven hands-on experience with GCP data services. GCP Google...