Cloud Load Balancer, the secret to uptime for AI inference
See the detailed reference architecture → https://goo.gle/4bLQdap
Learn how to pair new cloud load balancing capabilities like custom metrics and service extensions with GKE Autopilot, which includes features like node auto-repair to automatically replace unhealthy nodes, and horizontal pod autoscaling to adjust resources based on application demand.
Subscribe to Google Cloud Tech → https://goo.gle/GoogleCloudTech
Speaker: Don McCasland
Products Mentioned: AI Infrastructure
Google Cloud Tech
Helping you build what's next with secure infrastructure, developer tools, APIs, data analytics and machine learning....