In cloud infrastructure, the components you never have to think about can sometimes have the biggest impact.
The Cluster Autoscaler, the component that automatically adjusts the size of your node pools based on demand, has been the subject of intense development by the GKE team.
When this capability is open-sourced, the wider community will also benefit from the improved Kubernetes performance.
Cluster Autoscaler is smarter about when to run its control loop, which avoids needless delays.
Google Cloud deployed a workload with 5,000 replicas on Autopilot and measured how long each pod took to become ready.
Google Cloud also measured how long each pod took to become ready in a 10-replica GPU deployment.
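The source does not say how the time-to-ready figures were collected. As a minimal sketch of one way to take a similar measurement yourself, the Python below reads a pod object in the JSON shape returned by `kubectl get pods -o json` and computes the seconds between `metadata.creationTimestamp` and the `lastTransitionTime` of the pod's `Ready` condition. The function name `seconds_to_ready` and the sample pod are hypothetical, and treating creation-to-Ready as the metric is an assumption, not Google Cloud's stated method.

```python
from datetime import datetime

# Timestamp layout used in Kubernetes object JSON, e.g. "2024-05-01T10:00:00Z".
TS_FORMAT = "%Y-%m-%dT%H:%M:%SZ"


def seconds_to_ready(pod):
    """Return seconds from pod creation to its Ready condition, or None if not Ready."""
    created = datetime.strptime(pod["metadata"]["creationTimestamp"], TS_FORMAT)
    for cond in pod.get("status", {}).get("conditions", []):
        if cond["type"] == "Ready" and cond["status"] == "True":
            ready = datetime.strptime(cond["lastTransitionTime"], TS_FORMAT)
            return (ready - created).total_seconds()
    return None


# Hypothetical pod, trimmed to the fields the function reads.
sample_pod = {
    "metadata": {"creationTimestamp": "2024-05-01T10:00:00Z"},
    "status": {
        "conditions": [
            {"type": "PodScheduled", "status": "True",
             "lastTransitionTime": "2024-05-01T10:01:10Z"},
            {"type": "Ready", "status": "True",
             "lastTransitionTime": "2024-05-01T10:01:35Z"},
        ]
    },
}

pending_pod = {
    "metadata": {"creationTimestamp": "2024-05-01T10:00:00Z"},
    "status": {"conditions": []},
}

ready_after = seconds_to_ready(sample_pod)
still_pending = seconds_to_ready(pending_pod)
```

Note that `lastTransitionTime` records the most recent change of the `Ready` condition, so a pod that lost and regained readiness would report the later time rather than its first ready moment.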
To assess end-user latency at P50 and P95, Google Cloud scaled the application on CPU, using an HPA CPU utilization target of 50%.
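To make the 50% target and the P50/P95 figures concrete, here is a rough Python sketch. `desired_replicas` applies the core ratio documented for the Horizontal Pod Autoscaler, desired = ceil(current replicas × current metric / target metric), and deliberately ignores the tolerance band, min/max replica bounds, and stabilization windows that a real HPA also applies. `percentile` uses the nearest-rank method, which is one common definition and an assumption here, since the source does not say how its percentiles were computed. Both function names and the sample latencies are hypothetical.

```python
import math


def desired_replicas(current_replicas, current_cpu_pct, target_cpu_pct=50):
    """Core HPA ratio only: no tolerance, bounds, or stabilization."""
    return math.ceil(current_replicas * current_cpu_pct / target_cpu_pct)


def percentile(samples, pct):
    """Nearest-rank percentile of a non-empty list of numbers."""
    ordered = sorted(samples)
    rank = math.ceil(pct * len(ordered) / 100)  # 1-based rank
    return ordered[max(rank, 1) - 1]


# 10 replicas averaging 80% CPU against a 50% target.
scaled_to = desired_replicas(10, 80)

# Hypothetical end-user latencies in milliseconds.
latencies_ms = [12, 15, 11, 40, 13, 14, 90, 16, 17, 18]
p50 = percentile(latencies_ms, 50)
p95 = percentile(latencies_ms, 95)
```

With these inputs the ratio is 80 / 50 = 1.6, so the sketch asks for 16 replicas. The gap between the P50 and P95 values shows why both are reported: the median hides the slow tail that a few requests experience while new capacity is still coming up.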