You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
We might want to think how this could be used with KPA. In the past we had a design document for VPA but the integration never landed. We probably want to revise that.
In place updates only really work when concurrency is very high (essentially you don't care to optimize the horizontal scaling case)
I am thinking of scenarios where you may want to grow the pod instead of the number of instances to avoid for example cold starts e.g. AI cases. You may also not to do so from the beginning so you don't request a lot of resources depending on the traffic pattern e.g. bursty.
/area autoscale
/area API
Describe the feature
cc @dprotaso @evankanderson @dsimansk
The text was updated successfully, but these errors were encountered: