Auto-scaling LLM applications and workloads

Date: