In June 2026, NVIDIA merged two pull requests into its open-source Kubernetes AI scheduler that finally shipped container-level GPU memory hard isolation, ending a decade where multiple tenants sharing one accelerator could silently oversubscribe each other into OOM crashes. The KAI Scheduler now relies on HAMi-core, a CUDA interception library, …