Skip to content

Releases: NVIDIA/KAI-Scheduler

v0.4.8

16 May 09:29

Choose a tag to compare

Fixed

  • Queue order function now takes into account potential victims, resulting in better reclaim scenarios.

CHANGED

  • Cached GetDeservedShare and GetFairShare function in the scheduler PodGroupInfo to improve performance.
  • Added cache to the binder resource reservation client.
  • More Caching and improvements to PodGroupInfo class.
  • Update pod labels after scheduling decision concurrently in the background.

v0.5.0

08 May 08:02
4607d46

Choose a tag to compare

What's Changed

New Contributors

Full Changelog: v0.4.7...v0.5.0

v0.4.7

29 Apr 17:16
73e280a

Choose a tag to compare

What's Changed

  • fix: snapshot tool cache.Run call by @enoodle in #102
  • Docs hotfix: Update and rename pytorch-elasitc.yaml to pytorch-elastic.yaml by @EkinKarabulut in #105
  • Adding Issue Templates for bug & feature/enhancement requests by @EkinKarabulut in #103
  • fix: gpu resource device count calculation by @enoodle in #107

Full Changelog: v0.4.6...v0.4.7

v0.4.6

24 Apr 09:03
ead9f27

Choose a tag to compare

What's Changed

Full Changelog: v0.4.5...v0.4.6

v0.4.5

23 Apr 21:37
3db910e

Choose a tag to compare

What's Changed

Full Changelog: v0.4.4...v0.4.5

v0.4.4

23 Apr 14:04
b4cf379

Choose a tag to compare

What's Changed

Full Changelog: v0.4.3...v0.4.4

v0.4.3

21 Apr 12:41
9bf2add

Choose a tag to compare

What's Changed

Full Changelog: v0.4.2...v0.4.3

v0.4.2

21 Apr 11:21
4623c77

Choose a tag to compare

What's Changed

  • Add delayedLauncherCreationPolicy handeling to the pod-grouper by @davidLif in #75
  • Bump golang.org/x/net from 0.37.0 to 0.38.0 by @dependabot in #76
  • docs: specify legacy v1 version of kubeflow-training-operator by @tgasla in #71
  • Add a simple DRA example by @guptaNswati in #59
  • BUG fix - handle pod grouper top owner climbing RBAC error by @davidLif in #77
  • Changed nvcr to ghcr in all workflows, scripts and docs by @romanbaron in #78

New Contributors

Full Changelog: v0.4.1...v0.4.2

v0.4.1

16 Apr 08:30
67310e3

Choose a tag to compare

What's Changed

  • Adding support channels to README.md by @EkinKarabulut in #64
  • storing CI artifacts on ghcr.io by @romanbaron in #58
  • Main- Bug fix - when syncing the inFlight PG status, override only if the t… by @davidLif in #70
  • fix(status-updater): memory leak fix - remove unused inFlight PGs by @enoodle in #73

New Contributors

Full Changelog: v0.4.0...v0.4.1

v0.4.0

09 Apr 10:08
b4338a1

Choose a tag to compare

What's Changed

Full Changelog: v0.3.0...v0.4.0