--- a/conf.py
+++ b/conf.py
@@ -157,7 +157,7 @@
 
 #top_banner_message="<span>⚠</span><a class='reference internal' style='color:white;' href='https://awsdocs-neuron.readthedocs-hosted.com/en/latest/general/setup/setup-troubleshooting.html#gpg-key-update'> Neuron repository GPG key for Ubuntu installation has expired, see instructions how to update! </a>"
 
-top_banner_message="Neuron 2.20.1 is released! check <a class='reference internal' style='color:white;' href='https://awsdocs-neuron.readthedocs-hosted.com/en/latest/release-notes/index.html#latest-neuron-release'> What's New </a> and <a class='reference internal' style='color:white;' href='https://awsdocs-neuron.readthedocs-hosted.com/en/latest/general/announcements/index.html'> Announcements </a>"
+top_banner_message="Neuron 2.20.2 is released! check <a class='reference internal' style='color:white;' href='https://awsdocs-neuron.readthedocs-hosted.com/en/latest/release-notes/index.html#latest-neuron-release'> What's New </a> and <a class='reference internal' style='color:white;' href='https://awsdocs-neuron.readthedocs-hosted.com/en/latest/general/announcements/index.html'> Announcements </a>"
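The banner string in the ``conf.py`` change above repeats the release number and both documentation URLs on every version bump. The following is a minimal sketch, not the project's actual configuration, of deriving the same string from a single release value. The names ``DOCS_BASE``, ``LINK_ATTRS``, and ``build_banner`` are hypothetical and do not appear in the original ``conf.py``; only ``top_banner_message`` and the URLs are taken from the diff.

```python
# Hypothetical helper for a Sphinx conf.py; only top_banner_message and the
# URLs come from the diff above, every other name is an illustrative assumption.
DOCS_BASE = "https://awsdocs-neuron.readthedocs-hosted.com/en/latest"
LINK_ATTRS = "class='reference internal' style='color:white;'"


def build_banner(release: str) -> str:
    """Return the banner HTML for the given Neuron release number."""
    whats_new = f"{DOCS_BASE}/release-notes/index.html#latest-neuron-release"
    announcements = f"{DOCS_BASE}/general/announcements/index.html"
    return (
        f"Neuron {release} is released! check "
        f"<a {LINK_ATTRS} href='{whats_new}'> What's New </a> and "
        f"<a {LINK_ATTRS} href='{announcements}'> Announcements </a>"
    )


# A version bump then touches one argument instead of the whole string.
top_banner_message = build_banner("2.20.2")
```

With this shape, a change such as 2.20.1 to 2.20.2 would be a one-token diff, which makes an accidental edit to a URL or an attribute easier to spot in review.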
--- a/release-notes/index.rst
+++ b/release-notes/index.rst
@@ -11,6 +11,23 @@ What's New
 .. _neuron-2.20.0-whatsnew:
 
 
+Neuron 2.20.2 (11/20/2024)
+---------------------------
+
+Neuron 2.20.2 release fixes a stability issue in Neuron Scheduler Extension that previously caused crashes in Kubernetes (K8) deployments. See :ref:`neuron-k8-rn`.
+
+This release also includes a security patch to Neuron Driver that fixes a kernel address leak issue.
+For details, see :ref:`neuron-driver-release-notes` and :ref:`neuron-runtime-rn`.
+
+Additionally, Neuron 2.20.2 release updates the ``torch-neuronx`` and ``libneuronxla`` packages to add support for the ``torch-xla`` 2.1.5 package,
+which fixes checkpoint loading issues with Zero Redundancy Optimizer (ZeRO-1). See :ref:`torch-neuronx-rn` and :ref:`libneuronxla-rn`.
+
+Neuron-supported DLAMIs and DLCs are updated with this release (Neuron 2.20.2 SDK). The Training DLC is also updated to address the
+version dependency issues in the NxD Training library. See :ref:`neuron-dlc-release-notes`.
+
+The NxD Training library in Neuron 2.20.2 release is updated to the transformers 4.36.0 package. See :ref:`neuronx-distributed-training-rn`.
--- a/release-notes/neuronx-distributed-training/neuronx-distributed-training.rst
+++ b/release-notes/neuronx-distributed-training/neuronx-distributed-training.rst
@@ -10,14 +10,25 @@ NxD Training Release Notes (``neuronx-distributed-training``)
 
 This document lists the release notes for Neuronx Distributed Training library.
 
-.. _neuronx-distributed-rn-1-0-0:
+.. _neuronx-distributed-training-rn-1-0-1:
+
+Neuronx Distributed Training [1.0.1]
+
+Date: 11/20/2024
+
+Features in this release
+------------------------
+
+* Added support for transformers 4.36.0
+
+.. _neuronx-distributed-training-rn-1-0-0:
 
 Neuronx Distributed Training [1.0.0]
 
 Date: 09/16/2024
 
-Features this release
----------------------
+Features in this release
+------------------------
 
 This is the first release of NxD Training (NxDT). NxDT is a PyTorch-based library that adds support for a user-friendly distributed training experience through a YAML configuration file compatible with NeMo, allowing users to easily set up their training workflows. At the same time, NxDT maintains flexibility, enabling users to choose between using the YAML configuration file, PyTorch Lightning Trainer, or writing their own custom training script using the NxD Core.
 The library supports PyTorch model classes including Hugging Face and Megatron-LM. Additionally, it leverages NeMo's data engineering and data science modules, enabling end-to-end training workflows on NxDT and providing compatibility with NeMo through minimal changes to the YAML configuration file for models that are already supported in NxDT. Furthermore, the functionality of the Neuron NeMo Megatron (NNM) library is now part of NxDT, ensuring a smooth migration path from NNM to NxDT.
--- a/release-notes/runtime/aws-neuronx-dkms/index.rst
+++ b/release-notes/runtime/aws-neuronx-dkms/index.rst
@@ -15,6 +15,15 @@ Updated : 04/29/2022
 
 - In rare cases of multi-process applications running under heavy stress, a model load failure may occur. This may require reloading of the Neuron Driver as a workaround.
 
+
+Neuron Driver release [2.18.20.0]
+---------------------------------
+Date: 11/20/2024
+
+Bug Fixes
+^^^^^^^^^
+* This release addresses an issue with Neuron Driver that can lead to a user-space application either gaining access to kernel addresses or providing the driver with spoofed memory handles (kernel addresses) that could potentially be used to gain elevated privileges. We would like to thank `Cossack9989 <https://github.com/Cossack9989>`_ for reporting and collaborating on this issue.