Neuron SDK Release - June 24, 2025

ivashkst released this 25 Jun 02:54
· 52 commits to master since this release
90a7fe4

Neuron version 2.24 introduces new inference capabilities: prefix caching, disaggregated inference (Beta), and context parallelization support (Beta). This release also includes NKI language enhancements and improved profiling visualizations for debugging and performance analysis. Neuron 2.24 adds support for PyTorch 2.7 and JAX 0.6, updates existing DLAMIs and DLCs, and introduces a new vLLM inference container.