
Releases: NVIDIA-NeMo/NeMo

25.09-alpha.rc2

07 Oct 19:58
f1ba52e
Update lora.py

Signed-off-by: Michał Marcinkiewicz <[email protected]>

NVIDIA Neural Modules 2.5.0

07 Oct 00:36
ddcb2d6

Highlights

  • Collections:

    • LLM
      • Nano v2 12B and 9B
    • Speech
      • New SpeechLM2 collection
      • Streaming Sortformer model
      • Deprecate Confidence Ensemble models
      • parakeet-tdt-0.6b-v3 and canary-1b-v2 models
      • Added chunked inference support with .transcribe() for Canary-based models
      • Enable prediction of timestamps with streaming ASR
      • Improve ASR models’ invariance to padding/batch size
      • Qwen prompt format support, SALM generation fixes
      • High-level SALM model.generate API closely resembling HF models
      • SALM model initialization with time/memory optimization
      • SpeechLM2: fixed excessive padding, support on-the-fly resampling for SALM
  • Automodel and Export-Deploy functionality is now available in separate, dedicated repositories and is deprecated in NeMo 2

Detailed Changelogs:

ASR

Changelog

TTS

Changelog

NLP / NMT

Changelog

Text Normalization / Inverse Text Normalization

Changelog

Export

Changelog

Bugfixes

Changelog
  • Fix a bug in the Hyena get_t function that surfaces during longer-context inference by @jstjohn :: PR: #14256
  • Fix skipped cuHyena kernel during training by @farhadrgh :: PR: #14365
  • Remove flaky Evo2 dataset performance test by @jstjohn :: PR: #14371
  • Use module prefix in restore_modelopt_state by @jenchen13 :: PR: #14384

Uncategorized:

Changelog

NVIDIA Neural Modules 2.4.1

30 Sep 17:10
2919fed

Detailed Changelogs:

Uncategorized:

Changelog

NVIDIA Neural Modules 2.3.3

25 Sep 22:10
26be5f5

25.09-alpha.rc1

03 Oct 14:36
[Flux] Add cuda_graph_scope and cache images ids for full iteration c…

NVIDIA Neural Modules 2.5.0rc0

03 Aug 16:50
620d2ba
Pre-release

Prerelease: NVIDIA Neural Modules 2.5.0rc0 (2025-08-03)

NVIDIA Neural Modules 2.4.0

25 Jul 18:12
2381f42

Highlights

  • Collections:
    • Speech
      • Batched beam search for transducers (RNN-T and TDT)
        • RNNT/TDT buffered/streaming inference + batched decoding support in cache-aware
        • Add support for CTC batched beam search with GPU-LM
        • Key fixes
          • Punctuation Marks in Timestamps
          • Fix timestamps when CUDA graphs are enabled
          • Fix masking of <pad> tokens in AED inference
          • TDT streaming inference fix
    • LLM
      • Qwen 3 235B-A22B Perf Optimized
      • DeepSeek V3 Perf Optimized
      • Gemma3 support from Google
      • Embedding and Reranker models
    • MM
      • Llama 4
      • AVLM
  • Training performance (speed)
    • NVL SHARP + IB SHARP for DP/FSDP communications on H100 and B200
    • MXFP8 with TP communication overlap
    • MXFP8 with reduced memory allocation
    • FP8 sub-channel recipe (128x128 for weight and 1x128 for activation)
    • cuDNN fused attention for MLA (both Hopper and Blackwell)
    • Advanced custom asymmetric pipelining (for MTP, loss function, and embedding)
    • BF16 optimizer for model memory saving
    • CUDA graph fix for fine-tuning benchmarks
    • CUDA graph support for Llama 4
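
As a rough illustration of the block sizes named in the FP8 sub-channel recipe above (128x128 for weights, 1x128 for activations), the sketch below counts how many scaling factors each tensor would carry if every block shares one factor. This is not NeMo or Transformer Engine code: the helper scale_grid and the layer shapes are hypothetical, chosen only to make the arithmetic concrete, and the one-factor-per-block reading of the recipe is an assumption.

```python
import math


def scale_grid(shape, block):
    """Return the (rows, cols) grid of scaling factors for a 2-D tensor
    when one factor is shared by each block of size (block_rows, block_cols)."""
    rows, cols = shape
    block_rows, block_cols = block
    # Partial blocks at the edges still need their own factor, hence ceil.
    return math.ceil(rows / block_rows), math.ceil(cols / block_cols)


# Hypothetical layer sizes, used only for illustration.
weight_shape = (4096, 4096)    # (out_features, in_features)
activation_shape = (8, 4096)   # (tokens, in_features)

weight_grid = scale_grid(weight_shape, (128, 128))        # 128x128 weight blocks
activation_grid = scale_grid(activation_shape, (1, 128))  # 1x128 activation tiles

print("weight scale grid:", weight_grid, "factors:", weight_grid[0] * weight_grid[1])
print("activation scale grid:", activation_grid, "factors:", activation_grid[0] * activation_grid[1])
```

Under these assumed shapes, the weight gets a 32 by 32 grid of factors and each token row of the activation gets 32 factors of its own, which shows why the activation side is finer grained along the token dimension.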

Detailed Changelogs

ASR

Changelog

TTS

Changelog

NLP / NMT

Changelog

Export

Changelog

Bugfixes

Changelog

Uncategorized

Changelog

NVIDIA Neural Modules 2.3.2

08 Jul 22:29
f98ef1d

This release addresses known security issues. For the latest NVIDIA Vulnerability Disclosure Information, visit https://www.nvidia.com/en-us/security/. For acknowledgement, please reach out to the NVIDIA PSIRT team at [email protected]

NVIDIA Neural Modules 2.4.0rc2

09 Jul 00:37
7ac5a8e
Pre-release

Prerelease: NVIDIA Neural Modules 2.4.0rc2 (2025-07-09)

NVIDIA Neural Modules 2.4.0rc1

02 Jul 20:49
4c6fb0c
Pre-release

Prerelease: NVIDIA Neural Modules 2.4.0rc1 (2025-07-02)