Skip to content

NVIDIA Neural Modules 2.5.0

Latest

Choose a tag to compare

@chtruong814 chtruong814 released this 07 Oct 00:36
ddcb2d6

Highlights

  • Collections:

    • LLM
      • Nano v2 12B and 9B
    • Speech
      • New SpeechLM2 collection
      • Streaming Softformer model
      • Deprecate Confidence Ensemble models
      • parakeet-tdt-0.6b-v3 and canary-1b-v2 models
      • Added chunk inference support with .transcribe() for canary based models
      • Enable prediction of timestamps with streaming ASR
      • Improve ASR models’ invariance to padding/batch size
      • Qwen prompt format support, SALM generation fixes
      • High-level SALM model.generate API closely resembling HF models
      • SALM model initialization with time/memory optimization
      • SpeechLM2: fixed excessive padding, support on-the-fly resampling for SALM
  • Automodel and Export-Deploy functionality are available in their individual repositories respectively and deprecated in NeMo2

Detailed Changelogs:

ASR

Changelog

TTS

Changelog

NLP / NMT

Changelog

Text Normalization / Inverse Text Normalization

Changelog

Export

Changelog

Bugfixes

Changelog
  • Bugfix for Hyena to the get_t function which comes up when doing longer context inference by @jstjohn :: PR: #14256
  • fix skipped cuHyena kernel while training by @farhadrgh :: PR: #14365
  • Remove flaky Evo2 dataset performance test by @jstjohn :: PR: #14371
  • Use module prefix in restore_modelopt_state by @jenchen13 :: PR: #14384

Uncategorized:

Changelog