Skip to content

v1.1.0

Latest

Choose a tag to compare

@Adel-Moumen Adel-Moumen released this 30 Mar 14:41
· 2 commits to develop since this release

This major release extends SpeechBrain's support for SpeechLLMs and introduces several new features, recipes, and improvements.

Highlights

  • Feature Caching — Save extracted features (e.g. wav2vec embeddings) to disk and load them on the fly, skipping recomputation. This powers our first ASR SpeechLLM recipe on LibriSpeech, enabling LLM-based training with pre-computed embeddings.
  • New Recipes — SpeechLLM for ASR and translation, streaming SSL, FocalCodec, and SENSE models.

Along with internal improvements and bug fixes. Here follows a changelog of the main changes (omitting some minor bugfixes):

What's Changed

New Contributors

Full Changelog: v1.0.3...v1.1.0

What's Changed

New Contributors

Full Changelog: v1.0.3...v1.1.0