v1.0.3
Major Changes
- Add People's Speech (30,000 hours) Conformer ASR (Code from Samsung AI Center Cambridge) by @TParcollet in #2767
- Audio and Music SSL by @poonehmousavi in #2755
- Add new Audio Tokenziers by @poonehmousavi in #2751
- Libriheavy (Code from SAIC-Cambridge) by @shucongzhang in #2781
- Conformer recipe for LargeScaleASR (code from Samsung AI Center Cambridge) by @TParcollet in #2806
- Rotary Position Embedding (RoPE) for ASR (code from Samsung Cambridge) by @shucongzhang in #2799
- Voice analysis functions by @pplantinga in #2689
New Contributors
- @rogiervd made their first contribution in #2734
- @benniekiss made their first contribution in #2746
- @mirofedurco made their first contribution in #2762
- @kit1980 made their first contribution in #2797
- @IliasMAOUDJ made their first contribution in #2574
Full Changelog: v1.0.2...v1.0.3