Fix Whisper doc + improve max_decode_ratio #1858

Merged
anautsch merged 9 commits into speechbrain:develop from Adel-Moumen:whisper-doc-max_decode_ratio
Feb 22, 2023

@Adel-Moumen (Collaborator) commented Feb 22, 2023

  • Fix the docstring of the Whisper model
  • Fix the docstrings of the Whisper seq2seq beam search and greedy search
  • Fix the max_decode_ratio argument, which did not take into account the maximum token length of the Whisper decoder. The upper bound can now be changed dynamically through the change_max_decoding_length function.
  • Add testing only on LibriSpeech
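
To illustrate the max_decode_ratio fix, here is a minimal sketch of the idea, not the code merged in this PR. The helper name bounded_decode_steps and its parameters are hypothetical, and the default of 448 assumes the text context length of the released Whisper decoders. It only shows how a ratio-based step budget can be capped by the decoder's maximum token length.

```python
def bounded_decode_steps(
    enc_len, min_decode_ratio, max_decode_ratio, max_decoder_tokens=448
):
    """Return (min_steps, max_steps) for decoding.

    Hypothetical helper for illustration only.
    enc_len: number of encoder frames for the utterance.
    max_decoder_tokens: assumed upper limit on decoder positions.
    """
    if not 0.0 <= min_decode_ratio <= max_decode_ratio:
        raise ValueError("expected 0 <= min_decode_ratio <= max_decode_ratio")

    # Ratio-based budget, capped by what the decoder can actually emit.
    max_steps = min(int(enc_len * max_decode_ratio), max_decoder_tokens)
    # The lower bound can never exceed the (capped) upper bound.
    min_steps = min(int(enc_len * min_decode_ratio), max_steps)
    return min_steps, max_steps


# Long input: the ratio alone would allow 1500 steps, so the cap applies.
long_bounds = bounded_decode_steps(1500, 0.0, 1.0)
# Short input: the ratio-based budget is already below the cap.
short_bounds = bounded_decode_steps(100, 0.0, 1.0)
```

Without the cap, a ratio of 1.0 on a long encoder output would request more decoding steps than the decoder has positions for, which is the failure mode the bullet above describes.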

I checked, and everything works as expected. In zero-shot, I was able to get 2.44 CER and 6.4 WER on test-clean, and test CER: 6.74, test WER: 14.31 with max_decode_ratio=1.0.

I used the beautiful test recipe made by Andreas on CommonVoice/LibriSpeech and all checks passed.

(We need to fix this for 0.5.14/0.6.)

@Adel-Moumen marked this pull request as ready for review February 22, 2023 13:56
@Adel-Moumen requested a review from anautsch February 22, 2023 13:56

@anautsch (Collaborator) left a comment

lgtm

@anautsch merged commit 71c1490 into speechbrain:develop Feb 22, 2023