
Whisper finetuning #1717

Merged
TParcollet merged 109 commits into speechbrain:develop from Adel-Moumen:whisper-finetuning on Dec 7, 2022

Conversation

@Adel-Moumen
Collaborator

No description provided.

Comment thread speechbrain/lobes/models/huggingface_whisper.py Outdated
Comment thread speechbrain/decoders/seq2seq.py Outdated
enc_states, memory
) # TODO: switch args
log_probs = self.softmax(dec_out[:, -1])
return log_probs, memory, None
Collaborator


Are we adding an argument here?

Comment thread speechbrain/decoders/seq2seq.py Outdated
- dec_out = self.model.forward_decoder(enc_states, memory)
+ dec_out, attn, = self.module.forward_decoder(enc_states, memory)
log_probs = self.softmax(dec_out[:, -1])
return log_probs, memory, None
Collaborator


We should make sure that this won't break any model

Collaborator Author


I just followed what we did for S2STransformerBeamSearch. The beam search algorithm expects the attn weights if they are provided; in our case we have them, so it makes sense to return them. One note: I think returning attn from the decoder should be mandatory (not optional).

I ran some tests with and without attn in the beam search and did not see any difference (this might be related to how I initialized the beam search; I will investigate tomorrow).
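A minimal, self-contained sketch of the pattern under discussion (the names here are hypothetical stand-ins, not SpeechBrain's actual API): the per-step decoder hook returns attention weights alongside the log-probabilities, so a beam search that expects attention, as S2STransformerBeamSearch does, can consume them when present.

```python
import math

def log_softmax(scores):
    # Numerically stable log-softmax over a list of scores.
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [math.log(e / total) for e in exps]

def forward_decoder(enc_states, memory):
    # Toy stand-in for the model's decoder: emits per-step vocabulary
    # scores and attention weights over the encoder states.
    dec_out = [[0.1, 0.7, 0.2]]   # (time, vocab) toy scores
    attn = [[0.5, 0.5]]           # (time, src_len) toy attention
    return dec_out, attn

def forward_step(enc_states, memory):
    # Return attn (rather than None) so callers that can use attention
    # weights actually receive them.
    dec_out, attn = forward_decoder(enc_states, memory)
    log_probs = log_softmax(dec_out[-1])  # scores of the last step
    return log_probs, memory, attn
```

Returning `attn` instead of `None` is backward-compatible for callers that ignore the third element, which is why the reviewer's concern about breaking other models is worth testing rather than assuming.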

TParcollet and others added 2 commits November 27, 2022 21:57
Avoid the inefficient conversion of input tensors to CPU NumPy arrays
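A hedged sketch of what this commit message describes, with illustrative function names that are not the actual SpeechBrain code: round-tripping a GPU tensor through a CPU NumPy array forces a device synchronization and an extra copy per item, whereas staying in torch avoids the detour entirely.

```python
import numpy as np
import torch

def pad_batch_slow(wavs):
    # Anti-pattern: tensor -> CPU NumPy -> tensor. Each .cpu().numpy()
    # call synchronizes with the device and copies the data.
    arrs = [w.cpu().numpy() for w in wavs]
    max_len = max(a.shape[0] for a in arrs)
    out = np.zeros((len(arrs), max_len), dtype=arrs[0].dtype)
    for i, a in enumerate(arrs):
        out[i, : a.shape[0]] = a
    return torch.from_numpy(out)

def pad_batch_fast(wavs):
    # Stay in torch: pad on the tensors' own device, no NumPy detour.
    return torch.nn.utils.rnn.pad_sequence(wavs, batch_first=True)
```

Both produce the same padded batch; only the fast variant keeps the tensors on their original device throughout.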
Collaborator

@TParcollet TParcollet left a comment


Once the requested changes are done and the results are added, the PR will be ready to be merged.

@Adel-Moumen Adel-Moumen marked this pull request as ready for review December 7, 2022 08:13
@Adel-Moumen Adel-Moumen changed the title [WIP] Whisper finetuning Whisper finetuning Dec 7, 2022
Collaborator

@TParcollet TParcollet left a comment


LGTM!

@TParcollet TParcollet merged commit a7901a3 into speechbrain:develop Dec 7, 2022


7 participants