
Generic adapters implementation #2563

Merged: TParcollet merged 56 commits into speechbrain:develop from pplantinga:adapters on Sep 10, 2024

Conversation

pplantinga commented Jun 5, 2024

Closes #2526, #2534

Here is a proposal for how we can add adapters (including LoRA) to the toolkit. This branch is based on #2534, and it also implements flexible layer selection and small checkpoints (a sketch of the idea follows below).
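
For context, "small checkpoints" here means saving only the adapter parameters rather than the whole backbone, which is restored from the original pretrained weights. A minimal sketch of the idea (the helper names and the requires_grad-based filtering are illustrative assumptions, not this PR's actual implementation):

import torch

def save_adapter_checkpoint(model, path):
    # Persist only the trainable (adapter) parameters; the frozen
    # backbone is recovered from the pretrained model at load time.
    trainable = {n for n, p in model.named_parameters() if p.requires_grad}
    torch.save({n: t for n, t in model.state_dict().items() if n in trainable}, path)

def load_adapter_checkpoint(model, path):
    # strict=False because the checkpoint intentionally covers only
    # a subset of the model's parameters.
    model.load_state_dict(torch.load(path), strict=False)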

There are a few more things that would be nice to have, but I personally don't think they're necessary before merging.

  • a merge_and_unload()-style function for LoRA-type layers that folds the adapter weights back into the original model (see the sketch below)
  • the capability to use adapters from the peft library -- they have an extensive collection that will likely be updated regularly
  • more adapter types

If anyone thinks these are urgent we can work on adding them to this PR.

UPDATE (see below): This works with PEFT now.
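
For the merge_and_unload() item above: with LoRA the merge reduces to folding the low-rank update back into the base weight, W' = W + (alpha / rank) * B A. A minimal sketch for a single linear layer (the function name and the alpha/rank scaling convention are assumptions for illustration, not this PR's API):

import torch

@torch.no_grad()
def merge_lora_into_linear(linear, lora_A, lora_B, alpha, rank):
    # lora_A: (rank, in_features); lora_B: (out_features, rank).
    # Folding the update into the base weight removes the extra
    # adapter matmuls at inference time.
    linear.weight += (alpha / rank) * (lora_B @ lora_A)
    return linear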

TParcollet commented Jun 6, 2024

@pplantinga are the checkpointing features also working with this easy PEFT adaptation? We should make sure it works with the Pretrainer as well, not just checkpointing, I believe.

mravanelli added the enhancement label on Jun 17, 2024
TParcollet commented Jul 8, 2024

@Adel-Moumen @mravanelli I think we will want this in v1.0.1. And it looks ready to me?

@TParcollet

@poonehmousavi could you review and test the code as mentioned? It looks ready to me. Thanks!

@poonehmousavi

> @poonehmousavi could you review and test the code as mentioned? It looks ready to me. Thanks!

Sure. I will do it by tomorrow.

@poonehmousavi

@pplantinga have you tested it with the Pretrainer used by the inference interfaces? Also, have you checked how it works with quantization (like QLoRA)?

pplantinga commented Jul 13, 2024

> @pplantinga have you tested it with the Pretrainer used by the inference interfaces?

I tested this and it worked, but it emitted warnings because only the trained params are loaded. I have fixed this now.

The YAML I used is here:

whisper_hub: openai/whisper-small.en
lora_rank: 16
language: "english"
sample_rate: 16000

min_decode_ratio: 0.0
max_decode_ratio: 1.0
test_beam_size: 8

whisper_pretrained: !new:speechbrain.lobes.models.huggingface_transformers.whisper.Whisper
    source: !ref <whisper_hub>
    save_path: .
    language: !ref <language>
    task: "transcribe"
    sampling_rate: !ref <sample_rate>

# Wrap the pretrained model; all_linear: True attaches a LoRA adapter
# to every linear layer.
whisper: !new:speechbrain.nnet.adapters.AdaptedModel
    model_to_adapt: !ref <whisper_pretrained>
    adapter_class: !name:speechbrain.nnet.adapters.LoRA
    all_linear: True
    adapter_kwargs:
        rank: !ref <lora_rank>

test_search: !new:speechbrain.decoders.seq2seq.S2SWhisperBeamSearcher
    module: [!ref <whisper>]
    min_decode_ratio: !ref <min_decode_ratio>
    max_decode_ratio: !ref <max_decode_ratio>
    beam_size: !ref <test_beam_size>

modules:
    whisper: !ref <whisper>
    decoder: !ref <test_search>

# The Pretrainer restores the (adapted) model weights before inference.
pretrainer: !new:speechbrain.utils.parameter_transfer.Pretrainer
    loadables:
        whisper: !ref <whisper>

And the Python code:

import speechbrain as sb

model = sb.inference.ASR.WhisperASR.from_hparams(".", "lora_pre.yaml", savedir="results/whisper/1987/save/CKPT+2024-06-05+18-30-33+00")
model.transcribe_file("speechbrain/asr-streaming-conformer-librispeech/test-en.wav")

> Also, have you checked how it works with quantization (like QLoRA)?

I am not very familiar with QLoRA; it seems there's additional setup needed to get this to work.
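
For context, the extra setup is mostly on the loading side: in the usual QLoRA recipe the base model is quantized to 4-bit and frozen, and the LoRA adapters are trained in higher precision on top. A sketch of that recipe using HuggingFace transformers and peft directly (not wired into SpeechBrain's AdaptedModel here; the target module names are illustrative):

import torch
from transformers import BitsAndBytesConfig, WhisperForConditionalGeneration
from peft import LoraConfig, get_peft_model

# Load the frozen backbone in 4-bit (NF4) with fp16 compute.
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.float16,
)
model = WhisperForConditionalGeneration.from_pretrained(
    "openai/whisper-small.en",
    quantization_config=bnb_config,
    device_map="auto",
)

# Attach trainable LoRA adapters on top of the quantized weights.
lora_config = LoraConfig(r=16, lora_alpha=32, target_modules=["q_proj", "v_proj"])
model = get_peft_model(model, lora_config)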

@pplantinga

One-epoch (100h) results for Whisper Small.en (published results are test-clean=3.05 and test-other=7.53):

speechbrain.utils.train_logger - Epoch loaded: 1 - test loss: 9.73e-01, test CER: 1.03, test WER: 2.81
speechbrain.utils.train_logger - Epoch loaded: 1 - test loss: 9.86e-01, test CER: 1.08, test WER: 2.90
speechbrain.utils.train_logger - Epoch loaded: 1 - test loss: 1.22, test CER: 3.00, test WER: 6.57


TParcollet commented Sep 4, 2024

@pplantinga should we merge this? Maybe with a small tutorial somewhere as well?

@pplantinga

There are more features that could be added, but I think this is ready to merge as-is; the rest can be added later.

pplantinga added this to the v1.1.0 milestone on Sep 10, 2024
TParcollet left a comment

LGTM now :)


Labels

enhancement New feature or request

Development

Successfully merging this pull request may close these issues.

Adapters + LLama -- re-design.

4 participants