HF Adapters by JRosenkranz · Pull Request #39 · foundation-model-stack/foundation-model-stack

JRosenkranz · 2023-09-29T17:35:53Z

Add Huggingface Adapters along with model testing for huggingface

This PR includes:

- HF adapter classes for encoder, decoder, and encoder-decoder model architectures
- Llama implementation of HFDecoderModelArchitecture
- Different LM Head implementations
- model_test_suite for huggingface configuration tests, equivalence tests, and generation/batch generation tests

nairbv · 2023-10-02T14:39:59Z

    return ibm_model, tokenizer
+
+
+def convert_hf_llama(hf_model: "LlamaForCausalLM") -> LLaMA:


This is fine for now, but we'll want to revisit it with #5.

It could also go in llama/utils.py, though? Not sure if that would be preferable.

We don't have a llama folder for fms, only hf/llama. This function is not for LLaMAHF (FMS); it is really for the FMS (non-HF) LLaMA. I wonder if we should address the packaging structure for this in general, though maybe not as part of this PR.

Oh, I meant hf/utils.py; looking at the left margin, I thought that was in the llama folder. Does somewhere else in hf/llama not make sense? I guess it's just to keep the initialization functions near each other? Not a big deal, especially since we'll be changing that code later anyway.

I think a general initialization utility for llama would be best. I'm not sure where to put it, as we do not have an fms llama folder.

nairbv · 2023-10-02T20:39:38Z

+llama: LLaMA = LLaMA(config)
+
+# huggingface model backed by fms internals
+llama_hf = LLaMAHFForCausalLM.from_fms_model(llama)


Question: is there any generic way to do this? I.e., if we write adapters for a few models, is there some way to call from_fms_model(model) without first having to check which model architecture it is?

If they conform to a similar underlying API, then most likely yes. For instance, the _helper function called in _adapt would need the same input/output params across models (not necessarily the same names, just the same purpose). If they all share the same purpose and usage, we could add an implementation that does this. It would take a map from each model's specific parameter names to a common parameter name. For instance, if one model uses the name x and another uses the name x_in, we can make the common name input_ids.

model = HFModel.from_fms_model(fms_model, mapper={'x': 'input_ids'})
model2 = HFModel.from_fms_model(fms_model2, mapper={'x_in': 'input_ids'})
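The mapper idea above could be sketched roughly as follows. This is only an illustration under stated assumptions: `GenericAdapter`, `model_a`, and `model_b` are hypothetical stand-ins and not part of fms or transformers, and the real adapters in this PR may work quite differently. The sketch just shows how a map from model-specific parameter names to a common name lets callers use one keyword for every model.

```python
from typing import Any, Callable, Dict


class GenericAdapter:
    """Hypothetical wrapper exposing a common keyword interface over a model callable."""

    def __init__(self, model: Callable[..., Any], mapper: Dict[str, str]):
        # mapper goes from the model-specific parameter name to the common name,
        # e.g. {"x": "input_ids"}, matching the direction used in the comment above.
        self._model = model
        # Invert the map so common names can be translated back at call time.
        self._to_model = {common: specific for specific, common in mapper.items()}

    @classmethod
    def from_fms_model(cls, model: Callable[..., Any], mapper: Dict[str, str]) -> "GenericAdapter":
        return cls(model, mapper)

    def __call__(self, **common_kwargs: Any) -> Any:
        # Rename each common keyword to the name the wrapped model expects;
        # keywords without a mapping are passed through unchanged.
        translated = {self._to_model.get(k, k): v for k, v in common_kwargs.items()}
        return self._model(**translated)


# Two toy "models" (assumptions for illustration) whose inputs are named differently.
def model_a(x):
    return [t + 1 for t in x]


def model_b(x_in):
    return [t * 2 for t in x_in]


adapter_a = GenericAdapter.from_fms_model(model_a, mapper={"x": "input_ids"})
adapter_b = GenericAdapter.from_fms_model(model_b, mapper={"x_in": "input_ids"})

# Both adapters now accept the same common keyword.
out_a = adapter_a(input_ids=[1, 2])
out_b = adapter_b(input_ids=[1, 2])
```

With this shape, the caller never needs to check which architecture it holds; only the mapper passed at construction time differs per model.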

nairbv

lgtm

JRosenkranz added 21 commits August 31, 2023 14:00

feat: added hf adapter classes and llama hf implementation; added model testing suite

37454c0

fix: added equivalency test; fixed weight conversions

d2ee158

fix: defaults now match llama2; hf name now using ForCausalLM as that is what meta uses; added readme example for hf

eb76dea

chore: changed generator to llama_generator

c696115

Merge branch 'main' into add_hf_and_model_tests

67be88d

feat: switched to lm head mixin; added function to convert from hf

259c73d

test: added train loss test

a80e099

Merge branch 'main' into add_hf_and_model_tests

c48c45c

fix: manual merge from weight cleanup

a5edae3

build: added transformers 4.31.0

09e34cd

fix: fixed merge conflicts; reformatted llama code style

78b55a4

test: moved hf equivalency test to its own file; removed the import used for a type hint in llama model; now using _has_hf in conversion

2d66344

fix: added __init__ to models/hf_equivalence

9f6b028

fix: removed self in hf equiv test

ccb8d05

fix: added al params in conversion

b04d72f

fix: updated to stay in line with the model tests

987900e

fix: added padding_side=left to all tokenizers as is required for batch generation

244f667

fix: manual merge from fix rope; updated the llama2 resources to use the new version of transformers which returns a different vocab size

4da7671

fix: added position fixes

a207131

chore: merged main manually including rope fixes

25a675e

fix: fixed all merge conflicts; all tests passing

37ab489

JRosenkranz added the enhancement New feature or request label Sep 29, 2023

JRosenkranz requested review from ani300 and nairbv September 29, 2023 17:35

JRosenkranz self-assigned this Sep 29, 2023

JRosenkranz changed the title ~~Add hf and model tests with rope fix~~ HF Adapters Sep 29, 2023

JRosenkranz mentioned this pull request Sep 29, 2023

added hf adapter classes and llama hf implementation; added model testing suite #2

Closed

JRosenkranz added 3 commits September 29, 2023 13:46

fix: added __init__.py to models/hf

99368c4

ci: added transformers to install

b06deee

chore: reformatted code

7e2f136

JRosenkranz added 4 commits September 29, 2023 14:49

fix:reduced dependencies on specific fixtures for hf fixture mixins; fixed some issues that resulted in tests failing in ci (bad imports, etc.)

925527a

fix: fixed more broken imports

470b4ff

fix: fixed broken import in ci

d413269

fix: removed labels from prepare_inputs_for_generation

3c19e28

nairbv reviewed Oct 2, 2023

Comment thread tests/models/hf/test_llama_hf.py Outdated

fix: fixed type hint for oss_hf_model from config to model

ed63c00

nairbv reviewed Oct 2, 2023

Comment thread fms/testing/_internal/hf/model_test_suite.py Outdated

nairbv reviewed Oct 2, 2023

Comment thread tests/models/hf/test_llama_hf.py

nairbv reviewed Oct 2, 2023

Comment thread tests/models/test_llama.py

nairbv reviewed Oct 2, 2023

Comment thread tests/models/conftest.py Outdated

nairbv reviewed Oct 2, 2023

Comment thread fms/modules/positions.py Outdated

nairbv reviewed Oct 2, 2023

Comment thread fms/modules/positions.py Outdated

JRosenkranz added 4 commits October 2, 2023 10:58

chore: addressed comments regarding explicit override of abstract properties for hf model_test_suite; _hf_specific_params does not have a default now (requires overriding); fixed an issue where if you run tests with --capture_expectation, runslow tests would be run

86e0adc

chore: fixed typo in comment

a735100

docs: added comments for hf test cases

97555f2

chore: fixed formatting

d659519

JRosenkranz requested a review from nairbv October 2, 2023 15:31

fix: removed freq from rot_emb as it is not needed; compute the freq in tests

e837283

nairbv reviewed Oct 2, 2023

nairbv approved these changes Oct 2, 2023

Merge branch 'main' into add_hf_and_model_tests_with_rope_fix

92f3f61

JRosenkranz merged commit 1a53e4e into main Oct 2, 2023

JRosenkranz deleted the add_hf_and_model_tests_with_rope_fix branch October 2, 2023 23:41

JRosenkranz merged 35 commits into main from add_hf_and_model_tests_with_rope_fix
