Commit a3e4857

fix-step3-readme (deepspeedai#286)
* fix-step3-readme
* Update README.md

---------

Co-authored-by: Zhewei Yao <zheweiy@berkeley.edu>
1 parent 354fc7f commit a3e4857

1 file changed

Lines changed: 1 addition & 4 deletions

applications/DeepSpeed-Chat/training/step3_rlhf_finetuning/training_scripts/README.md

@@ -7,9 +7,6 @@ If you don't have step 1 and step 2 models. You may simply try
 ``` bash
 --actor_model_name_or_path facebook/opt-1.3b --critic_model_name_or_path facebook/opt-350m
 ```
-⚡⚡⚡ When you use above script, please make sure you comment out the following such that it won't load the model weight from previous paths.
-```bash
-applications/DeepSpeed-Chat/training/utils/model/model_utils.py#L60
-```
+⚡⚡⚡ When you use above script, please make sure you modify parameter `rlhf_training` to False when calling the `create_critic_model` function twice in [rlhf_engine.py](./../../step3_rlhf_finetuning/rlhf_engine.py) such that it won't load the model weight from previous paths.
 
For the models we support, please see [our landing page](./../../../README.md#-supported-models-)
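
The instruction added in this commit can be illustrated with a minimal sketch. The `create_critic_model` below is a hypothetical stand-in, not the repository's function: the real one takes more parameters, and the only details taken from the README are its name, the `rlhf_training` flag, and the fact that it is called twice in `rlhf_engine.py` (presumably once for the critic and once for the reward model). The dictionary return value is an assumption made purely so the effect of the flag is visible.

```python
# Hypothetical stand-in for DeepSpeed-Chat's create_critic_model.
# Assumption: rlhf_training=True means "load weights from a step 2 checkpoint".
def create_critic_model(model_name_or_path, rlhf_training=False):
    """Describe how the model would be initialised (illustration only)."""
    if rlhf_training:
        # Would expect a saved step 2 checkpoint at model_name_or_path.
        return {"base": model_name_or_path, "loads_previous_checkpoint": True}
    # Starts from the pretrained base weights only.
    return {"base": model_name_or_path, "loads_previous_checkpoint": False}


# Without step 1 and step 2 models, pass a hub name and set the flag to False
# at both call sites so no earlier checkpoint is looked up.
critic = create_critic_model("facebook/opt-350m", rlhf_training=False)
reward = create_critic_model("facebook/opt-350m", rlhf_training=False)
```

In the actual `rlhf_engine.py`, the equivalent change would be editing the `rlhf_training` argument at each existing call rather than defining anything new.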
