Commit b50add4

Update README.md
Fix typo of FasterTransformer README.md
1 parent 56614ea commit b50add4

1 file changed

Lines changed: 2 additions & 2 deletions

FasterTransformer/v2/README.md

@@ -564,7 +564,7 @@ python encoder_sample.py \
 `./bin/decoding_gemm` can generate the best GEMM configuration. The arguments of `decoding_gemm` are:
 
 ```bash
-./bin/decoding_gemm <batch_size> <beam_width> <head_number> <size_per_head> <sequence_length> <encoder_hidden_dim> <is_use_fp16>
+./bin/decoding_gemm <batch_size> <beam_width> <head_number> <size_per_head> <vocab_size> <sequence_length> <encoder_hidden_dim> <is_use_fp16>
 ```
 
 Assume the settings of decoding are as follows.
@@ -593,7 +593,7 @@ a. Run the decoding in C++ by running the following script:
 `./bin/decoding_sample` runs the decoding in the `cpp`. The arguments of `encoder_sample` is:
 
 ```bash
-./bin/decoding_sample <batch_size> <beam_width> <head_number> <size_per_head> <sequence_length> <num_layers> <encoder_hidden_dim> <is_use_fp16>
+./bin/decoding_sample <batch_size> <beam_width> <head_number> <size_per_head> <vocab_size> <sequence_length> <num_layers> <encoder_hidden_dim> <is_use_fp16>
 ```
 
 Then the following scripts can run the decoding under the above settings.
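
Note on the corrected argument order: both usage lines now place `<vocab_size>` between `<size_per_head>` and `<sequence_length>`. The sketch below is not part of the commit or the repository. It is a hypothetical Python helper (`DecodingArgs`, `gemm_command`, and `sample_command` are illustrative names) that assembles the two command lines in the order shown on the `+` lines. The numeric values are placeholders, not the README's settings, and treating `is_use_fp16` as 0 or 1 is an assumption.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class DecodingArgs:
    """Positional arguments shared by both binaries, in the '+' line order."""
    batch_size: int
    beam_width: int
    head_number: int
    size_per_head: int
    vocab_size: int          # the argument this commit adds to the usage lines
    sequence_length: int
    encoder_hidden_dim: int
    is_use_fp16: int         # assumption: 0 for FP32, 1 for FP16


def gemm_command(a: DecodingArgs) -> List[str]:
    """Build the argv list for ./bin/decoding_gemm."""
    return ["./bin/decoding_gemm"] + [str(v) for v in (
        a.batch_size, a.beam_width, a.head_number, a.size_per_head,
        a.vocab_size, a.sequence_length, a.encoder_hidden_dim, a.is_use_fp16,
    )]


def sample_command(a: DecodingArgs, num_layers: int) -> List[str]:
    """Build the argv list for ./bin/decoding_sample.

    num_layers sits between sequence_length and encoder_hidden_dim.
    """
    return ["./bin/decoding_sample"] + [str(v) for v in (
        a.batch_size, a.beam_width, a.head_number, a.size_per_head,
        a.vocab_size, a.sequence_length, num_layers,
        a.encoder_hidden_dim, a.is_use_fp16,
    )]


# Placeholder values chosen only to make the ordering visible.
args = DecodingArgs(
    batch_size=32, beam_width=4, head_number=8, size_per_head=64,
    vocab_size=30000, sequence_length=32, encoder_hidden_dim=512,
    is_use_fp16=0,
)
print(" ".join(gemm_command(args)))
# ./bin/decoding_gemm 32 4 8 64 30000 32 512 0
print(" ".join(sample_command(args, num_layers=6)))
# ./bin/decoding_sample 32 4 8 64 30000 32 6 512 0
```

Named fields make an argument missing from the usage line, as `<vocab_size>` was before this fix, harder to overlook than it is in a bare positional string. The returned lists can be passed to `subprocess.run` once the binaries are built.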
