File tree Expand file tree Collapse file tree
Expand file tree Collapse file tree Original file line number Diff line number Diff line change @@ -564,7 +564,7 @@ python encoder_sample.py \
564564` ./bin/decoding_gemm ` can generate the best GEMM configuration. The arguments of ` decoding_gemm ` are:
565565
566566``` bash
567- ./bin/decoding_gemm < batch_size> < beam_width> < head_number> < size_per_head> < sequence_length> < encoder_hidden_dim> < is_use_fp16>
567+ ./bin/decoding_gemm < batch_size> < beam_width> < head_number> < size_per_head> < vocab_size > < sequence_length> < encoder_hidden_dim> < is_use_fp16>
568568```
569569
570570Assume the settings of decoding are as follows.
@@ -593,7 +593,7 @@ a. Run the decoding in C++ by running the following script:
593593` ./bin/decoding_sample ` runs the decoding in the ` cpp ` . The arguments of ` encoder_sample ` is:
594594
595595``` bash
596- ./bin/decoding_sample < batch_size> < beam_width> < head_number> < size_per_head> < sequence_length> < num_layers> < encoder_hidden_dim> < is_use_fp16>
596+ ./bin/decoding_sample < batch_size> < beam_width> < head_number> < size_per_head> < vocab_size > < sequence_length> < num_layers> < encoder_hidden_dim> < is_use_fp16>
597597```
598598
599599Then the following scripts can run the decoding under the above settings.
You can’t perform that action at this time.
0 commit comments