Skip to content

Commit 7a0d4fb

Browse files
committed
Merge branch 'nvpstr/release' of https://gitlab-master.nvidia.com/dl/JoC/examples into nvpstr/release
2 parents 15075aa + 566f9cb commit 7a0d4fb

1 file changed

Lines changed: 4 additions & 3 deletions

File tree

TensorFlow/LanguageModeling/BERT/README.md

Lines changed: 4 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -99,7 +99,8 @@ These techniques and optimizations improve model performance and reduce training
9999
### Feature support matrix
100100

101101
The following features are supported by this model.
102-
| Feature | BERT |
102+
103+
| **Feature** | **BERT** |
103104
|:-----------------------:|:--------------------------:|
104105
| Horovod Multi-GPU | Yes |
105106

@@ -640,12 +641,12 @@ Our results were obtained by running the `run_squad.py` training script in the T
640641

641642
The following tables compare `F1` scores across 5 different training runs with different seeds, for both FP16 and FP32 respectively. The runs showcase consistent convergence on all 5 seeds with very little deviation.
642643

643-
| **FP16, 8x GPUs** | **seed #1** | **seed #2** | **seed #3** | **seed #4** | **seed #5** | **mean** | **std** |
644+
| **FP16, 8x GPUs** | **seed 1** | **seed 2** | **seed 3** | **seed 4** | **seed 5** | **mean** | **std** |
644645
|:-----------:|:-----:|:-----:|:-----:|:-----:|:-----:|:-----:|:-----:|
645646
|F1 |90.75|90.82|90.89|91.05|90.79|90.86|0.12|
646647
|Exact match|83.85|83.93|83.95|84.25|83.59|83.91|0.24|
647648

648-
| **FP32, 8x GPUs** | **seed #1** | **seed #2** | **seed #3** | **seed #4** | **seed #5** | **mean** | **std** |
649+
| **FP32, 8x GPUs** | **seed 1** | **seed 2** | **seed 3** | **seed 4** | **seed 5** | **mean** | **std** |
649650
|:-----------:|:-----:|:-----:|:-----:|:-----:|:-----:|:-----:|:-----:|
650651
|F1 |90.70|90.80|90.89|91.08|90.73|90.84|0.15 |
651652
|Exact match|83.82|83.77|84.23|84.19|83.63|83.93|0.27 |

0 commit comments

Comments
 (0)