File tree Expand file tree Collapse file tree
PyTorch/LanguageModeling/BERT Expand file tree Collapse file tree Original file line number Diff line number Diff line change @@ -799,9 +799,6 @@ Our results were obtained by running the `scripts/run_pretraining.sh` and `scrip
799799|1 | 4| 8| 512|8.36 |30.08 | 3.68| 1.00| 1.00
800800|4 | 4| 8| 512|31.52 |116.80 | 3.70| 3.84| 3.82
801801| 8| 4| 8| 512|62.72 |231.68 | 3.69| 7.68| 7.61
802- |1 |N/A | 10| 512|N/A |46.00| N/A| N/A| 1.0
803- |4 |N/A | 10| 512|N/A |164.00 | N/A| N/A| 3.57
804- | 8|N/A | 10| 512|N/A |325.60| N/A| N/A| 7.08
805802
806803
807804###### Fine-tuning NVIDIA DGX-1 With 32G
@@ -832,10 +829,7 @@ Our results were obtained by running the `scripts/run_pretraining.sh` and `scrip
832829|4 | 4 |8 | 512| 34.4| 124.16| 3.60| 3.82| 3.84
833830|8 | 4 | 8| 512| 68.16| 247.04| 3.62| 7.57| 7.64
834831|16 | 4 | 8| 512| 135.68| 488.96| 3.60| 15.08| 15.13
835- |1 | N/A | 10 | 512|N/A | 47.40| N/A| N/A| 1.00
836- |4 | N/A |10 | 512| N/A| 165.60| N/A| N/A| 3.49
837- |8 | N/A | 10| 512| N/A| 325.60| N/A| N/A| 6.87
838- |16 | N/A | 10| 512| N/A| 648.00| N/A| N/A| 13.67
832+
839833
840834###### Pre-training on multiple NVIDIA DGX-2H With 32G
841835
You can’t perform that action at this time.
0 commit comments