You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: MxNet/Classification/RN50v1.5/README.md
+27-24Lines changed: 27 additions & 24 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -168,7 +168,7 @@ The following section lists the requirements that you need to meet in order to s
168
168
169
169
This repository contains Dockerfile which extends the MXNet NGC container and encapsulates some dependencies. Aside from these dependencies, ensure you have the following components:
@@ -585,18 +585,18 @@ The following sections provide details on how we achieved our performance and ac
585
585
586
586
**90 epochs configuration**
587
587
588
-
Our results were obtained by running 8 times the `./runner -n <number of gpus>-b 256--dtype float32` script forTF32and the `./runner -n <number of gpus>-b 256` script for mixed precision in the mxnet-20.12-py3 NGC container on NVIDIADGX A100 with (8x A100 80GB) GPUs.
588
+
Our results were obtained by running 8 times the `./runner -n <number of gpus>-b 512--dtype float32` script forTF32and the `./runner -n <number of gpus>-b 512` script for mixed precision in the mxnet-22.10-py3 NGC container on NVIDIADGX A100 with (8x A100 80GB) GPUs.
589
589
590
590
|**GPUs**|**Accuracy - mixed precision**|**Accuracy -TF32**|**Time to train - mixed precision**|**Time to train -TF32**|**Time to train - speedup**|
591
-
|:---:|:---:|:---:|:---:|:---:|:---:|
592
-
|1|77.185|77.184|14.6|31.26|2.13|
593
-
|8|77.185|77.184|1.8|4.0|2.12|
591
+
|:---:|:---:|:---:|:--:|:---:|:---:|
592
+
|1|77.185|77.184|8.75|29.39|3.36|
593
+
|8|77.185|77.184|1.14|3.82|3.35|
594
594
595
595
##### Training accuracy: NVIDIA DGX-1 (8x V100 16GB)
596
596
597
597
**90 epochs configuration**
598
598
599
-
Our results were obtained by running the `./runner -n <number of gpus>-b 96--dtype float32` training script forFP32and the `./runner -n <number of gpus>-b 192` training script for mixed precision in the mxnet-20.12-py3 NGC container on NVIDIADGX-1with (8x V100 16GB) GPUs.
599
+
Our results were obtained by running the `./runner -n <number of gpus>-b 96--dtype float32` training script forFP32and the `./runner -n <number of gpus>-b 192` training script for mixed precision in the mxnet-22.10-py3 NGC container on NVIDIADGX-1with (8x V100 16GB) GPUs.
600
600
601
601
|**GPUs**|**Accuracy - mixed precision**|**Accuracy -FP32**|**Time to train - mixed precision**|**Time to train -FP32**|**Time to train - speedup**|
602
602
|:---:|:---:|:---:|:---:|:---:|:---:|
@@ -641,18 +641,17 @@ Here are example graphs of FP32 and mixed precision training on 8 GPU 250 epochs
641
641
##### Training performance: NVIDIA DGX A100 (8x A100 80GB)
642
642
643
643
The following results were obtained by running the
0 commit comments