Adding Imagenet Example by PareesaMS · Pull Request #680 · deepspeedai/DeepSpeedExamples

PareesaMS · 2023-08-09T20:03:23Z

This example activated DeepSpeed on the implementation of training a set of popular model architectures on ImageNet dataset. The models include ResNet, AlexNet, and VGG, and the
baseline implementation could be found at pytorch examples Github repository. DeepSpeed activation allows for ease in
running the code in distributed manner, allowing for easily applying fp16 quantization benefitting Zero stage1 memory reduction.

yaozhewei · 2023-08-16T21:54:35Z

+## DeepSpeed Optimizations
+
+Applying fp16 quantization and Zero stage 1 memory optimization we were able to reduce the required memory. The table bellow summarizes the results of running resnet 50 on one
+node 16 V100 GPUs:


on a DGX-1 node (with 16 V100 GPUs)

yaozhewei · 2023-08-16T21:55:14Z

+------------------|-------------------
+
+Furthermore, the memory optimization had no adverse impact on accuracy, a point illustrated by the graph below.
+![resnet-plot](C:\Users\pagolnar\OneDrive - Microsoft\Reports-presentations\Resnet-plot)


the image link is wrong.

yaozhewei · 2023-08-16T21:55:57Z

+Baseline| ? | -
+Baseline with DS activated | 1.66 | -
+DS + fp16 | 1.04 | ?
+Ds + fp16 + Zero 1 | 0.81 | ?


besides memory, how about the training speed

Fixed the table. Did not measure the training speed. Should I repeat the experiments?

yaozhewei · 2023-08-16T21:56:52Z

+ImageNet dataset is large and time-consuming to download. To get started quickly, run `main.py` using dummy data by "--dummy". It's also useful for training speed benchmark. Note that the loss or accuracy is useless in this case.
+
+```bash
+python main.py -a resnet18 --dummy


where is deepspeed?

yaozhewei · 2023-08-16T21:58:00Z

@@ -0,0 +1,2 @@
+torch


deepspeed is also a requirement?

Definitely. Fixed the issue

yaozhewei · 2023-08-16T21:59:49Z

+Baseline| ? | -
+Baseline with DS activated | 1.66 | -
+DS + fp16 | 1.04 | ?
+Ds + fp16 + Zero 1 | 0.81 | ?


table format is not correct. take a look at rendered website

Co-authored-by: Michael Wyatt <mrwyattii@gmail.com>

Adding Imagenet example

fe9abf1

PareesaMS requested review from RezaYazdaniAminabadi, ShadenSmith, arashb, awan-10, conglongli, duli2012, eltonzheng, jeffra, minjiaz, mrwyattii, samyam, tjruwase, xiaoxiawu-microsoft and yaozhewei as code owners August 9, 2023 20:03

yaozhewei reviewed Aug 16, 2023

View reviewed changes

PareesaMS added 9 commits October 3, 2023 21:50

Fix some typos

52f54a7

Fix issues with the table and image

9211898

Fixes some issues in the parameters

2031a0c

Fix the plot

751f34a

Typo

d9b9af2

Fixing some alignments

9886e84

alignments

c75fced

Move resnetplot to assets

c8eb49b

Remove extra image

5579393

mrwyattii approved these changes Nov 8, 2023

View reviewed changes

Merge branch 'master' into dev/pagolnar/ex_imagenet

08127ce

mrwyattii merged commit ccb2a34 into master Nov 8, 2023

hwchen2017 pushed a commit that referenced this pull request Jun 8, 2025

Adding Imagenet Example (#680)

0dd4c6e

Co-authored-by: Michael Wyatt <mrwyattii@gmail.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Adding Imagenet Example#680

Adding Imagenet Example#680
mrwyattii merged 11 commits into
masterfrom
dev/pagolnar/ex_imagenet

PareesaMS commented Aug 9, 2023

Uh oh!

yaozhewei Aug 16, 2023

Uh oh!

PareesaMS Oct 4, 2023

Uh oh!

yaozhewei Aug 16, 2023

Uh oh!

PareesaMS Oct 4, 2023

Uh oh!

yaozhewei Aug 16, 2023

Uh oh!

PareesaMS Oct 4, 2023

Uh oh!

yaozhewei Aug 16, 2023

Uh oh!

PareesaMS Oct 4, 2023

Uh oh!

yaozhewei Aug 16, 2023

Uh oh!

PareesaMS Oct 4, 2023

Uh oh!

yaozhewei Aug 16, 2023

Uh oh!

PareesaMS Oct 4, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

PareesaMS commented Aug 9, 2023

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants