
Add conditional replacement of @torch.inference_mode for inference on AMD DirectML GPUs #3295

Open

deruyter92 wants to merge 9 commits into main from jaap/amd_direct_ml_inference

Conversation

@deruyter92 (Collaborator)

Motivation
Currently, our inference runners use @torch.inference_mode, which is not supported on AMD GPUs running inference through DirectML. Essentially, @torch.inference_mode is a stricter version of @torch.no_grad; it is newer and faster and works for most users. However, since it does not work for AMD DirectML users, it is worthwhile to conditionally replace it with @torch.no_grad when necessary.
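To illustrate why @torch.inference_mode is the stricter of the two: tensors created under it are "inference tensors" that can never be used in autograd afterwards, whereas @torch.no_grad merely skips gradient tracking:

```python
import torch

x = torch.ones(3)

with torch.no_grad():
    y = x * 2  # regular tensor; gradients simply were not recorded

with torch.inference_mode():
    z = x * 2  # inference tensor, subject to extra restrictions

y.requires_grad_(True)    # fine
# z.requires_grad_(True)  # raises: inference tensors cannot be used in autograd
```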

solves #3289

Changes
This PR replaces @torch.inference_mode with a conditional @_no_grad_decorator, controlled by the env variable DLC_DIRECTML_NO_GRAD. The decorator resolves to @torch.no_grad for DirectML users who set the env var to "true"; otherwise it defaults to @torch.inference_mode (keeping the current behavior).
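A minimal sketch of the idea, assuming the decorator is resolved once at import time (the `inference` function below is a hypothetical stand-in for the actual runner methods):

```python
import os

import torch

# Opt-in flag for DirectML users; anything other than "true" keeps the default.
_USE_NO_GRAD = os.getenv("DLC_DIRECTML_NO_GRAD", "false").strip().lower() == "true"

# torch.no_grad for DirectML users who opted in, torch.inference_mode otherwise.
_no_grad_decorator = torch.no_grad if _USE_NO_GRAD else torch.inference_mode


@_no_grad_decorator()
def inference(model, batch):
    """Run a forward pass without gradient tracking."""
    return model(batch)
```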

Copilot AI (Contributor) left a comment


Pull request overview

Adds an opt-in workaround for AMD DirectML inference by conditionally using torch.no_grad() instead of torch.inference_mode() in the PyTorch inference runners, controlled via an environment variable.

Changes:

  • Introduces DLC_DIRECTML_NO_GRAD env var parsing and a conditional decorator to select no_grad vs inference_mode.
  • Applies the conditional decorator to InferenceRunner.inference and CTDInferenceRunner.inference in place of @torch.inference_mode().
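For an AMD DirectML user, opting into the workaround could look like the following (assuming, as sketched above, that the flag is read when the runner module is imported):

```python
import os

# Hypothetical usage: enable the no_grad fallback before importing DeepLabCut,
# so the env variable is already set when the decorator is resolved.
os.environ["DLC_DIRECTML_NO_GRAD"] = "true"

import deeplabcut  # noqa: E402
```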


@deruyter92 deruyter92 marked this pull request as ready for review April 28, 2026 08:57
@deruyter92 deruyter92 requested a review from C-Achard April 28, 2026 08:57
@C-Achard (Collaborator) left a comment


Good fix, thanks!
Just minor comments, otherwise LGTM

@C-Achard (Collaborator) left a comment


Thanks for the docs update!



Development

Successfully merging this pull request may close these issues.

[Bug] DeepLabCut 3.0 PyTorch with AMD GPU (DirectML) fails on ConvTranspose2d in inference_mode — Workaround: use no_grad
