Addition of BUCTD Models by n-poulsen · Pull Request #2952 · DeepLabCut/DeepLabCut

n-poulsen · 2025-04-11T08:12:42Z

BUCTD Pose Estimation Models

BUCTD is a state of the art crowded animal (and human) pose estimation algorithm. This PR serves to add this directly to the DeepLabCut code base. Here is the stand alone paper code for single image inference. This PR also expands significantly the code to track individuals in videos.

Paper: Rethinking pose estimation in crowds: overcoming the detection information
bottleneck and ambiguity

Ref:

@InProceedings{Zhou_2023_ICCV,
    author    = {Zhou, Mu and Stoffl, Lucas and Mathis, Mackenzie Weygandt and Mathis, Alexander},
    title     = {Rethinking Pose Estimation in Crowds: Overcoming the Detection Information Bottleneck and Ambiguity},
    booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
    month     = {October},
    year      = {2023},
    pages     = {14689-14699}
}

Tracking Performance

The following video was produced through the example notebook: examples/COLAB/Demo_BUCTD_and_CTD_tracking.ipynb. The model was trained on 78 images containing the 3 mice, with the default parameters for the model.

Models

Both CoAM and PreNet BUCTD architectures are added. Some base PreNet architectures are added (most notably ctd_prenet_cspnext_m and ctd_prenet_cspnext_x), but of course any backbone/pose head can be used as a PreNet BUCTD model.

Training and Model Confituration

BUCTD models are trained using generative sampling. The configuration for generative sampling during training is stored in the pytorch_config under the data: gen_sampling key:

data:
  gen_sampling:
    keypoint_sigmas: 0.1

BUCTD models require conditions from a bottom-up for evaluation. This can be configured through the data key as well:

# Example: Loading the predictions for snapshot-250.pt of shuffle 1.
data:
  conditions:
    shuffle: 1
    snapshot: snapshot-250.pt

# Example: Loading the predictions for the last snapshot of shuffle 6.
data:
  conditions:
    shuffle: 6
    snapshot_index: -1

Tracking

One of the big advantages of having a CTD model is that it can be used to track individuals directly! Let's say you have the pose for your animals at frame T. Then you can use those poses as conditions for frame T+1, and let your CTD model simply "update" the poses depending on how much your mice moved.

In the simplest scenario, you only need to run the BU model on the first frame, and then the CTD model takes over for inference and tracking:

Run the BU model to generate conditions for the 1st frame of the video
For every frame after that, use the predictions from the previous frame as conditions

However, this may not fit your scenario perfectly. Maybe all the mice aren't present in the first frame, and if they aren't detected by the BU model they'll never be tracked. Maybe at some point the CTD model makes an error and you lose track of a mouse. There are some options to deal with this:

Run the BU model every time at least one animal is not detected (if you expect N animals to be in the video and you only detect N-1 animals, run the BU model):
- In this case, the predictions from the BU model need to be "merged in" to the existing N-1 tracks
- We can merge them in by using a similarity score between poses (OKS) which ranges from 0 to 1
- You likely don't want to run the BU model every frame, as this would slow down inference.
Run the BU model every K frames in case new animal appears

Docs & Examples

Docs were added for different approaches to pose estimation in: docs/pytorch/architectures.md
A new COLAB notebook was added to train a CTD model: examples/COLAB/Demo_BUCTD_and_CTD_tracking.ipynb

Bug fixes & Improvements

calc_object_keypoint_similarity: allow users to pass arrays to have different OKS sigmas for each keypoint
users can now get the scorer from the DLCLoader and a Snapshot
Loaders have a method to list snapshots

…derived bbox

MMathisLab

🚀🔥

LucZot and others added 30 commits March 21, 2024 09:58

add code of hrnet + coam backbone

b57d849

refactor and finish hrnet+coam integration

0690899

add ctd configs

c6e6668

initial work on generative sampling

78ebb57

buctd cont I

4503b87

prepare dataloader for loading cond pose with gen sampling

ec4aa87

adapt hrnet-coam output to hrnet code

6c0cf37

put kpt encoders into modules

704a0c6

add kpt encoder to hrnet_coam

a53ba27

small additions

964dd5c

init CTD inference

3e85fd1

test CTD training - I

8c94c7b

buctd training!

107ea70

remove one for loop in _get_condition_matrix

f8b7fa9

optim trials for cond kpt encoding

9bb4ae6

add ctd evaluation based on loaded bu predictions

ac29688

use only overlapping individuals for swapping error

bb31cc1

add marmoset bu path

ee7918f

Merge branch 'main' into lucas/buctd_integration

b83e8ec

scale cond_kpts in collate

e769277

switch back to original function for creating conditional matrix

006b13a

update benchmark scripts

1ccf705

add hflip to internal benchmark script

a4973c9

pad with black pixels instead of context for CTD + add margin for BU-…

304f9c2

…derived bbox

add script for testing ctd performance with coco api

0b4be31

make CTD test inference compatible with previous code

c93c1bd

add bu name to output path names for CTD test inference

9c9ef50

add bu name to output path names for CTD test inference - fix

4de92f9

update BUCTD training with new TD developments

fbbaf13

merged main

5a432b5

n-poulsen and others added 11 commits April 11, 2025 11:27

update BUCTD colab

f54cc47

fix COLAB rendering

4ff3d95

fix COLAB rendering

bd2d14d

fix COLAB rendering

28ff47f

fix pip install in COLAB

a04a9bd

skip notebooks for codespell

c42fe22

install deeplabcut with --pre in colab

1d92522

fix tests

b0dae22

Merge branch 'main' into lucas/buctd_v2

9696a4b

fix failing tests on windows

9eec042

fix tests windows

a42c61d

MMathisLab added the maDLC label Apr 11, 2025

LucZot mentioned this pull request Apr 12, 2025

Inquiry on Integration Timeline for BUCTD in DeepLabCut amathislab/BUCTD#18

Closed

MMathisLab approved these changes Apr 14, 2025

View reviewed changes

Update architectures.md

1d67089

AlexEMG approved these changes Apr 14, 2025

View reviewed changes

n-poulsen and others added 6 commits April 14, 2025 10:19

deal with absolute windows paths

d6b3f74

Version update

0fc92a0

fix when paths are converted to windows

6a1e0a9

Merge branch 'main' into lucas/buctd_v2

3900605

fix windows tests

fb7c647

Merge branch 'main' into lucas/buctd_v2

44a54fb

MMathisLab merged commit cfce4ba into main Apr 14, 2025

MMathisLab added the do not delete label Apr 15, 2025

maximpavliv mentioned this pull request May 5, 2025

ValueError: Misconfigured conditions in the pytorch_config: None. #2970

Closed

2 tasks

maximpavliv added the CTD Contidional Top-Down label May 15, 2025

This was referenced May 21, 2025

Fix deeplabcut.analyze_images() with CTD model #2990

Merged

BUCTD and CTD tracking #2987

Closed

MMathisLab deleted the lucas/buctd_v2 branch June 15, 2025 13:37

deruyter92 mentioned this pull request May 21, 2026

Add 3.0 changelog #3340

Merged

1 task

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Addition of BUCTD Models#2952

Addition of BUCTD Models#2952
MMathisLab merged 114 commits into
mainfrom
lucas/buctd_v2

n-poulsen commented Apr 11, 2025 •

edited by MMathisLab

Loading

Uh oh!

MMathisLab left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Uh oh!

Conversation

n-poulsen commented Apr 11, 2025 • edited by MMathisLab Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

BUCTD Pose Estimation Models

Tracking Performance

Models

Training and Model Confituration

Tracking

Docs & Examples

Bug fixes & Improvements

Uh oh!

MMathisLab left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

n-poulsen commented Apr 11, 2025 •

edited by MMathisLab

Loading