wip: add detr keypoint architecture #1182

jveitchmichaelis · 2025-10-23T20:54:19Z

@bw4sz no direct training integration yet, but this is what I'm thinking for the modeling.

The implementation looks verbose, but that's because a lot of the pieces are copied from transformers with very minor changes - like a reference to bbox changed to point. To modify the arch we need:

DeformableDetrKeypointConfig: Same as object detection but explicit parameter for point loss cost.
DeformableDetrKeypointDetectionOutput: Mostly similar but name changes.
DeformableDetrForKeypointDetection: Model, changes number of decoder outputs to 2 and removes some bbox specific bits.
DeformableDetrKeypointMatcher: Hungarian matcher that uses L1 or L2 cost instead of IoU.
DeformableDetrKeypointLoss: L1 or L2.
DeformableDetrKeypointImageProcessor: Normalizes keypoints to relative pixel coords and some other bits.

I don't anticipate checking in the current test suite as the overfit test takes a minute or two to run, but it does converge.

jveitchmichaelis · 2025-10-23T22:43:19Z

High level:

Dataset support, initially derived from existing box datasets where we can derive keypoints from box centers.
Configuration task support which is tied to architecture, e.g. task = boxes by default (allows retinanet, detr), but can be task = points (detr), polygons (?)
Modeling support (create/load models should open the appropriate architecture)
Transforms + augmentation should be minimal for now, but ensure that they support keypoints if passed in
Training support - ensure that forward pass works with .fit() and all functions in main work
Metrics should be modified to be task-specific
Evaluate should also allow keypoint precision/recall based on matching metrics, with a threshold for pixel distance instead of IoU

codecov · 2025-10-23T23:08:04Z

Codecov Report

❌ Patch coverage is 78.57143% with 138 lines in your changes missing coverage. Please review.
✅ Project coverage is 85.52%. Comparing base (915a945) to head (4ed7f25).
⚠️ Report is 23 commits behind head on main.

Files with missing lines	Patch %	Lines
src/deepforest/models/keypoint.py	79.48%	64 Missing ⚠️
src/deepforest/datasets/training.py	71.42%	22 Missing ⚠️
src/deepforest/main.py	72.60%	20 Missing ⚠️
src/deepforest/evaluate.py	77.61%	15 Missing ⚠️
src/deepforest/predict.py	73.33%	8 Missing ⚠️
src/deepforest/utilities.py	80.95%	4 Missing ⚠️
src/deepforest/augmentations.py	57.14%	3 Missing ⚠️
src/deepforest/callbacks.py	75.00%	1 Missing ⚠️
src/deepforest/visualize.py	75.00%	1 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #1182      +/-   ##
==========================================
- Coverage   87.38%   85.52%   -1.86%     
==========================================
  Files          20       22       +2     
  Lines        2569     3165     +596     
==========================================
+ Hits         2245     2707     +462     
- Misses        324      458     +134

Flag	Coverage Δ
unittests	`85.52% <78.57%> (-1.86%)`	⬇️

Flags with carried forward coverage won't be shown. Click here to find out more.

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

jveitchmichaelis · 2025-10-31T19:07:07Z

@bw4sz integration is done, I'll add tests. Lots of TODOs dotted around which relate to tidying up and merging some duplicated code, but I anticipate polygons/segmentation will follow in a similar way.

The pre-commit failure is weird. It thinks that pandas is undefined, but the import is unchanged and ruff passes locally.

jveitchmichaelis · 2025-11-05T07:58:40Z

Currently training a DETR backbone on the lidar pretrain dataset, will swap that in once done.

jveitchmichaelis · 2025-11-07T21:11:26Z

src/deepforest/datasets/training.py


    def load_image(self, idx):
-        img_name = os.path.join(self.root_dir, self.image_names[idx])
+        img_name = os.path.join(self.root_dir, os.path.basename(self.image_names[idx]))


Will remove this, or make it an option.

jveitchmichaelis force-pushed the keypoint branch 4 times, most recently from 37e04fb to 5d3e5f5 Compare October 23, 2025 22:38

jveitchmichaelis force-pushed the keypoint branch from d6f90e9 to bd9a09a Compare November 7, 2025 21:03

jveitchmichaelis commented Nov 7, 2025

View reviewed changes

add detr keypoint architecture

078b545

jveitchmichaelis force-pushed the keypoint branch 3 times, most recently from 8eb5caf to 53820cd Compare November 7, 2025 22:42

integrate keypoints

4ed7f25

jveitchmichaelis force-pushed the keypoint branch from 53820cd to 4ed7f25 Compare November 8, 2025 18:59

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

wip: add detr keypoint architecture #1182

wip: add detr keypoint architecture #1182

jveitchmichaelis commented Oct 23, 2025 •

edited

Loading

Uh oh!

jveitchmichaelis commented Oct 23, 2025 •

edited

Loading

Uh oh!

codecov bot commented Oct 23, 2025 •

edited

Loading

Uh oh!

jveitchmichaelis commented Oct 31, 2025 •

edited

Loading

Uh oh!

jveitchmichaelis commented Nov 5, 2025

Uh oh!

jveitchmichaelis Nov 7, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

wip: add detr keypoint architecture #1182

Are you sure you want to change the base?

wip: add detr keypoint architecture #1182

Conversation

jveitchmichaelis commented Oct 23, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

jveitchmichaelis commented Oct 23, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

codecov bot commented Oct 23, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

jveitchmichaelis commented Oct 31, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

jveitchmichaelis commented Nov 5, 2025

Uh oh!

jveitchmichaelis Nov 7, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

jveitchmichaelis commented Oct 23, 2025 •

edited

Loading

jveitchmichaelis commented Oct 23, 2025 •

edited

Loading

codecov bot commented Oct 23, 2025 •

edited

Loading

jveitchmichaelis commented Oct 31, 2025 •

edited

Loading