
A little performance drop when running this code, asking for the HigherHRNet version #17

Open
cucdengjunli opened this issue Dec 20, 2022 · 18 comments

Comments

@cucdengjunli commented Dec 20, 2022

[Screenshots: my experiment results vs. the official results reported in the paper]

Dear authors:
Thank you for your paper and code. When I run this project to reproduce your work, my result is about 2 mm worse than reported. Could you explain why?

Does your released code correspond to this setting: [5 views; mask; weights]?

My conda environment is shown in the screenshots below. My GPU is an RTX 3090, with CUDA 11.3 and torch 1.11.0.

[Screenshots: conda environment]

@cucdengjunli (Author)

This is my validation result on Panoptic. My training configuration is the same as /Faster-VoxelPose-main/configs/panoptic/jln64.yaml.

[Screenshot: validation results]

@cucdengjunli changed the title from "a little accuracy loss when running this code" to "A little performance drop when running this code" on Dec 20, 2022
@cucdengjunli (Author) commented Dec 21, 2022

The backbone you provide is ResNet, but the backbone mentioned in the paper is HRNet; maybe this is the reason. Let me swap the backbone and see the result.

Your core/config.py shows that you use HigherHRNet.
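For reference, a rough sketch of pointing the experiment config at a HigherHRNet backbone. The key names and checkpoint path below are guesses for illustration only; check core/config.py and configs/panoptic/jln64.yaml for the real ones.

```python
import yaml

# Load the released Panoptic config and swap the 2D backbone entry.
with open("configs/panoptic/jln64.yaml") as f:
    cfg = yaml.safe_load(f)

# Assumed key names -- the release ships with a Pose ResNet backbone by default.
cfg["BACKBONE_MODEL"] = "higher_hrnet"
cfg.setdefault("NETWORK", {})["PRETRAINED_BACKBONE"] = "models/pose_higher_hrnet_w32_512.pth"  # assumed path

# Write a separate config so the original stays untouched.
with open("configs/panoptic/jln64_hrnet.yaml", "w") as f:
    yaml.safe_dump(cfg, f)
```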

@cucdengjunli changed the title from "A little performance drop when running this code" to "A little performance drop when running this code, asking for the HigherHRNet version" on Dec 21, 2022
@gpastal24

Hi, how did you manage to train the model on an RTX 30-series GPU? Did you make any changes to the code?

@cucdengjunli (Author)

> Hi, how did you manage to train the model on an RTX 30-series GPU? Did you make any changes to the code?

microsoft/voxelpose-pytorch#19

I tried this and it worked.

@gpastal24 commented Feb 7, 2023

> > Hi, how did you manage to train the model on an RTX 30-series GPU? Did you make any changes to the code?
>
> microsoft/voxelpose-pytorch#19
>
> I tried this and it worked.

I did something similar, eventually. I returned (and backpropagated) the total loss at each iteration, and I got an 18.6 mm 3D error.

```python
# Backpropagate the combined ("total") loss at every batch iteration.
loss = loss_dict["total"]
loss_2d = loss_dict["2d_heatmaps"]
loss_1d = loss_dict["1d_heatmaps"]
loss_bbox = loss_dict["bbox"]
loss_joint = loss_dict["joint"]

# Update the running-average meters used for logging.
losses.update(loss.item())
losses_2d.update(loss_2d.item())
losses_1d.update(loss_1d.item())
losses_bbox.update(loss_bbox.item())
losses_joint.update(loss_joint.item())

optimizer.zero_grad()
loss.backward()
optimizer.step()
```


@Mosh-Wang

> I did something similar, eventually. I returned (and backpropagated) the total loss at each iteration, and I got an 18.6 mm 3D error.

Hello, may I ask exactly how you changed the loss part? How is loss_dict defined?

@gpastal24

@Mosh-Wang I just changed the code to backprop the total loss at every batch iteration. The loss_dict is returned by the FVP model. I didn't do anything fancy.

```python
loss_dict = {
    "2d_heatmaps": loss_2d,
    "1d_heatmaps": loss_1d,
    "bbox": 0.1 * loss_bbox,
    "joint": loss_joint,
    "total": loss_2d + loss_1d + 0.1 * loss_bbox + loss_joint,
}
```

@Mosh-Wang commented Mar 31, 2023

> @Mosh-Wang I just changed the code to backprop the total loss at every batch iteration. The loss_dict is returned by the FVP model. I didn't do anything fancy.

Thank you very much for your reply. May I ask one more question? When training on Panoptic, I use TRAIN_HEATMAP_SRC: 'image' and TEST_HEATMAP_SRC: 'image' from the original config, and I get the error below. Do you also use this setting, or have you changed it? What do you think is the cause?

[Screenshot: error traceback]

@gpastal24 commented Mar 31, 2023

@Mosh-Wang

Change `input_heatmap = torch.zeros((1, 1, 1))` to `input_heatmaps`, or change `'input_heatmap': input_heatmaps,` to `'input_heatmap': input_heatmap,`. They are not used anyway if you are using the images for training and testing. Just make a quick check, before waiting for a whole epoch, that this resolves the problem for both training and validating.
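For concreteness, here is a minimal sketch of that rename, assuming the mismatch sits where the dataset builds its return dict; the surrounding code is illustrative, not copied from the repo.

```python
import torch

# Option 1: rename the placeholder tensor so it matches the key built below.
input_heatmaps = torch.zeros((1, 1, 1))

# Option 2 (alternative): keep the original variable name and reference it in the dict:
# input_heatmap = torch.zeros((1, 1, 1))

sample = {
    'input_heatmap': input_heatmaps,  # with option 2: 'input_heatmap': input_heatmap,
}

# The placeholder is never consumed when HEATMAP_SRC is 'image'; the two names
# only need to agree so the dataloader can collate the batch without a NameError.
```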

@gpastal24

@Mosh-Wang Regarding the other question: I don't know if it matters that much. If you try both approaches, would you be kind enough to let us know whether training the 2D network as well increases the performance of the method?

@zaie commented Apr 26, 2023

Maybe the omitted loss_off causes the performance drop.
#26
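If someone wants to test that hypothesis, a rough sketch of re-adding the offset term to the objective might look like the following. Note that `loss_off` is assumed to come from a restored offset branch and is not part of the current release.

```python
def build_loss_dict(loss_2d, loss_1d, loss_bbox, loss_joint, loss_off):
    """Illustrative only: same weighting as the released code, plus the offset term."""
    total = loss_2d + loss_1d + 0.1 * loss_bbox + loss_joint + loss_off
    return {
        "2d_heatmaps": loss_2d,
        "1d_heatmaps": loss_1d,
        "bbox": 0.1 * loss_bbox,
        "joint": loss_joint,
        "offset": loss_off,
        "total": total,
    }
```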

@cucdengjunli (Author)

I reproduced the HigherHRNet version of the backbone.

@gpastal24

@cucdengjunli

> I reproduced the HigherHRNet version of the backbone.

Did you get the same results as the paper?

@AlvinYH (Owner) commented Jul 23, 2023

Hi, @cucdengjunli. Thanks for your interest in our work. We've modified the code, and you can pull the recent release. Yes, we made several changes to the model architecture (removed the offset branch and reduced the feature dimension in the weight_net), so the experimental results are slightly different from those in the original paper. Specifically, on the Panoptic dataset the MPJPE increases a little (+0.15 mm), while the new model yields an improvement of 1.44 in terms of AP25. You can download the pre-trained checkpoints. We'll revise our paper to specify these alterations. Also, thanks for pointing out our mistake: we did use Pose ResNet for training on the Panoptic dataset instead of HigherHRNet. We'll fix this typo in the final version. Using HigherHRNet is expected to further reduce the errors.

@cucdengjunli (Author)

> > I reproduced the HigherHRNet version of the backbone.
>
> Did you get the same results as the paper?

Yes, MPJPE@500mm: 17.966.

@cucdengjunli (Author)

> Hi, @cucdengjunli. Thanks for your interest in our work. We've modified the code, and you can pull the recent release. […]

thank you!

@CodeCrusader66

> Hi, @cucdengjunli. Thanks for your interest in our work. We've modified the code, and you can pull the recent release. […]

I have also reproduced the HigherHRNet version of the code 😄. The result is the same as reported in your paper. May I send you a merge request?
