Trainer cannot handle 1d tensor when return results from test_epoch_end #5979
-
🐛 Bug

When the trainer's run_test() is called, it cannot properly handle a 1D tensor in the results dictionary returned from test_epoch_end. The following error occurs:

/usr/local/lib/python3.6/dist-packages/pytorch_lightning/trainer/trainer.py in run_test(self)
ValueError: only one element tensors can be converted to Python scalars

Please reproduce using the BoringModel

To Reproduce

To reproduce with the BoringModel, only test_epoch_end needs to be replaced:

def test_epoch_end(self, outputs) -> None:
    torch.stack([x["y"] for x in outputs]).mean()
    f1_score = torch.tensor([1, 1, 1, 1])
    return {'f1_score': f1_score}

Expected behavior

def run_test(self):
# remove the tensors from the eval results
for i, result in enumerate(eval_loop_results):
if isinstance(result, dict):
for k, v in result.items():
if isinstance(v, torch.Tensor):
# should check if you can call .item()
                    result[k] = v.cpu().item()

Environment

Additional context
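The expected-behavior snippet above can be sketched as a standalone function (a sketch, assuming eval_loop_results is a list of result dicts; sanitize_results is a hypothetical name, and numel() is used to distinguish scalar from 1D tensors):

```python
import torch

def sanitize_results(eval_loop_results):
    """Sketch of the suggested fix: .item() is only valid for one-element
    tensors, so fall back to .tolist() for 1D results like f1_score."""
    for result in eval_loop_results:
        if isinstance(result, dict):
            for k, v in result.items():
                if isinstance(v, torch.Tensor):
                    v = v.cpu()
                    # only call .item() when the tensor is a scalar
                    result[k] = v.item() if v.numel() == 1 else v.tolist()
    return eval_loop_results
```

With the reproduction above, sanitize_results([{'f1_score': torch.tensor([1, 1, 1, 1])}]) would yield [{'f1_score': [1, 1, 1, 1]}] instead of raising the ValueError.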
Replies: 6 comments 1 reply
-
Hi
You should see a warning somewhere:
Only scalar tensors are supported.
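One way around that warning is to reduce the 1D tensor to scalars before logging (a sketch with illustrative values; the per-key self.log call mentioned in the comment is the usual Lightning logging hook, not shown running here):

```python
import torch

# Per-class F1 values as in the report above (values chosen to be
# exactly representable in float32)
f1_score = torch.tensor([1.0, 0.5, 0.25, 0.75])

# Each element is a one-element tensor, so .item() is valid and each
# value could be logged individually, e.g. self.log(f"f1_class_{i}", s)
scalars = {f"f1_class_{i}": s.item() for i, s in enumerate(f1_score)}

# Or reduce the whole vector to a single scalar first
macro_f1 = f1_score.mean().item()
```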
-
I think returning from test_epoch_end is very useful functionality when you want to do any post-processing with the results.
-
For example, in multi-GPU mode, if we allowed the user to return results, we would be missing the information about what to do with the data: how it should be collected, synced, or reduced.
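A minimal sketch of that ambiguity, simulating two processes locally with plain torch (this is not the Lightning internals, just the shape of the problem):

```python
import torch

# One 1D result tensor per (simulated) process. Given only the raw
# returned tensors, the framework cannot know which combination the
# user wants:
per_process = [torch.tensor([1.0, 0.5]), torch.tensor([0.0, 1.0])]

# elementwise mean across processes (a "reduce")
reduced = torch.stack(per_process).mean(dim=0)

# concatenation across processes (a "gather")
gathered = torch.cat(per_process)
```

Both results are plausible, which is why the logging API asks the user to state the sync/reduction behavior explicitly instead of guessing.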
-
Thanks for the explanation. I will try the
-
Sorry, it's not called "write"; the warning seems to have the wrong name. There is currently
-
Hey @lzrpotato,
From 1.2, Trainer and Lightning will have a predict function. It is still in
Best,
Because for example in multi gpu mode, if we would allow the user to return, then we're missing the information what to do with the data, how the data is collected and synced, or reduced or whatever.
The logging API offers reduction and syncing; you specify how via its arguments.
On the other hand, self.write offers a way to collect all results. There will also be a prediction API in 1.2. #5752
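Conceptually, "collecting all results" amounts to accumulating per-batch outputs and combining them once at the end, which is roughly what a prediction API does for you. A pure-torch sketch with illustrative data (not the actual self.write or predict implementation):

```python
import torch

# Outputs produced batch by batch during a test/predict loop (illustrative)
batch_outputs = [torch.tensor([0, 1]), torch.tensor([1, 1]), torch.tensor([0, 0])]

# Collect everything into one tensor for post-processing afterwards
all_preds = torch.cat(batch_outputs)
```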
cc @tchaton