Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

PANDA dataset noisy WSIs and dataset splits #52

Open
undercutspiky opened this issue Jan 2, 2025 · 2 comments
Open

PANDA dataset noisy WSIs and dataset splits #52

undercutspiky opened this issue Jan 2, 2025 · 2 comments
Labels
question Further information is requested

Comments

@undercutspiky
Copy link

In the paper, you cite Kaggle and a paper for removing WSIs with noisy labels. I couldn't find a list of files with noisy labels that can be safely removed. Would it be possible for you to share the list of files you used?
Also, it'd be great if you could also share the exact train, val, and test splits used for PANDA so that I can just cite your work for using the specific WSIs and the split.

Thank you.

@HelloWorldLTY
Copy link

Same question here, I also wonder if it is possible to access datasets and paired labels used in this figure:
image

Thanks a lot!!!

@Richarizardd Richarizardd added the question Further information is requested label Jan 14, 2025
@undercutspiky
Copy link
Author

undercutspiky commented Jan 17, 2025

I was pointed to the splits used in their PANTHER paper by the corresponding author: https://github.com/mahmoodlab/PANTHER/tree/main/src/splits/classification/panda_wholesight

However, there are 2 problems:

  1. The splits seem to be 60:20:20 instead of 80:10:10 (the PANTHER paper says that they are 80:10:10).

  2. Minor: those splits have 9552 WSIs instead of 9555. PANTHER paper doesn't say anything about the total number of slides but this paper says that the total number of WSIs are supposed to be 9555.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
question Further information is requested
Projects
None yet
Development

No branches or pull requests

3 participants