spCellEval - Benchmarking Cell Phenotyping Methods in Spatial Proteomics

We present "spCellEval", a quantitative comparison of automated/semi-automated cell phenotyping methods for Spatial Proteomics datasets on a diverse set of 10 curated public datasets. The methods are compared with a list of label transfer metrics divided into 4 categories: classification performance, distribution recovery, stability and scalability. This benchmark acts as a foundation to evaluate and improve automated cell phenotyping.

Current Results Overview:

Getting Started

In order to reproduce the results, the raw datasets currently need to be downloaded from public repositories. Please refer to the public registered Stage 1 manuscript. IMMUCan is one example dataset.

Raw Dataset 
  │
  ▼
Preprocessing
  │
  ├───────────────────────┬───────────────────────┐
  ▼                       ▼                       ▼
Method 1                Method 2              Method n
  │                       │                       │
  ▼                       ▼                       ▼
pred_fold_{1-5}.csv   pred_fold_{1-5}.csv   pred_fold_{1-5}.csv
  │                       │                       │
  └───────────────────────┼───────────────────────┘
                          ▼
                  Evaluation Scripts
                          │
                          ▼
                        Results

Preprocessing

Preprocessing of each dataset can be found in src/preprocessing/datasets/<process_dataset.ipynb> Paths need to be adjusted.

For some datasets, multistack tiffs or channel_names have to be created. Please refer to src/preprocessing/

Running methods

Scripts to run each method are provided in src/<method>. For supervised method, create kfolds first using the run_kfold_creator.py file.

Datasets and parameter settings can be found in manuscript supplement.

For installation, and method specific details like runtime, please refer to each method's documentation.

The expected output from each method is a predictions_*.csv file for each fold chosen and a fold_time.txt recording running times if chosen.

Evaluation Scripts

The notebooks in src/metrics_scripts perform the evaluation over all methods. The code blocks withiin eval_mapping.ipynb goes through all the methods within a specified dataset and outputs a final_results.csv file that contains all of the metrics for different levels.

Adding your own method

To officially add your own method, please open an issue and provide us with the following to reproduce your method.

GitHub repo for the method
List of Parameters used (if any)
OPTIONAL: Your predictions (this speeds up the evaluation process)

Folder Structure to add your predictions in

results/
├── Dataset1/
│   ├── method1/
│   │    ├──predictions_*.csv
│   │    └──fold_times.txt
│   ├── method2/
│   ...
├── Dataset2/
└── Dataset3/

Name		Name	Last commit message	Last commit date
Latest commit History 210 Commits
img		img
src		src
.DS_Store		.DS_Store
.gitignore		.gitignore
.gitmodules		.gitmodules
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

spCellEval - Benchmarking Cell Phenotyping Methods in Spatial Proteomics

Current Results Overview:

Getting Started

Preprocessing

Running methods

Evaluation Scripts

Adding your own method

About

Uh oh!

Releases

Packages

Contributors 3

Uh oh!

Languages

License

SchapiroLabor/spCellEval

Folders and files

Latest commit

History

Repository files navigation

spCellEval - Benchmarking Cell Phenotyping Methods in Spatial Proteomics

Current Results Overview:

Getting Started

Preprocessing

Running methods

Evaluation Scripts

Adding your own method

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Uh oh!

Languages

Packages