Skip to content

SchapiroLabor/spCellEval

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

spCellEval - Benchmarking Cell Phenotyping Methods in Spatial Proteomics

We present "spCellEval", a quantitative comparison of automated/semi-automated cell phenotyping methods for Spatial Proteomics datasets on a diverse set of 10 curated public datasets. The methods are compared with a list of label transfer metrics divided into 4 categories: classification performance, distribution recovery, stability and scalability. This benchmark acts as a foundation to evaluate and improve automated cell phenotyping.

Alt text

Current Results Overview:

Alt text

Getting Started

In order to reproduce the results, the raw datasets currently need to be downloaded from public repositories. Please refer to the public registered Stage 1 manuscript. IMMUCan is one example dataset.

Raw Dataset 
  │
  ▼
Preprocessing
  │
  ├───────────────────────┬───────────────────────┐
  ▼                       ▼                       ▼
Method 1                Method 2              Method n
  │                       │                       │
  ▼                       ▼                       ▼
pred_fold_{1-5}.csv   pred_fold_{1-5}.csv   pred_fold_{1-5}.csv
  │                       │                       │
  └───────────────────────┼───────────────────────┘
                          ▼
                  Evaluation Scripts
                          │
                          ▼
                        Results

Preprocessing

Preprocessing of each dataset can be found in src/preprocessing/datasets/<process_dataset.ipynb> Paths need to be adjusted.

For some datasets, multistack tiffs or channel_names have to be created. Please refer to src/preprocessing/

Running methods

Scripts to run each method are provided in src/<method>. For supervised method, create kfolds first using the run_kfold_creator.py file.

Datasets and parameter settings can be found in manuscript supplement.

For installation, and method specific details like runtime, please refer to each method's documentation.

The expected output from each method is a predictions_*.csv file for each fold chosen and a fold_time.txt recording running times if chosen.

Evaluation Scripts

The notebooks in src/metrics_scripts perform the evaluation over all methods. The code blocks withiin eval_mapping.ipynb goes through all the methods within a specified dataset and outputs a final_results.csv file that contains all of the metrics for different levels.

Adding your own method

To officially add your own method, please open an issue and provide us with the following to reproduce your method.

  1. GitHub repo for the method
  2. List of Parameters used (if any)
  3. OPTIONAL: Your predictions (this speeds up the evaluation process)

Folder Structure to add your predictions in

results/
├── Dataset1/
│   ├── method1/
│   │    ├──predictions_*.csv
│   │    └──fold_times.txt
│   ├── method2/
│   ...
├── Dataset2/
└── Dataset3/

About

Repository for spCellEval (Benchmarking Cell Phenotyping Methods in Spatial Proteomics)

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 3

  •  
  •  
  •