Name	Name	Last commit message	Last commit date
Latest commit judyueshen upload notebooks for generation of figures (cell system submission) Oct 24, 2020 15f9266 · Oct 24, 2020 History 176 Commits
binder	binder	updated requirements	Aug 26, 2019
cellbox	cellbox	Minor formatting	Sep 26, 2020
configs	configs	Add example configs for LOO and S2C exprs	Sep 26, 2020
data	data	Create sparse data demo	Apr 23, 2020
manuscript	manuscript	upload notebooks for generation of figures (cell system submission)	Oct 24, 2020
scripts	scripts	Minor formatting	Sep 26, 2020
.gitignore	.gitignore	Add init support to grid search	Jun 8, 2020
.travis.yml	.travis.yml	Attempt to rename the 'pertbio' package to 'cellbox'	Sep 26, 2020
LICENSE	LICENSE	Initial commit	Aug 22, 2019
README.md	README.md	Update README.md	Sep 28, 2020
requirements.txt	requirements.txt	Update requirements.txt	Jul 10, 2020
test.py	test.py	Update test.py	Sep 26, 2020

Repository files navigation

CellBox

This is CellBox scripts developed in Sander lab.

Maintained by Bo Yuan, Judy Shen, and Augustin Luna.

If you want to discuss the usage or to report a bug, please use the 'Issues' function here on GitHub.

If you find CellBox useful for your research, please consider citing the corresponding publication. bioRxiv: link

For more information, please find our contact information here.

Quick Start

Easily try CellBox online with Binder

Go to: https://mybinder.org/v2/gh/dfci/CellBox/version_for_revision
From the New dropdown, click Terminal
Run the following command for a short example of model training process:

python scripts/main.py -config=configs/Example.random_partition.json

Alternatively, in project folder, do the same command

Installation

Install using pip

The following command will install cellbox from a particular branch using the '@' notation:

pip install git+https://github.com/dfci/CellBox.git@version_for_revision#egg=cellbox\&subdirectory=cellbox

Install using setup.py

Clone repository and in the cellbox folder run:

python3.6 setup.py install

Only python3.6 supported. Anaconda or pipenv is recommended to create python environment.

Now you can test if the installation is successful

import cellbox
cellbox.VERSION

Project Structure

Data files: in ./data/ folder

node_index.txt: names of each protein/phenotypic node.
expr_index.txt: information each perturbation condition (also see loo_label.csv).
expr.csv: Protein expression data from RPPA for the protein nodes and phenotypic node values. Each row is a condition while each column is a node.
pert.csv: Perturbation strength and target of all perturbation conditions. Used as input for differential equations.

cellbox package:

CellBox is defined in model.py
A dataset factory function for random parition and leave one out tasks
Some training util functions in tensorflow

One click model construction

Step 1: Create experiment json files (some examples can be found under ./configs/)

Make sure to specify the experiment_id and experiment_type
- experiment_id: name of the experiments, would be used to generate results folders
- experiment_type: currently available tasks are {"random partition", "leave one out (w/o single)", "leave one out (w/ single)", "full data", "single to combo"]}
Different training stages can be specified using stages and sub_stages in config file

Step 2: Use main.py to construct models using random partition of dataset

The experiment type configuration file is specified by --experiment_config_path or -config

python scripts/main.py -config=configs/Example.random_partition.cfg.json

Note: always run the script in the root folder.

A random seed can also be assigned by using argument --working_index or -i

python scripts/main.py -config=configs/Example.random_partition.cfg.json -i=1234

When training with leave-one-out validation, make sure to specify the drug index --drug_index or -drug to leave out from training.

Step 3: Analyze result files

You should see a experiment folder generated under results using the date and experiment_id.
Under experiment folder, you would see different models run with different random seeds
Under each model folder, you would have:
- record_eval.csv: log file with loss changes and time used.
- random_pos.csv: how the data was split (only for random partitions)
- best.W, best.alpha, best.eps: model parameters snapshot for each training stage
- best.test_hat: Prediction on test set, using the best model for each stage
- .ckpt files are the final models in tensorflow compatible format.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CellBox

Quick Start

Installation

Install using pip

Install using setup.py

Project Structure

Data files: in ./data/ folder

cellbox package:

One click model construction

Step 1: Create experiment json files (some examples can be found under ./configs/)

Step 2: Use main.py to construct models using random partition of dataset

Step 3: Analyze result files

About

Releases 5

Packages

Contributors 6

Languages

License

sanderlab/CellBox

Folders and files

Latest commit

History

Repository files navigation

CellBox

Quick Start

Installation

Install using pip

Install using setup.py

Project Structure

Data files: in ./data/ folder

cellbox package:

One click model construction

Step 1: Create experiment json files (some examples can be found under ./configs/)

Step 2: Use main.py to construct models using random partition of dataset

Step 3: Analyze result files

About

Resources

License

Stars

Watchers

Forks

Releases 5

Packages 0

Contributors 6

Languages

Packages