we provide an example script to run experiments with the features already extracted from our dataset:
- Run
run_deepDTnet.m: predict drug-target interactions, and evaluate the results with cross-validation. Note: See the "Tutorial" section below for a detailed instruction on how to run deepDTnet on your own dataset.
auc.m: evaluation scriptcompute_similarity.m: compute Jaccard similarity based on association network(i.e., drug-disease, drug-side-effect, and protein-disease networks)imf_train.mexa64: pre-built binary file of PU-Matrix Completion algorithmPUMC.m: predict drug-target interactions (DTIs) with PU-Matrix Completionrun_deepDTnet.m: example code of running deepDTnet for drug-target prediction
drug_dict.txt: list of drug unique identifier and drug namesprotein_dict.txt: list of protein unique identifier and protein namesdisease_dict.txt: list of disease unique identifier and disease namesse_dict.txt: list of side effect unique identifier and side effect namesdrugdrug.txt: Drug-Drug interaction matrixdrugDisease.txt: Drug-Disease association matrixdrugsideEffect.txt: Drug-SideEffect association matrixdrugsim1network.txt: Drug chemical similarity matrixdrugsim2network.txt: Drug therapeutic similarity matrixdrugsim3network.txt: Drug sequence similarity matrixdrugsim4network.txt: Drug biological processes similarity matrixdrugsim5network.txt: Drug cellular component similarity matrixdrugsim6network.txt: Drug molecular function similarity matrixproteinprotein.txt: Protein-Protein interaction matrixproteinDisease.txt: Protein-Disease association matrixproteinsim1network.txt: Protein sequence similarity matrixproteinsim2network.txt: Protein biological processes similarity matrixproteinsim3network.txt: Protein cellular component similarity matrixproteinsim4network.txt: Protein molecular function similarity matrix
We provided the pre-trained vector representations for the 15 networks, which were used to produce the results in our paper.
This directory contains code necessary to run the DNGR algorithm.
- Run
main.m: an example script to generate a low-dimensional vector representation of features for each node in the Drug chemical similarity network. Note: For different networks, you need to fine tune the parameters inBuildNets.m. The original version of this code base was originally from http://shelson.top/, and we owe many thanks to Shaosheng Cao for making his code available.
Supplementary_Table_3.xlsx: an excel table including 4,978 experimentally validated drug-target interactions connecting 732 approved drugs and 1,915 human targets.Supplementary_Table_4.xlsx: an excel table including all target lists for G-protein-coupled receptors (GPCRs), kinases, nuclear receptors (NRs), and ion channels (ICs).Supplementary_Table_5.xlsx: an excel table including novel drug-target interactions predicted by deepDTnet with score above the threshold.
We test the code on Ubuntu 16.04 with Matlab R2016a installed
- Put interaction/association/similarity matrices in the
data/folder. - Create a
simNet/folder underdeepDTnet/and runcompute_similarity.min thesrc/folder, which will compute the Jaccard similarity of drugs and proteins, based on the three association matrices mentioned above. - run
main.min theDNGR/corefolder for 2 interaction networks ,10 similarity networks in thedata/folder and 3 similarity networks in thesimNet/folder, which will learn the feature vectors of drugs and proteins for the 15 networks and save them in thefeature/folder. - Specify parameters and set the path of feature vectors in
run_deepDTnet.mand execute it. This script will predict the drug-target interactions and evaluate the results using a five-fold cross-validation.