Scripts to execute various steps in the PDC Hi-C data analysis.
Scripts were run using PBS job scheduler or in command line.
Each script contains comments about its functionality.
0a.refgenie.sh- download genomic sequences and assets using refgenie0b.make_annotation_folder.sh- softlink refgeine assets into one folder0c.restriction_sites.sh- create restriction site BED files
1.fetchdata.sh- download data from SRA usingsratoolkit2a.cutadapt.sh- trim reads to certain length usingcutadapt2b.gzipFiles.sh- compress files usinggzip3.makePDXs.sh- combine human and mouse reads4a.splitPDXs-xenome.sh- split human and mouse reads usingxenome4b.splitPDXs-getHg38Reads.sh- extract human reads using the combined alignment strategy
-
5a.runHiC-Pro- scripts to run HiC-Pro pipelineconfig-system.txt- an example of configuration file to install HiC-Proconfig_hicpro_hg38.txt- an example of sample-specific configuration file for HiC-Pro1.run_HiCPro_hg38.sh- a script creating the following files using settings fromconfig_hicpro_hg38.txt2.HiCPro_step1_PDX.qsub- First step of HiC-Pro pipeline3.HiCPro_step2_PDX.qsub- Second step of HiC-Pro pipelineinputfiles_PDX.txt- List of samples to process. Only "R1" file is necessary
-
5b.runHiCExplorer- scripts to run HiCExplorer pipelinerun_bwamem_R1.sh- aligning R1 readsrun_bwamem_R2.sh- aligning R2 readshicBuildMatrix.sh- run HiCExplorer on the aligned R1 and R2 BAM files
-
5c.runJuicer- scripts to run Juicer pipelinesetup_folders.sh- setup folders and softlink raw data for Juicer processingrun_juicer.sh- run Juicer on the linked data
-
5d.collectResults- scripts to collect and merge QC metrics from different pipelines11.hicpro_collectStats.sh- HiC-Pro stats12.hicpro_mergeStats.sh21.hicexplorer_collectStats.sh- HiCExplorer stats22.hicexplorer_mergeStats.sh31.juicer_collectStats.sh- Juicer stats32.juicer_mergeStats.sh
-
6.convertFileFormats- scripts to convert file formats to.h5hicpro_to_h5.sh- convert HiC-Pro results to.h5formatjuicer_to_h5- scripts for step-wise conversion of Juicer results to.h5format1.juicer_to_txt.sh- convert.hicformat to text format2.juicer2hicpro.R- R script to convert text format to matrix. Called by3.convert_to_hicpro.qsub3.convert_to_hicpro.sh- convert text format to matrix4.hicpro_to_h5.sh- convert and KR normalize matrix to.h5format
-
7.hicPlotDistVsCounts.sh- estimate distance-dependent decay -
8.TADs.sh- TAD calling usinghicFindTADs