Graph Data Generation and Load into Neo4J

Pre-requisites

Clone the repository

 git clone https://github.com/nmdp-bioinformatics/grimm`
 cd grimm

Setup Python3 virtual environment Make sure virtualenv is installed.
```
pip3 install virtualenv
```
Create Virtual Environment
```
virtualenv -p python3 venv
source venv/bin/activate
```
Install pandas library
```
pip3 install pandas
```
Download and prepare wmda data. Python script downloads reference wmda data and untars it in wmda directory
```
 cd graph_generator/data
 python wmda_download.py
```
Generate nodes/edges/toplinks from the reference wmda data. The freqs file is converted to HPF format first.
```
 cd ..
 python wmda_to_hpf_csv.py
 python generate_neo4j_wmda_hpf.py
```

Generated nodes and edges files are in the output/csv directory.

 output
 └── csv
     ├── edges.csv
     ├── nodes.csv
     └── top_links.csv

Load the nodes/graph into Neo4J database
```
 ./bulk_load_neo4j.sh
```
Login to Neo4j http://localhost:7474/
- Default username/password is neo4j/ontological