Commit

Add parallelization benchmark
MitraDarja authored Aug 9, 2023
1 parent 52eb5ec commit 439eeab
Showing 1 changed file with 10 additions and 1 deletion.
11 changes: 10 additions & 1 deletion utils/README.md
@@ -1,4 +1,4 @@
`# Evaluation of Needle count
# Evaluation of Needle count

## Differential Expression
Download the sequencing experiments listed in accession.lst and the human transcripts from gencode as a fasta file.
@@ -35,6 +35,15 @@ differential expressed genes from https://doi.org/10.1016/j.dib.2018.03.079.

# Evaluation of Needle

## Parallelization

To compare the two parallelization methods of Needle when computing minimizers, four sequencing experiments were tested.
For the simple version, the following command was used with either `X=1` or `X=4` threads.
```
needle minimiser -k 21 -w 25 -t X SRR1313229.fastq.gz SRR1313228.fastq.gz SRR1313227.fastq.gz SRR1313226.fastq.gz
```
For the RAM-friendly version, the `--ram` flag was added to the previous command.
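
The four configurations described above (1 or 4 threads, each with and without `--ram`) can be enumerated with a small script. This is a minimal sketch, not the authors' benchmark script: the helper name `build_cmd`, the position of `--ram` within the command line, and the dry-run `echo` are assumptions made for illustration. The text does not say how runtime or memory were measured, so the script only prints the commands.

```shell
#!/usr/bin/env bash
# Sketch: list the benchmark commands for both parallelization modes.
# The input files are taken from the command shown above.
FILES="SRR1313229.fastq.gz SRR1313228.fastq.gz SRR1313227.fastq.gz SRR1313226.fastq.gz"

# build_cmd is a hypothetical helper.
# $1 = number of threads, $2 = extra flag ("" for the simple version, "--ram" otherwise).
# Placing --ram right after the thread count is an assumption; the README only says it was added.
build_cmd() {
    # tr -s squeezes the double space left behind when $2 is empty.
    echo "needle minimiser -k 21 -w 25 -t $1 $2 $FILES" | tr -s ' '
}

for threads in 1 4; do
    for mode in "" "--ram"; do
        # Dry run: print the command instead of executing it.
        build_cmd "$threads" "$mode"
    done
done
```

To run a configuration rather than print it, the printed line can be executed directly, for example prefixed with the shell's `time` keyword to obtain wall-clock time.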

## Differential Expression
Download the sequencing experiments of the GEO experiment with the accession number GSE58135 (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE58135).
Then create a Needle index in the following way, assuming all sequencing files are stored in a folder named GSE58135:
