Skip to content

Commit b59d163

Browse files
author
Mitchell R. Vollger
committed
.
1 parent 1005020 commit b59d163

File tree

1 file changed

+10
-10
lines changed

1 file changed

+10
-10
lines changed

src/quick-start.md

Lines changed: 10 additions & 10 deletions
Original file line numberDiff line numberDiff line change
@@ -19,7 +19,7 @@ For using Fiber-seq data it is important to **check with your sequencing provide
1919

2020
</div>
2121

22-
### Predict m6A and infer nucleosomes
22+
## Predict m6A and infer nucleosomes
2323

2424
#### Your PacBio data uses the SPQR chemistry or latter
2525

@@ -43,15 +43,15 @@ This will both make m6A calls and identify [nucleosomes](glossary.md#inferred-nu
4343

4444
**Note**, the input **CCS bam must have average kinetics** to be able to call m6A.
4545

46-
### Alignment and phasing
46+
## Alignment and phasing
4747

4848
We recommend aligning with [pbmm2](https://github.com/PacificBiosciences/pbmm2) and phasing with [HiPhase](https://github.com/PacificBiosciences/hiphase). Please see their respective documentation for more information.
4949

5050
Alternatively, we have written a [snakemake pipeline](https://github.com/mrvollger/k-mer-variant-phasing) to align and phase Fiber-seq data; however, this pipeline is not officially supported outside of our lab at this time.
5151

5252
After this point, you will have a Fiber-seq BAM file that is compatible with all the [extraction](fibertools/extracting/extracting.md) commands in `fibertools`.
5353

54-
### Validate your Fiber-seq BAM file
54+
## Validate your Fiber-seq BAM file
5555

5656
We have a quick validation tool which can test your BAM file for the desired Fiber-seq features.
5757
At this point you should have m6A calls, nucleosome calls, and aligned reads (phasing is optional).
@@ -67,7 +67,7 @@ Fraction phased: 85.10%
6767
Fraction with kinetics: 0.00%
6868
```
6969

70-
### Fiber-seq peaks and UCSC browser tracks (FIRE)
70+
## Fiber-seq peaks and UCSC browser tracks (FIRE)
7171

7272
Once you have a phased bam file, you can identify [Fiber-seq inferred regulatory elements (FIREs)](glossary.md#fires) to call Fiber-seq peaks and make a UCSC trackHub.
7373

@@ -76,33 +76,33 @@ You can find more details in on installing and running FIRE here:
7676

7777
# Fiber-seq starting with Oxford Nanopore Technologies (ONT)
7878

79-
### Predict m6A
79+
## Predict m6A
8080

8181
**ft predict-m6a** does not include a model for ONT data; however, you can use software, such as [Dorado](https://github.com/nanoporetech/dorado), to add CpG and m6A to your ONT BAM file.
8282

83-
### Alignment and phasing
83+
## Alignment and phasing
8484

8585
You can either use [Dorado](https://github.com/nanoporetech/dorado) to align your ONT data or use a tool like [minimap2](https://github.com/lh3/minimap2) to align your data. If you do use `minimap2` be sure to include the flag `-y` to preserve the CpG and m6A information in the output BAM file.
8686

8787
If you do want to do phasing we recommend using [WhatsHap](https://whatshap.readthedocs.io/en/latest/) for phasing ONT data. Please see their documentation for more information.
8888

89-
### Filtering m6A calls
89+
## Filtering m6A calls
9090

9191
If you do use Dorado you must then filter the m6A calls with [modkit](https://github.com/nanoporetech/modkit) using a tenth percentile cutoff for each flow-cell independently. This is the only way to get good m6A calls in our experience, and using any hard ML threshold will not hold between flow-cells. Here is an example command:
9292

9393
```bash
9494
modkit call-mods -t 8 -p 0.1 input.dorado.bam filtered.dorado.bam
9595
```
9696

97-
### Infer nucleosomes and MSPs
97+
## Infer nucleosomes and MSPs
9898

9999
Once you have CpG and m6A information in your filtered ONT BAM file, you can use [`ft add-nucleosomes`](fibertools/help.md#ft-add-nucleosomes) to infer nucleosomes and MSPs.
100100

101101
```bash
102102
ft add-nucleosomes filtered.dorado.bam output.bam
103103
```
104104

105-
### A full example for processing ONT data
105+
## A full example for processing ONT data
106106

107107
Here is an example summary of the commands to process ONT data assuming you have already completed 6mA and CpG calling with `dorado`:
108108

@@ -124,7 +124,7 @@ modkit call-mods -p 0.1 tmp.ONT.fiberseq.bam - \
124124

125125
After this point, you will have a Fiber-seq BAM file that is compatible with all the [extraction](fibertools/extracting/extracting.md) commands in `fibertools`.
126126

127-
### Fiber-seq peaks and UCSC browser tracks (FIRE)
127+
## Fiber-seq peaks and UCSC browser tracks (FIRE)
128128

129129
We have had good success in applying the [FIRE pipeline](https://github.com/fiberseq/FIRE) to ONT data. However, this does require a heuristic in FIRE that must be enabled. To enable this add `ont: true` to your `config.yaml` file when setting up your FIRE run.
130130
You can find more details in on installing and running FIRE here:

0 commit comments

Comments
 (0)