You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: src/quick-start.md
+10-10Lines changed: 10 additions & 10 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -19,7 +19,7 @@ For using Fiber-seq data it is important to **check with your sequencing provide
19
19
20
20
</div>
21
21
22
-
###Predict m6A and infer nucleosomes
22
+
## Predict m6A and infer nucleosomes
23
23
24
24
#### Your PacBio data uses the SPQR chemistry or latter
25
25
@@ -43,15 +43,15 @@ This will both make m6A calls and identify [nucleosomes](glossary.md#inferred-nu
43
43
44
44
**Note**, the input **CCS bam must have average kinetics** to be able to call m6A.
45
45
46
-
###Alignment and phasing
46
+
## Alignment and phasing
47
47
48
48
We recommend aligning with [pbmm2](https://github.com/PacificBiosciences/pbmm2) and phasing with [HiPhase](https://github.com/PacificBiosciences/hiphase). Please see their respective documentation for more information.
49
49
50
50
Alternatively, we have written a [snakemake pipeline](https://github.com/mrvollger/k-mer-variant-phasing) to align and phase Fiber-seq data; however, this pipeline is not officially supported outside of our lab at this time.
51
51
52
52
After this point, you will have a Fiber-seq BAM file that is compatible with all the [extraction](fibertools/extracting/extracting.md) commands in `fibertools`.
53
53
54
-
###Validate your Fiber-seq BAM file
54
+
## Validate your Fiber-seq BAM file
55
55
56
56
We have a quick validation tool which can test your BAM file for the desired Fiber-seq features.
57
57
At this point you should have m6A calls, nucleosome calls, and aligned reads (phasing is optional).
@@ -67,7 +67,7 @@ Fraction phased: 85.10%
67
67
Fraction with kinetics: 0.00%
68
68
```
69
69
70
-
###Fiber-seq peaks and UCSC browser tracks (FIRE)
70
+
## Fiber-seq peaks and UCSC browser tracks (FIRE)
71
71
72
72
Once you have a phased bam file, you can identify [Fiber-seq inferred regulatory elements (FIREs)](glossary.md#fires) to call Fiber-seq peaks and make a UCSC trackHub.
73
73
@@ -76,33 +76,33 @@ You can find more details in on installing and running FIRE here:
76
76
77
77
# Fiber-seq starting with Oxford Nanopore Technologies (ONT)
78
78
79
-
###Predict m6A
79
+
## Predict m6A
80
80
81
81
**ft predict-m6a** does not include a model for ONT data; however, you can use software, such as [Dorado](https://github.com/nanoporetech/dorado), to add CpG and m6A to your ONT BAM file.
82
82
83
-
###Alignment and phasing
83
+
## Alignment and phasing
84
84
85
85
You can either use [Dorado](https://github.com/nanoporetech/dorado) to align your ONT data or use a tool like [minimap2](https://github.com/lh3/minimap2) to align your data. If you do use `minimap2` be sure to include the flag `-y` to preserve the CpG and m6A information in the output BAM file.
86
86
87
87
If you do want to do phasing we recommend using [WhatsHap](https://whatshap.readthedocs.io/en/latest/) for phasing ONT data. Please see their documentation for more information.
88
88
89
-
###Filtering m6A calls
89
+
## Filtering m6A calls
90
90
91
91
If you do use Dorado you must then filter the m6A calls with [modkit](https://github.com/nanoporetech/modkit) using a tenth percentile cutoff for each flow-cell independently. This is the only way to get good m6A calls in our experience, and using any hard ML threshold will not hold between flow-cells. Here is an example command:
Once you have CpG and m6A information in your filtered ONT BAM file, you can use [`ft add-nucleosomes`](fibertools/help.md#ft-add-nucleosomes) to infer nucleosomes and MSPs.
100
100
101
101
```bash
102
102
ft add-nucleosomes filtered.dorado.bam output.bam
103
103
```
104
104
105
-
###A full example for processing ONT data
105
+
## A full example for processing ONT data
106
106
107
107
Here is an example summary of the commands to process ONT data assuming you have already completed 6mA and CpG calling with `dorado`:
After this point, you will have a Fiber-seq BAM file that is compatible with all the [extraction](fibertools/extracting/extracting.md) commands in `fibertools`.
126
126
127
-
###Fiber-seq peaks and UCSC browser tracks (FIRE)
127
+
## Fiber-seq peaks and UCSC browser tracks (FIRE)
128
128
129
129
We have had good success in applying the [FIRE pipeline](https://github.com/fiberseq/FIRE) to ONT data. However, this does require a heuristic in FIRE that must be enabled. To enable this add `ont: true` to your `config.yaml` file when setting up your FIRE run.
130
130
You can find more details in on installing and running FIRE here:
0 commit comments