New pipeline: nf-core/nf-detect-seq

### Pipeline title/name

detect-seq

### Keywords

genomics, base editor, crispr, off-targets

### What is it about?

A pipeline to process Detect-seq sequencing data (generated via dU chemical labeling and biotin pulldown) to identify genome-wide off-target editing sites by programmable base editors.


### Please provide a schematic diagram of the proposed pipeline

![Image](https://github.com/user-attachments/assets/0ccc581f-a051-46dc-b16f-1caf6002df3b)

### What would a minimal first release of this pipeline include?

Adapter trimming (cutadapt), genomic alignment (HISAT-3N), recovery of low-quality and unmapped reads via samtools. Custom python can be used as-is from the original [publication](https://pubmed.ncbi.nlm.nih.gov/34099937/) and [pipieline](https://github.com/menghaowei/Detect-seq/tree/master/src/detect_seq) for the MVP.

Plots are out of scope for the minimal first release. 

### I confirm my proposed pipeline will follow nf-core guidelines. Most importantly, my pipeline will:

- [x] be built with Nextflow.
- [x] pass nf-core lint tests and use standardized parameters.
- [x] be community-owned and developed within the nf-core organization.
- [x] open source under the MIT license with proper credits and acknowledgments.
- [x] have a descriptive, all lowercase, and without punctuation name.
- [x] use the nf-core pipeline template and predominantly use official nf-core modules.
- [x] focus on a specific data/analysis type with appropriate scope.
- [x] have properly maintained documentation.
- [x] be bundled using versioned Docker/Singularity containers.

### Why do we need a new pipeline?

While a [Snakemake pipeline](https://github.com/menghaowei/Detect-seq?tab=readme-ov-file) exists to support this type of analysis, there is no Nextflow equivalent as far as I know. For a tool like Detect-seq that's relevant to therapeutic base editing, having a Nextflow version could significantly expand its user base in pharma and clinical research environments where Nextflow is often the institutional standard. 

Several additions to the original pipeline can be made with standard nf-core building blocks (e.g., MultiQC, regression testing with nf-test) to facilitate reproducibility, quality control assessment, and iteration.  

### Who would be interested?

CRISPR/base editing researchers: Anyone developing or benchmarking new programmable base editors (e.g., CBE variants like BE3, BE4, ABE variants) could use this pipeline to characterize their editor's off-target profile before publication or therapeutic application.

Detect-seq data and analysis could potentially be used as a comparator alongside other approaches like CIRCLE-seq, GUIDE-seq, or CRISPResso2.

### What has been done so far

I have created: 1) a [HISAT-3N ALIGN nf-core module](https://github.com/nf-core/modules/pull/9718), a [HISAT-3N BUILD nf-core module](https://github.com/nf-core/modules/pull/9668), and  3) a [samclip nf-core module](https://github.com/nf-core/modules/pull/8999)

### URL to existing work (if applicable)

_No response_

### Are there any similar existing nf-core pipelines?

_No response_

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

New pipeline: nf-core/nf-detect-seq #129

Pipeline title/name

Keywords

What is it about?

Please provide a schematic diagram of the proposed pipeline

What would a minimal first release of this pipeline include?

I confirm my proposed pipeline will follow nf-core guidelines. Most importantly, my pipeline will:

Why do we need a new pipeline?

Who would be interested?

What has been done so far

URL to existing work (if applicable)

Are there any similar existing nf-core pipelines?

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

New pipeline: nf-core/nf-detect-seq #129

Description

Pipeline title/name

Keywords

What is it about?

Please provide a schematic diagram of the proposed pipeline

What would a minimal first release of this pipeline include?

I confirm my proposed pipeline will follow nf-core guidelines. Most importantly, my pipeline will:

Why do we need a new pipeline?

Who would be interested?

What has been done so far

URL to existing work (if applicable)

Are there any similar existing nf-core pipelines?

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions