Skip to content

Non-alphabet characters aren't counted as gaps when computing sequence coverage #268

@nikkithadani

Description

@nikkithadani

I'm not sure if this is something we want to fix or just something we want to note (can be a real issue for some viral sequences) but sequences with large numbers of Xs or other non-residue characters are not counted as gaps when applying the sequence coverage threshold.

keep_seqs = (1 - ali.count("-", axis="seq")) >= min_cov

Metadata

Metadata

Assignees

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions