Skip to content

Some reference entries show up as duplicates #135

@DeadlineWasYesterday

Description

@DeadlineWasYesterday

Hi Ben,

First off, really appreciate your work and the available teaching material.

I have been getting this strange problem when trying to work with SAM files for downstream steps. Samtools view errors out saying there are duplicated entries in the SAM header.

Here are the diagnostics I ran:

  1. I double checked my reference to make sure all the headers were unique.
  2. I built the index and ran the alignment three times.
  3. I checked to see if I had reference entries with different headers, but the same sequence. I did find a few, but these were not the headers showing up as duplicates.
  4. In the SAM file, it appears that all my reads are also being mapped twice to these duplicated headers.

image

image

image

These are the commands I ran to build the index and do the alignment:

bowtie-build --threads 40 -o 2 m7b.fa m7b
 ~/bin/bowtie -v 0 -y --norc -a -p 250 -f u36.fa -x m7b -S u36ob.sam

Surely, this is not normal?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions