-
Notifications
You must be signed in to change notification settings - Fork 74
Description
Hello,
I've run Centrifuge on my data, and it generated two output files: classification.txt and report.tsv. I then created a kreport.tsv file from the classification.txt file. However, I've noticed a couple of issues:
Data Discrepancy: The reads in the kreport.tsv file are reduced compared to my raw reads, which raises concerns about the accuracy and completeness of the taxonomic assignments.
Missing Sequence IDs: The taxon information in the output files does not include sequence IDs, which prevents me from generating a repseq.fa file necessary for further functional annotation.
Could you please provide guidance on how to include sequence IDs in the output and how to proceed with generating the necessary files for functional annotation?
Thank you for your assistance.