Hi,
First of all, thank you for this very useful tool.
I seem to have encountered an issue with the -i option. I ran intronIC on human annotations (EnSEMBL release 110 GTF, GRCh38), both with and without this option enabled, but obtained identical results in both cases.
Below are the commands I used, along with the filtering section of the logs. From what I can see, introns located in alternative isoforms still appear to be filtered out even when using the -i option.
intronIC classify -g /mnt/data/arabec/refs/GRCh38/Homo_sapiens.GRCh38.dna.toplevel.fa -a /mnt/data/arabec/refs/GRCh38/Homo_sapiens.GRCh38.110.gtf -n Homo_sapiens -p 4
Total genes: 62,754, introns generated: 1,396,783
Intron Filtering Summary:
┌────────────────────────────┬────────────┬────────────┐
│ Category │ Included │ Excluded │
├────────────────────────────┼────────────┼────────────┤
│ Duplicates │ 0 │ 994,237 │
│ Too short │ 0 │ 99 │
│ Ambiguous bases │ 0 │ 0 │
│ Non-canonical │ 4,522 │ 0 │
│ Overlapping │ 0 │ 0 │
│ Alternative isoform │ 0 │ 171,225 │
├────────────────────────────┼────────────┼────────────┤
│ Total excluded │ │ 1,165,561 │
│ Retained for scoring │ │ 230,655 │
└────────────────────────────┴────────────┴────────────┘
intronIC classify -g /mnt/data/arabec/refs/GRCh38/Homo_sapiens.GRCh38.dna.toplevel.fa -a /mnt/data/arabec/refs/GRCh38/Homo_sapiens.GRCh38.110.gtf -n Homo_sapiens -i -p 4
Total genes: 62,754, introns generated: 1,396,783
Intron Filtering Summary:
┌────────────────────────────┬────────────┬────────────┐
│ Category │ Included │ Excluded │
├────────────────────────────┼────────────┼────────────┤
│ Duplicates │ 0 │ 994,237 │
│ Too short │ 0 │ 99 │
│ Ambiguous bases │ 0 │ 0 │
│ Non-canonical │ 4,522 │ 0 │
│ Overlapping │ 0 │ 0 │
│ Alternative isoform │ 0 │ 171,225 │
├────────────────────────────┼────────────┼────────────┤
│ Total excluded │ │ 1,165,561 │
│ Retained for scoring │ │ 230,655 │
└────────────────────────────┴────────────┴────────────┘
Could you please have a look at this behavior?
Thank you very much for your help.
Best regards,
Alexia
Conda environment details:
intronIC version: 2.4.2
Python version: 3.11.15
Hi,
First of all, thank you for this very useful tool.
I seem to have encountered an issue with the
-ioption. I ran intronIC on human annotations (EnSEMBL release 110 GTF, GRCh38), both with and without this option enabled, but obtained identical results in both cases.Below are the commands I used, along with the filtering section of the logs. From what I can see, introns located in alternative isoforms still appear to be filtered out even when using the
-ioption.intronIC classify -g /mnt/data/arabec/refs/GRCh38/Homo_sapiens.GRCh38.dna.toplevel.fa -a /mnt/data/arabec/refs/GRCh38/Homo_sapiens.GRCh38.110.gtf -n Homo_sapiens -p 4Total genes: 62,754, introns generated: 1,396,783
Intron Filtering Summary:
┌────────────────────────────┬────────────┬────────────┐
│ Category │ Included │ Excluded │
├────────────────────────────┼────────────┼────────────┤
│ Duplicates │ 0 │ 994,237 │
│ Too short │ 0 │ 99 │
│ Ambiguous bases │ 0 │ 0 │
│ Non-canonical │ 4,522 │ 0 │
│ Overlapping │ 0 │ 0 │
│ Alternative isoform │ 0 │ 171,225 │
├────────────────────────────┼────────────┼────────────┤
│ Total excluded │ │ 1,165,561 │
│ Retained for scoring │ │ 230,655 │
└────────────────────────────┴────────────┴────────────┘
intronIC classify -g /mnt/data/arabec/refs/GRCh38/Homo_sapiens.GRCh38.dna.toplevel.fa -a /mnt/data/arabec/refs/GRCh38/Homo_sapiens.GRCh38.110.gtf -n Homo_sapiens -i -p 4Total genes: 62,754, introns generated: 1,396,783
Intron Filtering Summary:
┌────────────────────────────┬────────────┬────────────┐
│ Category │ Included │ Excluded │
├────────────────────────────┼────────────┼────────────┤
│ Duplicates │ 0 │ 994,237 │
│ Too short │ 0 │ 99 │
│ Ambiguous bases │ 0 │ 0 │
│ Non-canonical │ 4,522 │ 0 │
│ Overlapping │ 0 │ 0 │
│ Alternative isoform │ 0 │ 171,225 │
├────────────────────────────┼────────────┼────────────┤
│ Total excluded │ │ 1,165,561 │
│ Retained for scoring │ │ 230,655 │
└────────────────────────────┴────────────┴────────────┘
Could you please have a look at this behavior?
Thank you very much for your help.
Best regards,
Alexia
Conda environment details:
intronIC version: 2.4.2
Python version: 3.11.15