Skip to content

-i option seems to have no effect on filtering [intronIC v2.4.2] #15

@arabec

Description

@arabec

Hi,

First of all, thank you for this very useful tool.

I seem to have encountered an issue with the -i option. I ran intronIC on human annotations (EnSEMBL release 110 GTF, GRCh38), both with and without this option enabled, but obtained identical results in both cases.

Below are the commands I used, along with the filtering section of the logs. From what I can see, introns located in alternative isoforms still appear to be filtered out even when using the -i option.

intronIC classify -g /mnt/data/arabec/refs/GRCh38/Homo_sapiens.GRCh38.dna.toplevel.fa -a /mnt/data/arabec/refs/GRCh38/Homo_sapiens.GRCh38.110.gtf -n Homo_sapiens -p 4

Total genes: 62,754, introns generated: 1,396,783
Intron Filtering Summary:
┌────────────────────────────┬────────────┬────────────┐
│ Category │ Included │ Excluded │
├────────────────────────────┼────────────┼────────────┤
│ Duplicates │ 0 │ 994,237 │
│ Too short │ 0 │ 99 │
│ Ambiguous bases │ 0 │ 0 │
│ Non-canonical │ 4,522 │ 0 │
│ Overlapping │ 0 │ 0 │
│ Alternative isoform │ 0 │ 171,225 │
├────────────────────────────┼────────────┼────────────┤
│ Total excluded │ │ 1,165,561 │
│ Retained for scoring │ │ 230,655 │
└────────────────────────────┴────────────┴────────────┘

intronIC classify -g /mnt/data/arabec/refs/GRCh38/Homo_sapiens.GRCh38.dna.toplevel.fa -a /mnt/data/arabec/refs/GRCh38/Homo_sapiens.GRCh38.110.gtf -n Homo_sapiens -i -p 4

Total genes: 62,754, introns generated: 1,396,783
Intron Filtering Summary:
┌────────────────────────────┬────────────┬────────────┐
│ Category │ Included │ Excluded │
├────────────────────────────┼────────────┼────────────┤
│ Duplicates │ 0 │ 994,237 │
│ Too short │ 0 │ 99 │
│ Ambiguous bases │ 0 │ 0 │
│ Non-canonical │ 4,522 │ 0 │
│ Overlapping │ 0 │ 0 │
│ Alternative isoform │ 0 │ 171,225 │
├────────────────────────────┼────────────┼────────────┤
│ Total excluded │ │ 1,165,561 │
│ Retained for scoring │ │ 230,655 │
└────────────────────────────┴────────────┴────────────┘

Could you please have a look at this behavior?

Thank you very much for your help.

Best regards,
Alexia

Conda environment details:
intronIC version: 2.4.2
Python version: 3.11.15

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions