Skip to content

VCFfiles: seperated or merged #105

@hsymoon

Description

@hsymoon

Hi developers,
Thanks for developing this helpful tool. I encountered two questions when I used vireo.
As I have different time experiment data from 10x scrna-seq (batch1 for D1 , batch2 for D2 ,...).
Cellsnp-lite was used to call common SNP for each batch, followed by vireo to demultiplex .
Here we have a donor.vcf.gz .

Q1:I wonder which one could get more resonble result:
1)Seperated :
CELL_FILE : .cellSNP.cells.vcf.gz for each batch from cellsnp-lite (like batch1.cellSNP.cells.vcf.gz )
DONOR_FILE: bcftools view donor.vcf.gz -R batch1.cellSNP.cells.vcf.gz -Oz -o donors.sub_Batch1.vcf.gz
~/miniconda3/bin/vireo -c batch1.cellSNP.cells.vcf.gz -d donors.sub_Batch1.vcf.gz -o ${re} -N $n --randSeed 2

  1. Merged:
    CELL_FILE : "bcftools merge" was used to merge cellSNP.cells.vcf.gz for each batch from cellsnp-lite ,generated all.cellSNP.cells.vcf.gz.
    DONOR_FILE: bcftools view donor.vcf.gz -R all.cellSNP.cells.vcf.gz -Oz -o donors.sub_All.vcf.gz
    ~/miniconda3/bin/vireo -c all.cellSNP.cells.vcf.gz -d donors.sub_All.vcf.gz -o ${re} -N $n --randSeed 2

As I tried ,even though --randSeed was set to the same, cells in batch1 was demultiplexed to different donors in Seperated or Merged.
Could you tell me which one could get more resonble result and why .Many thanks.

Q2: Mode4 in vireo was applicable when with genotype but not confident (or only for subset of SNPs).
The command is : vireo -c $CELL_DATA -d $DONOR_GT_FILE -o $OUT_DIR --forceLearnGT.
Could you give some examples for this mode?Sorry for my questions.

   Thank you  very much.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions