Skip to content

Does providing a number of expected donors force vireo to find that many donors? #118

@LilianaLucca

Description

@LilianaLucca

Dear vireo people,

thanks so much for creating such a useful tool!
I am working on a large single-cell RNAseq (10x 5 prime) dataset of human donors for whom I do not have a reference genotype. The samples are batched two-by-two, mixing a biological male and a biological female, so that by combining the expression of some sexually dimorphic genes (such as RPS4Y1 and XIST) with the donor1 vs donor0 vireo attribution, we can figure out who is who.

This approach works fairly well, but in about 10% of our 100+ batches, we are presented with cells that appear to be only male or only female, yet attributed to two different individuals by vireo. I am trying to discriminate between two possibilities:

  1. that for some reason the emulsion contains only cells from a single donor, but vireo tries to attribute them to two individuals because it expects there to be two individuals
  2. that there has been some clerical error in annotating the biological sex of the donors, and we do in fact have two individuals of the same sex in the emulsion.

This is the command that I have run:

cellsnp-lite -s ${input_dir}/$batch/outs/per_sample_outs/$batch/count/sample_alignments.bam
-b ${input_dir}/$batch/outs/per_sample_outs/$batch/count/sample_filtered_feature_bc_matrix/barcodes.tsv.gz
-O ${cellsnp_lite_output_dir}/$batch/
-R ${ref_dir}/genome1K.phase3.SNP_AF5e2.chr1toX.hg38.vcf
-p 5
--minMAF 0.05
--minCOUNT 20
--gzip --genotype
--nproc 5

vireo -c ${cellsnp_lite_output_dir}/$batch/
-N 2
-o ${vireo_output_dir}/$batch/

Any idea on how I could figure out where is the truth?

Many thanks!

Liliana

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions