-
Notifications
You must be signed in to change notification settings - Fork 29
Description
Dear vireo people,
thanks so much for creating such a useful tool!
I am working on a large single-cell RNAseq (10x 5 prime) dataset of human donors for whom I do not have a reference genotype. The samples are batched two-by-two, mixing a biological male and a biological female, so that by combining the expression of some sexually dimorphic genes (such as RPS4Y1 and XIST) with the donor1 vs donor0 vireo attribution, we can figure out who is who.
This approach works fairly well, but in about 10% of our 100+ batches, we are presented with cells that appear to be only male or only female, yet attributed to two different individuals by vireo. I am trying to discriminate between two possibilities:
- that for some reason the emulsion contains only cells from a single donor, but vireo tries to attribute them to two individuals because it expects there to be two individuals
- that there has been some clerical error in annotating the biological sex of the donors, and we do in fact have two individuals of the same sex in the emulsion.
This is the command that I have run:
cellsnp-lite -s ${input_dir}/$batch/outs/per_sample_outs/$batch/count/sample_alignments.bam
-b ${input_dir}/$batch/outs/per_sample_outs/$batch/count/sample_filtered_feature_bc_matrix/barcodes.tsv.gz
-O ${cellsnp_lite_output_dir}/$batch/
-R ${ref_dir}/genome1K.phase3.SNP_AF5e2.chr1toX.hg38.vcf
-p 5
--minMAF 0.05
--minCOUNT 20
--gzip --genotype
--nproc 5
vireo -c ${cellsnp_lite_output_dir}/$batch/
-N 2
-o ${vireo_output_dir}/$batch/
Any idea on how I could figure out where is the truth?
Many thanks!
Liliana