-
Notifications
You must be signed in to change notification settings - Fork 29
Description
Hi - I have two questions/issues that might be related, which is why I post them together here! And first of all, thanks for such a cool tool!
My first issue is that when plotting prob_max vs prob_doublet as shown below, there is a fraction of cells that have quite low doublet probabilities, while also having a near-zero prob_max (visualized in the orange circle). From my understanding of the calculation of prob_max and prob_doublet, this shouldn't be possible with only 8 donors as used here - since 8 times ~0 is of course not enough to account for the missing probability (i.e. (1-prob_doublet)). But perhaps the 8 single-probabilities and 64-doublet-pair probabilities does not need to add up to 1?
Secondly, and perhaps related, I looked into the prob_doublet.tsv.gz table as seen in the image below (I have obscurred column names for privacy-reasons). Here, it looks like there are many columns with only missing data (comb29 to comb56) and even columns missing, as seen from the total column count of 57, which should be 1+8^2 = 65. Do you know why this might be the case?

