Conversation
|
Testing on 2/26 locally ( |
|
@StevenCannon-USDA FWIW from a fresh clone on the branch I see all existing components plus the allele search one. @That-Thing Regarding the TODO in the element code (glyma.Wm82.gnm4.Gm01 etc placeholders) could this be made customizable, such as part of the "collections"? In case we want to use this for something other than soybean in future. |
There was a problem hiding this comment.
Testing now, after (re?)running npm run build, the content comes up fine for me.
This all looks good to me - though I think I would like two changes:
- Change the text "Ref / Alt only" under 1.5 to "All strains". The behavior has changed vs. the implementation at https://www.soybase.org/tools/ . I think the new behavior is good, but that first radio button does give all strains, so that should be so-labeled.
- I think we should limit the query region size, in both the "Identifier" and "Region" sections. I suggest 1000000. That probably then calls for a corresponding label: "Flanking Region (1 million max)"
… 1 million, ch ange wording of ref / alt.
|
@matthewwiese response: |
|
@That-Thing That's interesting, I think the Pysam's Whereas e.g. https://data.soybase.org/Glycine/max/diversity/Wm82.gnm2.div.Valliyodan_Brown_2021/USB481-25Kshared50Kpos.vcf.gz.tbi is ~170 KB. My guess is the data is corrupt or something? @adf-ncgr @StevenCannon-USDA any ideas? |
|
sounds plausible. When I run bcftools view I don't get an error but indexed retrieval seems to yield nothing but the headers. I'll see if I can fix the version in the datastore but @StevenCannon-USDA will then have to sync it to where the web-hosting takes place. |
|
OK, there were several files that seemed to have a similar problem (different genome versions of the same Song_Hyten_2015 genotype set). I've fixed them on ceres but I think the problem @That-Thing reported will remain until @StevenCannon-USDA can sync the new files to the web-hosting server. |
|
I have schlepped these over to c2s2. This hasn't seemed to fix the problem for me -- though the explanation makes sense. Adding @weihuang12 , who has been working on these files. Is the problem that some of the indexes are out of date with the bgzipped VCFs? |
|
I'm not really sure what happened. As @matthewwiese said the tbi files that were there before I re-ran the indexing seemed to be only ~70bytes in size, although it looked like they had been updated recently. Now if I run a bcftools view locally on ceres the indexing seems to work OK, but I agree that the web-hosted one is still mis-behaving; not sure if some sort of caching could be at play. Can you verify that the tbi files on c2s2 are appropriately sized? (they are hidden through the h5ai interface so I can't tell) |
|
actually nevermind about validating the file size, I just curled it and it seems OK. Puzzled as to what may be happening. |
|
Actually, I think it was a caching issue, at least in the way I was testing it. doing something like: |
Web component version of the SoyBase allele search tool requested by @StevenCannon-USDA.