-
Notifications
You must be signed in to change notification settings - Fork 27
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Create a visualization of mutations relative to the protein sequence, its domains, mutation hotspots, etc. #66
Comments
At some point I can definitely create a d3 visualization that can generate these in the client, I just need to know what data I will be getting to generate the figure and how it corresponds to the MD Anderson example. |
Yeah, I think we should keep this on the wishlist for now. There is actually a somewhat complex mix of data being represented here. From left to right:
The raw material for most of the data needed could be obtained from the Ensembl API (or similar). The variant landscape could come from Cosmic, the ICGC data portal, TCGA, etc. Zach has been working on a similar visualization concept in R using R modules (Biomart?) to get the raw data and ggplot2 to create the image. If we decide to tackle this at some point we should probably get together a group and brainstorm on some of the specifics. |
The GenVisR (Zach's R modules) protein visualization continues to evolve. https://github.com/griffithlab/GenVisR Alternatively, in many cases we have all the fields to generate a lolliplot via cBioPortal. http://www.cbioportal.org/mutation_mapper.jsp The source code is also available on GitHub under a GNU Affero General Public License V3 https://github.com/cBioPortal/cbioportal |
Example of compressed introns vs regular in GenVisR https://bioconductor.org/packages/release/bioc/vignettes/GenVisR/inst/doc/Intro.html#133_gencov_(sequence_coverage_graphic) |
Consider embedding protein paint. https://docs.google.com/document/d/1KNx4pVCKd4wgoHI4pjknBRTLrzYp6AL_D-j6MjcQSvQ/edit |
From discussions with members of the Melbourne Genomics Health Alliance, this is a priority. Partially because we don't have a nice way to dig through variants that we do have. Aliases only work on the quick search, browse and advanced search pages. On a gene page, unless you are searching for an exact variant, you can't get there. |
Everyone loves the COSMIC way of filtering...there is a histogram of events. That sort of thing would work for us as well. I love the lolliplot idea but I don't know about implementation. Shahil's visual (lolliplot that replaced variant occurrences with evidence count) was awesome for this sort of thing. But lolliplots don't really work that well for gene-level events. |
We have talked about the idea of collaborating with the ProteinPaint team at St. Judes to resolve this issue. Its not done yet, but it could be a way to resolve this... |
Since we have mutations that are point sites or are more generally in certain domains or other large features of the protein. For example, MD Anderson creates these for each gene and they are customized when you click on each mutation:

The text was updated successfully, but these errors were encountered: