Replies: 5 comments
-
|
Although I don't think this feature would be used by most current Gramps Web users, I personally find it super interesting, so if there are open source libraries we can leverage, why not? Some questions:
I think the VCF files would be stored as media objects in Gramps, so the first step in implementing such a feature would be to decide what (read-only) REST API endpoints we could set up that take the raw data (and whatever freely available external information we need) and compute/derive some JSON data from it that we could then use for visualization down the line. Could you sketch something for this (Python-only) part? |
Beta Was this translation helpful? Give feedback.
-
I would simply create a new VCF file with the header copied from a real one and paste some SNPs into it just for testing.
I personally like:
Other larger providers are:
I can sketch some concrete examples in the near future. I think it would be useful to use Ensembl VEP to annotate the VCF. From the command line, it would only take the following to annotate the VCF with two SNP databases (ChatGPT example): vep \
-i input.vcf.gz \
-o annotated.vcf.gz \
--vcf --compress_output bgzip \
--cache --offline --assembly GRCh38 \
--af_gnomad \
--custom file=clinvar.vcf.gz,short_name=ClinVar,format=vcf,type=exact,coords=0,fields=CLNSIG%CLNREVSTAT%CLNDNAfter that, you can find information such as allele frequencies and pathogenicity in the VCF file. An alternative to Ensembl VEP would be bcftools. One important thing to check prior to annotation is whether the VCF is based on the human reference genome GRCh38 or GRCh37. |
Beta Was this translation helpful? Give feedback.
-
|
As I am currently in the exploration phase of this idea, I will paste some repositories here that are worth looking into:
-
Another thing to consider is that these computations require large amounts of data such as reference genomes and possible compute. |
Beta Was this translation helpful? Give feedback.
-
|
Another repo to consider is ADMIXTOOLS. Here are some descriptions of the tools it has to offer (AI-generated). smartpca qp3Pop (outgroup f3) qpDstat qpAdm convertf Example: |
Beta Was this translation helpful? Give feedback.
-
Beta Was this translation helpful? Give feedback.



Uh oh!
There was an error while loading. Please reload this page.
-
I would like to propose a feature that allows users to upload their Whole Genome Sequencing (WGS) data in VCF format and perform basic, understandable analysis without requiring deep DNA or bioinformatics knowledge.
The following concept first describes a basic DNA annotation but can be expanded into a comprehensive ancestry report.
Feature Idea
The idea is to enable users to upload their VCF file and then:
Possible Data Sources
The following open-source databases could be used:
Optionally:
ClinVar – basic health-related variant annotations
Motivation
One reason this idea is important to me is that there is currently no open-source, self-hostable solution aimed at non-DNA experts that allows simple analysis of personal WGS data.
Most existing tools are:
This feature could bridge the gap between genealogy, ancestry, and basic genomic exploration.
Technical Feasibility
From a technical perspective:
Scope Consideration
I am aware that DNA analysis may be partially outside classic genealogy.
However, a limited, well-defined scope (basic matching and visualization) could add value without overcomplicating the project.
Thank you for considering this idea.
Beta Was this translation helpful? Give feedback.
All reactions