Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[INFRASTRUCTURE] : Create infrastructure to support the ontology browsing tool #19

Open
gtsueng opened this issue Oct 31, 2024 · 1 comment
Assignees

Comments

@gtsueng
Copy link

gtsueng commented Oct 31, 2024

Name

Create infrastructure to support the ontology browsing tool

Description

The ontology/hierarchical browsing tool requires efficient retrieval of lineage information for the display of the lineage tree, but also to enable the aggregation of counts of records that match a term and the counts of records from each child term. The infrastructure should be flexible enough to work with different ontologies (as needed).

User studies have been conducted which indicate that users do like the suggestion to display counts of relevant records for each search term, with a preference for seeing both the count of the records for that term, and the count of the records for that term plus all child terms.

Other ontologies currently in use in the Discovery Portal

  • species, infectiousAgent: NCBI Taxonomy
  • topicCategory: EDAM Topics Ontology (latest, unstable version)
  • healthCondition: MONDO, HPO, DO, NCIT

In the pipeline:

  • measurementTechnique: Subset of MMO, CHMO, BAO, OBI, EFO, EDAM Topics, NCIT
  • variableMeasured: TBD

WBS Task

https://github.com/NIAID-Data-Ecosystem/nde-roadmap/issues/20
https://github.com/NIAID-Data-Ecosystem/nde-roadmap/issues/4

Related Issues

@gtsueng
Copy link
Author

gtsueng commented Jan 9, 2025

It was noticed in November that there are discrepancies between returns using the OLS API and the NCBI Taxonomy browser.

In NCBI, 3366610 is a child of 2157:
https://www.ncbi.nlm.nih.gov/Taxonomy/Browser/wwwtax.cgi?mode=Tree&id=3366610&lvl=3&lin=f&keep=1&srchmode=1&unlock
In OLS: 3366610 is not in the children list of 2157:
https://www.ebi.ac.uk/ols4/api/ontologies/ncbitaxon/terms/http%253A%252F%252Fpurl.obolibrary.org%252Fobo%252FNCBITaxon_2157/children?&lang=en

Upon investigation, it appears that the discrepancy stems from a nomenclature status dispute between NCBI Taxon and GTDB https://en.wikipedia.org/wiki/Euryarchaeota#Nomenclatural_status

To resolve this discrepancy, we will update and use the Biothings Taxonomy API to perform organism queries.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants