[Parser Fix]: Clone the url field to the field includedInDataCatalog.dataset #234

Open
4 tasks
gtsueng opened this issue Jan 24, 2025 · 0 comments
gtsueng commented Jan 24, 2025

Issue Name

Clone the url field to the field includedInDataCatalog.dataset

Issue Description

When identical records from two different repositories are merged, their url fields are merged into a single value. As a result, the front end cannot send the user to the corresponding record in each repository. In other words, one of the two distinct url values is lost.

This will become increasingly problematic as more repositories look for their own data. In a merged record, one of the two repositories may be put off by the loss of the link to its site.

To address this, the parsers/crawlers should copy the url field value into the includedInDataCatalog object. The schema gives this object a field (dataset) specifically for this purpose. The dataset property is inherited from schema.org, so the field name stays dataset regardless of the actual record type (e.g. ComputationalTool and ResourceCatalog records will still use includedInDataCatalog.dataset). We'll just have to be more accurate about how it is displayed on the front end.

Note, if this can be done post-build, that's fine too.
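
The copy step described above could be sketched roughly as follows. This is only an illustration, not the project's actual parser code: the function name `copy_url_to_catalog` is hypothetical, and the sketch assumes a record is a plain dict whose `includedInDataCatalog` value is either a single object or a list of objects. It leaves any existing `dataset` value untouched, so it should also be safe to run post-build on records that already carry per-catalog links.

```python
from copy import deepcopy
from typing import Any


def copy_url_to_catalog(record: dict[str, Any]) -> dict[str, Any]:
    """Return a copy of `record` with its url saved on includedInDataCatalog.dataset.

    Hypothetical helper: assumes `includedInDataCatalog` is a dict or a list of dicts.
    """
    out = deepcopy(record)  # never mutate the caller's record
    url = out.get("url")
    catalogs = out.get("includedInDataCatalog")
    if not url or not catalogs:
        return out  # nothing to copy

    # Normalise to a list so both shapes are handled the same way.
    entries = catalogs if isinstance(catalogs, list) else [catalogs]
    for entry in entries:
        if isinstance(entry, dict):
            # Keep an existing dataset link; only fill in a missing one.
            entry.setdefault("dataset", url)
    return out


# Example with made-up values (not real NDE records).
record = {
    "url": "https://example.org/repo-a/records/123",
    "includedInDataCatalog": {"@type": "DataCatalog", "name": "Repo A"},
}
updated = copy_url_to_catalog(record)
print(updated["includedInDataCatalog"]["dataset"])
```

If each source record is processed this way before merging, each includedInDataCatalog entry carries its own repository link, so the front end would not need to rely on the merged top-level url.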

Issue Example

https://data.niaid.nih.gov/resources?id=gse143393

Related WBS task

https://github.com/NIAID-Data-Ecosystem/nde-roadmap/issues/14

For internal use only. Assignee, please select the status of this issue

  • Not yet started
  • In progress
  • Blocked
  • Will not address

Status Description

No response
