Update warning about ignoring cached namespaces #1258

stephprince · 2025-04-07T23:45:03Z

Motivation

Combine the warnings about ignoring cached namespaces into a single warning, and only warn for core namespaces if the cached version is newer than the loaded version.

How to test the behavior?

With an example file from the pynwb test suite:

from pynwb import NWBHDF5IO

io = NWBHDF5IO("tests/back_compat/2.1.0_nwbfile_with_extension.nwb", "r")
io.read()

UserWarning: Ignoring the following cached namespace(s) because another version is already loaded:
hdmf-experimental - cached version: 0.2.0, loaded version: 0.5.0
Ignore this warning if these versions are compatible.
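
For readers skimming the thread, here is a minimal sketch of how several ignored namespaces could be folded into one warning with the wording shown above. The function names and the `(name, cached, loaded)` tuple layout are hypothetical and are not taken from the hdmf code base.

```python
import warnings


def format_ignored_namespaces_warning(ignored):
    """Build one combined message for all ignored cached namespaces.

    `ignored` is a list of (name, cached_version, loaded_version) tuples.
    This layout is an assumption made for illustration only.
    """
    lines = ["Ignoring the following cached namespace(s) because another "
             "version is already loaded:"]
    for name, cached, loaded in ignored:
        lines.append(f"{name} - cached version: {cached}, loaded version: {loaded}")
    lines.append("Ignore this warning if these versions are compatible.")
    # Join outside of any f-string so no backslash appears inside an f-string.
    return "\n".join(lines)


def warn_about_ignored_namespaces(ignored):
    """Emit a single UserWarning, or nothing if no namespace was ignored."""
    if ignored:
        warnings.warn(format_ignored_namespaces_warning(ignored), UserWarning)


message = format_ignored_namespaces_warning(
    [("hdmf-experimental", "0.2.0", "0.5.0")]
)
print(message)
```

Collecting the entries first and warning once keeps the output to a single message regardless of how many cached namespaces were skipped.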

Checklist

Did you update CHANGELOG.md with your changes?
Does the PR clearly describe the problem and the solution?
Have you reviewed our Contributing Guide?
Does the PR use "Fix #XXX" notation to tell GitHub to close the relevant issue numbered XXX when the PR is merged?

codecov · 2025-04-07T23:46:08Z

Codecov Report

Attention: Patch coverage is 97.80220% with 2 lines in your changes missing coverage. Please review.

Project coverage is 91.68%. Comparing base (6f138ec) to head (0e84263).
Report is 1 commit behind head on dev.

Files with missing lines	Patch %	Lines
src/hdmf/spec/namespace.py	97.40%	1 Missing and 1 partial ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##              dev    #1258      +/-   ##
==========================================
+ Coverage   91.66%   91.68%   +0.01%     
==========================================
  Files          42       42              
  Lines        9552     9597      +45     
  Branches     1921     1933      +12     
==========================================
+ Hits         8756     8799      +43     
- Misses        518      519       +1     
- Partials      278      279       +1

rly · 2025-04-08T00:31:32Z

I would really like to not show the hdmf-experimental warning but I know we do not ensure backwards compatibility between schema versions there. What if we also hide the warning if none of the data types in the namespace are used in the file? Most files do not involve data types used in hdmf-experimental.

On rewrite/append, the loaded namespaces are written to the file, so we should not have a problem with writing new data that is incompatible with old namespaces.

stephprince · 2025-04-08T16:59:24Z

> I would really like to not show the hdmf-experimental warning but I know we do not ensure backwards compatibility between schema versions there. What if we also hide the warning if none of the data types in the namespace are used in the file? Most files do not involve data types used in hdmf-experimental.

I think that's a good idea! I agree it would be nice to not display that warning unless necessary.

Are there any places you can point me to in the code that get all the data types used in the file? I am thinking this would be an optional argument to NamespaceCatalog.load_namespaces since the NamespaceCatalog seems isolated from the file object?

rly · 2025-04-08T18:16:10Z

After talking with @oruebel, we realized that it would be hard or hacky to get all the data types used in the file. The hacky way is to iterate through all groups and datasets in the file and look for the neurodata_type or data_type attributes, but this means iterating through the whole file just to decide whether to show a warning. This iteration is expensive, especially for streaming data.
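
To make the cost of the "hacky way" concrete, here is a rough sketch of what such a scan could look like. It assumes an object exposing `visititems(callback)` and per-object `.attrs` mappings, which is the interface h5py groups and files provide; the function name `collect_data_types` is hypothetical and nothing like it was added in this PR.

```python
def collect_data_types(h5_group):
    """Return the set of data type names found on any group or dataset.

    `h5_group` is assumed to behave like an h5py File/Group: it has a
    `visititems(callback)` method, and every visited object has `.attrs`.
    """
    type_attrs = ("neurodata_type", "data_type")
    found = set()

    def _visit(name, obj):
        for attr in type_attrs:
            if attr in obj.attrs:
                value = obj.attrs[attr]
                # Attribute values may come back as bytes depending on how
                # the file was written, so normalize to str.
                if isinstance(value, bytes):
                    value = value.decode("utf-8")
                found.add(str(value))
        # Returning None tells visititems to keep walking.
        return None

    # This touches every group and dataset below h5_group (not h5_group
    # itself), which is the expensive part for large or streamed files.
    h5_group.visititems(_visit)
    return found

# Possible usage with h5py (not run here):
#     with h5py.File(path, "r") as f:
#         used_types = collect_data_types(f)
```

Every object in the file is visited once, so the work grows with file size even though the result only decides whether to show a warning.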

Another approach we discussed is to have the IO object build all the builders before loading the namespace from the file. Then we can iterate through the in-memory builders. But this requires refactoring the logic in NWBHDF5IO and HDF5IO, which currently take in a BuildManager that has the loaded namespace. This would be complicated.

The primary use case for the hdmf-experimental warning is to warn users of incompatible/breaking changes introduced in, say, 0.4.0 when the data was written in an older version, say, 0.3.0. This really only impacts ExternalResources. (EnumData is also in hdmf-experimental but has not changed.) To my knowledge, no one is using older versions of ExternalResources (and perhaps not the latest version either), it is relatively stable now, and no one has yet raised an issue about incompatibilities. Therefore, the impact of just ignoring this warning is minimal.

So, I would say, let's include hdmf-experimental into the "core" set of namespaces to ignore from hdmf.
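
The effect of treating hdmf-experimental as a core namespace can be sketched roughly as follows. This is only an illustration under stated assumptions: the names `CORE_NAMESPACES`, `parse_version`, and `should_warn` are hypothetical, versions are assumed to be plain dotted integers, and the rule for non-core namespaces (warn whenever the versions differ) is an assumption rather than something confirmed in this thread.

```python
# Hypothetical list for illustration; the real list lives in hdmf.
CORE_NAMESPACES = ("hdmf-common", "hdmf-experimental")


def parse_version(version):
    """Turn a dotted version such as '0.10.0' into a tuple of ints.

    Assumes purely numeric components; pre-release suffixes are not handled.
    """
    return tuple(int(part) for part in version.split("."))


def should_warn(name, cached_version, loaded_version,
                core_namespaces=CORE_NAMESPACES):
    """Decide whether an ignored cached namespace deserves a warning."""
    cached = parse_version(cached_version)
    loaded = parse_version(loaded_version)
    if name in core_namespaces:
        # Core namespaces: only warn when the file was written with a
        # newer schema than the one currently loaded.
        return cached > loaded
    # Assumed rule for other namespaces: warn on any version mismatch.
    return cached != loaded
```

Under these assumptions, a file caching hdmf-experimental 0.2.0 read with 0.5.0 loaded would produce no warning, while a file caching a newer version than the loaded one still would. Comparing integer tuples rather than strings avoids ordering "0.10.0" before "0.9.0".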

stephprince · 2025-04-08T22:48:21Z

> So, I would say, let's include hdmf-experimental into the "core" set of namespaces to ignore from hdmf.

Thanks for explaining in detail. That sounds good to me - I'll add the hdmf-experimental namespace to the NamespaceCatalog.__core_namespace list.

Note that the pynwb __init__.py file will also need to be updated to add the "core" namespace as a core namespace, which would fully eliminate the pynwb/hdmf namespace-related warnings when the cached namespace version is not newer. (In the example output I showed above with "tests/back_compat/2.1.0_nwbfile_with_extension.nwb", I think I was accidentally using a cached typemap in which I had added the "core" namespace in pynwb for testing purposes.)

Edit: see NeurodataWithoutBorders/pynwb#2064 as the open issue to add "core" as a core namespace

stephprince · 2025-04-09T17:51:55Z

@rly this is ready for you to look at again

stephprince · 2025-04-17T18:57:14Z

@rly I think I addressed all of your comments

I believe the test failures are due to an ImportError issue with the schemasheets package. The warning in the logs is a little confusing because schemasheets is actually installed, but does not seem to be compatible with the latest versions of linkml_runtime>=1.9.0. I'll open up an issue on that repo (edit: see linkml/schemasheets#147).

rly · 2025-04-26T06:52:47Z

Thanks! I made a couple minor comments. Otherwise looks good.

stephprince · 2025-05-06T17:47:51Z

@rly I think I addressed all your remaining comments! This should be ready to merge

rly · 2025-05-06T18:03:09Z

Looks good. Thank you! Squash and merge anytime.

rly · 2025-05-06T18:04:03Z

I'll just do it since Matt and I are managing this package and the release.

h-mayorquin · 2025-05-07T14:19:51Z

This is great! Thanks for moving this forward!

stephprince added 5 commits April 7, 2025 15:36

add util function for version comparison

4c70b23

update warning and refactor namespace loading

cfed7d3

add tests for new namespace warnings

aba88a6

remove old warning filtering

91b0c42

remove order_deps functions from h5tools

8762e01

stephprince commented Apr 7, 2025

src/hdmf/spec/namespace.py (outdated)

[pre-commit.ci] auto fixes from pre-commit.com hooks

583848d

for more information, see https://pre-commit.ci

stephprince commented Apr 7, 2025

src/hdmf/utils.py (outdated)

stephprince added 4 commits April 7, 2025 16:51

update CHANGELOG

c6a2c3a

remove backslashes from f-strings for older python versions

d8d7e29

fix spelling in comment

b3a2f3a

Merge branch 'dev' into update-namespace-warning

2a6c21d

stephprince requested a review from rly April 8, 2025 00:05

add hdmf-experimental to core_namespace list

bd97a44

stephprince marked this pull request as ready for review April 9, 2025 17:51

stephprince mentioned this pull request Apr 9, 2025

[Feature]: Specify core namespace when creating the NamespaceCatalog NeurodataWithoutBorders/pynwb#2064

Open

3 tasks

Merge branch 'dev' into update-namespace-warning

ffe9384

rly reviewed Apr 13, 2025

src/hdmf/spec/namespace.py (outdated)

rly and others added 2 commits April 13, 2025 09:32

Update src/hdmf/spec/namespace.py

0ea3597

[pre-commit.ci] auto fixes from pre-commit.com hooks

03dbb66

for more information, see https://pre-commit.ci

rly reviewed Apr 13, 2025

src/hdmf/utils.py (outdated)

rly reviewed Apr 13, 2025

src/hdmf/spec/namespace.py (outdated)

rly reviewed Apr 13, 2025

tests/unit/spec_tests/test_load_namespace.py (outdated)

update version comparison function

373ce06

stephprince and others added 3 commits April 17, 2025 10:43

move core_namespace list definition to hdmf common init

c2a0a9e

add warning comparison to tests

d43e076

[pre-commit.ci] auto fixes from pre-commit.com hooks

2e87ec7

for more information, see https://pre-commit.ci

stephprince added 2 commits May 6, 2025 10:37

update warning message for extension namespace compatibility

62528a3

Merge branch 'dev' into update-namespace-warning

0e84263

rly approved these changes May 6, 2025

rly merged commit 4ea9ffa into dev May 6, 2025
26 of 27 checks passed

rly deleted the update-namespace-warning branch May 6, 2025 18:04

stephprince mentioned this pull request May 7, 2025

[Feature]: Load_namespaces consistency across ZarrIO and HDF5IO hdmf-dev/hdmf-zarr#275

Open
