- Ben Brock
- Tim Davis
- Jim Kitchen
- Tim Mattson
- Scott McMillan
- Erik Welch
- Isaac Virshup
- Will Kimmerer
- Discuss auxiliary data, as well as next steps for formalizing the API.
- Tim presented the auxiliary data used in the SuiteSparse Matrix Collection.
- These consist of various bits of metadata associated with one of
- The matrix as a whole (i.e. arbitrary ASCII)
- Rows/columns/vertices
- Edges (individual nonzeros)
- In SuiteSparse, these are stored as separate documents within a tarball
- Based on standardized data in MatrixMarket file, look for files (text, MatrixMarket) storing additional data.
- These consist of various bits of metadata associated with one of
Do we want to support auxiliary data?
Jim: yes, we can store these by nature of using a binary data container. Auxiliary data can simply be additional datasets in a nested namespace.
Isaac: one exception is data associated with individual edges, which is more tightly associated with nonzeros, so needs to be associated with matrix structure.
Erik's 2.0 proposal supports edge properties, essentially by allowing additional values arrays.
User-defined types: Tim presented SuiteSparse's user-defined type mechanism, which essentially stores the C struct along with metadata.
Should we store the types of indices/values in metadata? Jim: if container does not support all types, it becomes necessary to store additional type information. Example: bool (HDF5/NetCDF have no bool).
Are we ready to begin drafting spec document with some of the 1.0 details we have mostly crystallized? Yes.
-
Ben will create independent GitHub repo with binary sparse prototype parser.
-
Ben will continue to investigate NetCDF
-
Isaac will set up spec document using Bikeshed