Improve index memory management #73

albe · 2019-09-22T11:15:00Z

Currently an index preallocates an array of the length of the index and then fills it with data on demand. Since the internal array is untyped, memory is allocated just for the structure. Still, for large indices this means a whole lot of memory being kept allocated unused in a couple of use cases, most notably the write-only scenario.

Optimally an index would only keep a (configurable) fixed upper bound of data in memory, then fill that on demand.
This could be a good scenario for a ring buffer.

Also, optimally the internal array buffer would be a typed array of (u)int32 to have index data in a contiguous memory block. This could potentially optimize index entry/buffer translation since the entry would just be a typed view on the underlying buffer and no copying on read would be involved.

albe · 2019-10-05T16:09:24Z

Index range reading should return a generator and work with the same semantics as partition reading. Hence, the underlying file abstraction could be shared between index and partition.

albe · 2020-08-20T00:26:15Z

See https://gist.github.com/albe/39c7b79f46daa49d2cf373ffab3c4513 -ugh

albe · 2020-08-22T22:21:05Z

The feedback on nodejs/help repo suggests this is to be expected, as typed arrays do a bit more.
After some testing with creating a custom implementation of a "buffer view entry" the single access use case is faster than the current implementation by a factor of 2, but slower by a factor of two when a second access happens (and hence likely even more for further accesses - i.e. it behaves bad for "cache hits" in the index reader). So the buffer reads need to be memoized eagerly (lazy adds a condition in the access path).

albe added the enhancement label Sep 22, 2019

albe added the P: Index Affects the indexing layer label Oct 5, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve index memory management #73

Improve index memory management #73

albe commented Sep 22, 2019

albe commented Oct 5, 2019

albe commented Aug 20, 2020

albe commented Aug 22, 2020

Improve index memory management #73

Improve index memory management #73

Comments

albe commented Sep 22, 2019

albe commented Oct 5, 2019

albe commented Aug 20, 2020

albe commented Aug 22, 2020