VPQ Dataset: write codes one word at a time during building #1487

achirkin · 2025-11-03T15:24:23Z

Improve the efficiency of process_and_fill_codes_kernel by writing the codes in larger chunks.
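
For readers following along, the idea described above can be sketched on the host side in plain C++. This is only an illustration under stated assumptions, not the kernel code from this PR: the helper name pack_codes_wordwise, the 32-bit output word, and the LSB-first (little-endian) bit order are assumed here, and codes are taken to be at most 8 bits wide.

```cpp
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical helper (not part of the PR): pack `pq_bits`-wide codes into
// 32-bit words, least significant bits first, emitting one word per store.
// Assumes 1 <= pq_bits <= 8.
std::vector<uint32_t> pack_codes_wordwise(const std::vector<uint8_t>& codes,
                                          uint32_t pq_bits)
{
  constexpr uint32_t kWordBits = 32;
  const uint64_t mask          = (uint64_t{1} << pq_bits) - 1;

  std::vector<uint32_t> out;
  // The staging buffer is wider than a word, so a code that straddles a word
  // boundary still fits before the full word is flushed.
  uint64_t staging     = 0;
  uint32_t filled_bits = 0;

  for (uint8_t code : codes) {
    staging |= (uint64_t{code} & mask) << filled_bits;
    filled_bits += pq_bits;
    if (filled_bits >= kWordBits) {
      // One word-sized write instead of several byte-sized ones.
      out.push_back(static_cast<uint32_t>(staging));
      staging >>= kWordBits;
      filled_bits -= kWordBits;
    }
  }
  // Flush the partially filled last word, if any.
  if (filled_bits > 0) { out.push_back(static_cast<uint32_t>(staging)); }
  return out;
}
```

For example, with pq_bits = 8 the four codes 1, 2, 3, 4 end up in the single word 0x04030201, so four byte-sized stores become one word-sized store.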

copy-pr-bot · 2025-11-04T14:55:27Z

Auto-sync is disabled for draft pull requests in this repository. Workflows must be run manually.

Contributors can view more details about this message here.

achirkin · 2025-11-04T14:57:57Z

Moving this to draft: although the write efficiency improves by 2-5x according to the Nsight profiler (fewer store instructions), the overall kernel runtime barely changes, because the bottleneck is data reading and ALU work (encoding). So the value of this PR is in question.

lowener · 2025-11-05T14:44:40Z

cpp/src/neighbors/detail/vpq_dataset.cuh

+      if (filled_bits >= BitsPerLabel) {
+        filled_bits -= BitsPerLabel;
+        // write the codes to global memory
+        *out_codes_ptr++ = staging_codes;


Moving the lane condition if (lane_id == 0) so that it guards only this line can improve warp parallelism.

achirkin added 2 commits November 3, 2025 15:49

VPQ Dataset: write codes a word at a time

f29f713

It's little endian though

07eaf7a

achirkin requested a review from a team as a code owner November 3, 2025 15:24

achirkin added the improvement Improves an existing functionality label Nov 3, 2025

achirkin added this to Vector Search, ML, & Data Mining Release Board Nov 3, 2025

achirkin added the non-breaking Introduces a non-breaking change label Nov 3, 2025

github-project-automation bot moved this to Todo in Vector Search, ML, & Data Mining Release Board Nov 3, 2025

achirkin moved this from Todo to In Progress in Vector Search, ML, & Data Mining Release Board Nov 3, 2025

Merge branch 'main' into enh-vpq-dataset-faster-codewriting

d888f57

cjnolet approved these changes Nov 4, 2025

achirkin marked this pull request as draft November 4, 2025 14:55

lowener reviewed Nov 5, 2025
