do not zero out sink tokens during recache #21

hturki · 2025-10-25T00:46:49Z

fixes #20

ki-lw · 2025-10-25T05:14:06Z

Thanks for your modification! However, I believe there’s a logical issue with the change.

The cache zeroing occurs under if not self.global_sink, meaning it only happens when global_sink is False. In that case, the sink region is not global — otherwise, we could simply set global_sink = True.

During the subsequent cache update process, whether the region cache["k"][:, self.generator.model.config.sink_size * self.frame_seq_length:] was zeroed beforehand doesn’t really matter. It will be recomputed based on current_start and write_len, thus properly updated.

In my view, the if self.global_sink branch mainly concerns the update mechanism for the sink region, and zeroing [sink_size * self.frame_seq_length:] does not affect the subsequent updates of that region itself.

hturki · 2025-10-26T02:47:28Z

Good point! Looking at this more closely, I actually think that the zero'ing out is unnecessary since as you mention the cache entries will be overwritten during the subsequent recompute. I think that the proper fix is to actually modify https://github.com/NVlabs/LongLive/pull/21/files#diff-4ed558c68c6ec9a180de74af9d37373fdf8fff84ac40cbf34a6669ad03599a18R260 to account for whether we are using a global sink or not.

hturki · 2025-10-26T02:50:18Z

This is what I get with the model shared on hugging face before this MR is applied

rank0-0-0_lora.mp4

And then with the fix, this is what we get with the global sink enabled:

rank0-0-0_lora.mp4

And this is with the fix + the global sink disabled:

rank0-0-0_lora.mp4

At least on this example, it seems that keeping the global sink is important for maintaining the appearance of the boy.

ki-lw · 2025-10-26T08:26:44Z

Yeah! That’s great!

In fact, based on the experimental setup described in the LongLive paper, the global_sink setting should default to True. I was just a bit surprised to see that in the provided longlive_interactive_inference.yaml, it’s set to False.

That said, your modification effectively fixes the behavior when global_sink is set to False — even though none of us would really recommend using that configuration (haha).

do not zero out sink tokens during recache

e03cceb

have recompute take global sink into account

e59bf14

hturki and others added 2 commits October 26, 2025 23:13

Update causal_model.py

85070c1

actually use block mask

4f69408

hturki force-pushed the ht/recache-fix branch from 2681bcb to f5597f1 Compare October 29, 2025 07:22

more tweaks

d69a1fa

hturki force-pushed the ht/recache-fix branch from f5597f1 to d69a1fa Compare October 29, 2025 18:05

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

do not zero out sink tokens during recache #21

do not zero out sink tokens during recache #21

Uh oh!

hturki commented Oct 25, 2025

Uh oh!

ki-lw commented Oct 25, 2025

Uh oh!

hturki commented Oct 26, 2025

Uh oh!

hturki commented Oct 26, 2025 •

edited

Loading

Uh oh!

ki-lw commented Oct 26, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

do not zero out sink tokens during recache #21

Are you sure you want to change the base?

do not zero out sink tokens during recache #21

Uh oh!

Conversation

hturki commented Oct 25, 2025

Uh oh!

ki-lw commented Oct 25, 2025

Uh oh!

hturki commented Oct 26, 2025

Uh oh!

hturki commented Oct 26, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ki-lw commented Oct 26, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

hturki commented Oct 26, 2025 •

edited

Loading