Confusion about SP #12

ictzyqq · 2025-02-28T07:07:54Z

The open-source projects released over the past five days are amazing! I'm a little confused about the parallelism strategy used during inference.
The technical report says the attention part uses 4-way Tensor Parallelism (TP4) with Sequence Parallelism (SP). What are the implementation details of SP in the prefilling and decoding stages? Are you going to open-source that code?

Cccei000 · 2025-03-04T03:44:25Z

I think they swallowed it lol. The day6 article suggests that the actual parallelism for inference deployment includes neither TP nor SP, only EP and DP, which makes more sense. SP confused me a lot, but I've decided to let it go :)

1 reply

ictzyqq Mar 4, 2025
Author

I think they swallowed it lol. The day6 article suggests that the actual parallelism for inference deployment includes neither TP nor SP, only EP and DP, which makes more sense. SP confused me a lot, but I've decided to let it go :)

YES! They have abandoned TP with SP. SP doesn't bother me anymore. :)
