-
Notifications
You must be signed in to change notification settings - Fork 3.8k
Pull requests: NVIDIA/Megatron-LM
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Add conditions_embeddings argument to TransformerBlock, TransformerLayer for DiT (diffusion transformer)
complexity: low
Final Review
PR is in the "final review" stage
fix the strategy in tensor_need_offloading_checker
#4121
opened Apr 3, 2026 by
lhb8125
Loading…
5 tasks
Previous Next
ProTip!
Exclude everything labeled
bug with -label:bug.