Highlights
Popular repositories Loading
-
-
DeepSpeed
DeepSpeed PublicForked from deepspeedai/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
Python
-
Megatron-LM
Megatron-LM PublicForked from NVIDIA/Megatron-LM
Ongoing research training transformer models at scale
Python
-
GPU-Puzzles
GPU-Puzzles PublicForked from srush/GPU-Puzzles
Solve puzzles. Learn CUDA.
Jupyter Notebook
-
light-llm-grad
light-llm-grad PublicThis is a lightweight training framework for LLM (Language Model).
-
dotfiles
dotfiles PublicForked from coderabbitai/dotfiles
A modern Zsh/tmux, Vim and Homebrew centric setup for macOS and Linux
Shell
15 contributions in the last year
Day of Week | April Apr | May May | June Jun | July Jul | August Aug | September Sep | October Oct | November Nov | December Dec | January Jan | February Feb | March Mar | April Apr | ||||||||||||||||||||||||||||||||||||||||
Sunday Sun | |||||||||||||||||||||||||||||||||||||||||||||||||||||
Monday Mon | |||||||||||||||||||||||||||||||||||||||||||||||||||||
Tuesday Tue | |||||||||||||||||||||||||||||||||||||||||||||||||||||
Wednesday Wed | |||||||||||||||||||||||||||||||||||||||||||||||||||||
Thursday Thu | |||||||||||||||||||||||||||||||||||||||||||||||||||||
Friday Fri | |||||||||||||||||||||||||||||||||||||||||||||||||||||
Saturday Sat |
Activity overview
Contribution activity
April 2025
Created 1 commit in 1 repository
Created 1 repository
-
wZuck/Pai-Megatron-Patch
Python
This contribution was made on Apr 10
Created a pull request in alibaba/Pai-Megatron-Patch that received 4 comments
fix Qwen2.5VL window/full attention error
see issue #552 fix packed_seq_params for Qwen2.5VL
Opened 1 issue in 1 repository
alibaba/Pai-Megatron-Patch
1
open
-
Qwen2.5 VL model impl wrong
This contribution was made on Apr 10