🎯
Focusing
Applied Machine Learning/Machine Learning Systems
-
USC -> ByteDance
- LA -> Shanghai
- https://vermouth1992.github.io
Pinned Loading
-
volcengine/verl
volcengine/verl Publicverl: Volcano Engine Reinforcement Learning for LLMs
-
drl-portfolio-management
drl-portfolio-management Public archiveCSCI 599 deep learning and its applications final project
-
synthetic-time-series-smart-grid
synthetic-time-series-smart-grid Public archiveSynthetic Time Series Generation using Generative Adversarial Network
-
321 contributions in the last year
Day of Week | April Apr | May May | June Jun | July Jul | August Aug | September Sep | October Oct | November Nov | December Dec | January Jan | February Feb | March Mar | April Apr | ||||||||||||||||||||||||||||||||||||||||
Sunday Sun | |||||||||||||||||||||||||||||||||||||||||||||||||||||
Monday Mon | |||||||||||||||||||||||||||||||||||||||||||||||||||||
Tuesday Tue | |||||||||||||||||||||||||||||||||||||||||||||||||||||
Wednesday Wed | |||||||||||||||||||||||||||||||||||||||||||||||||||||
Thursday Thu | |||||||||||||||||||||||||||||||||||||||||||||||||||||
Friday Fri | |||||||||||||||||||||||||||||||||||||||||||||||||||||
Saturday Sat |
Less
No contributions.
Low contributions.
Medium-low contributions.
Medium-high contributions.
High contributions.
More
Contribution activity
April 2025
Created 2 commits in 1 repository
Opened 2 pull requests in 1 repository
volcengine/verl
2
merged
-
[logger] fix: fix mlflow
This contribution was made on Apr 14
-
[megatron] feat: optimize entropy loss
This contribution was made on Apr 10
Reviewed 32 pull requests in 1 repository
volcengine/verl
25 pull requests
-
doc: upgrade to vllm 0.8.3
This contribution was made on Apr 14
-
fix time for dapo
This contribution was made on Apr 14
-
mcore readme
This contribution was made on Apr 14
-
fix: replace '@' with '_at_' in metric names to comply with MLflow naming constraints
This contribution was made on Apr 14
-
Update vllm 0.8.2 with megatron 0.11.0
This contribution was made on Apr 14
-
Fix megatron default config
This contribution was made on Apr 13
-
fix checkpoint rng_states confliction
This contribution was made on Apr 13
-
fix: Megatron_workers batch_size config is not processed correctly
This contribution was made on Apr 13
-
reset default tp size
This contribution was made on Apr 12
-
tests: add import utils tests
This contribution was made on Apr 12
-
[mcore] option to use dist checkpoint
This contribution was made on Apr 11
-
fix: use packaging to compre versions instead of str comparing
This contribution was made on Apr 11
-
Support fsdp2 for fsdp_worker
This contribution was made on Apr 11
-
[sglang] docs: fix README index
This contribution was made on Apr 11
-
Change behaviour during raw prompt extraction
This contribution was made on Apr 10
-
docs: update recent talks
This contribution was made on Apr 10
-
fix: wrong pg_clipfrac_lower
This contribution was made on Apr 8
-
docs: add open-hands, vagen
This contribution was made on Apr 8
-
fix: optim.warmup_style do not take effect (#418)
This contribution was made on Apr 8
-
fix: support non-DTensor when converting fsdp checkpoints to hf model
This contribution was made on Apr 7
-
[algo] misc: remove redundant tile([1, response_length]), efficient broadcast instead
This contribution was made on Apr 4
-
Support REINFORCE++-baseline and add script for REINFORCE++
This contribution was made on Apr 4
-
fix: the error is not raised when using both megatron and hf inference
This contribution was made on Apr 3
-
docs: add config docs for evaluation.yaml
This contribution was made on Apr 3
-
fix: misleading eos_mask->response_mask
This contribution was made on Apr 3
- Some pull request reviews not shown.