- 👋 Hi, I’m @CSfufu
- I am currently focus on MLLM reasoning and Reinforcement Learning.
【次の交差点でお会いします、よろしくお願いします】
-
Zhejiang University
- Shanghai China
-
06:05
(UTC +08:00)
Highlights
- Pro
Pinned Loading
-
XiaoYee/Awesome_Efficient_LRM_Reasoning
XiaoYee/Awesome_Efficient_LRM_Reasoning Public😎 A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, Agent, and Beyond
-
Revisual-R1
Revisual-R1 Public🚀ReVisual-R1 is a 7B open-source multimodal language model that follows a three-stage curriculum—cold-start pre-training, multimodal reinforcement learning, and text-only reinforcement learning—to …
-
hiyouga/EasyR1
hiyouga/EasyR1 PublicEasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL
-
shawn0728/ARES
shawn0728/ARES Public🌴 ARES is an open-source framework for adaptive multimodal reasoning, featuring a two-stage pipeline—Adaptive Cold-Start and Entropy-Shaped Policy Optimization—to balance reasoning depth and effici…
-
volcengine/verl
volcengine/verl Publicverl: Volcano Engine Reinforcement Learning for LLMs
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.

