Pinned Loading
-
RLHF-CustomData
RLHF-CustomData PublicBuilding an LLM with RLHF involves fine-tuning using human-labeled preferences. Based on Learning to Summarize from Human Feedback, it uses supervised learning, reward modeling, and PPO to improve …
Jupyter Notebook 1
-
-
Replicating_Researchpaper
Replicating_Researchpaper PublicThis repository focuses on reproducing results from notable machine learning and AI research papers by implementing their methods, models, and experiments. Each project includes detailed documentat…
Jupyter Notebook 3
-
-
Machine_Learning_Projects
Machine_Learning_Projects PublicThe Machine Learnings repo consist of open-source machine learning projects covering various domains. It provides users with access to diverse projects, complete with documentation, tutorials, and …
-
If the problem persists, check the GitHub status page or contact support.