🎯
Focusing
Engineer working on LLMs, RL.
Trying to be useful. Aim is to contribute meaningfully to development of AGI.
- Hyderabad, IN
- @_nitish_pandey_
- https://huggingface.co/nitishpandey04
Pinned Loading
-
rl-baselines
rl-baselines PublicBaseline implementations of popular RL algorithms in pytorch.
Python
-
classic-control-rl
classic-control-rl PublicTrain agents for classic control reinforcement learning environments
Jupyter Notebook
-
-
QLoRA-Fine-Tuning-of-Open-Sourced-LLMs
QLoRA-Fine-Tuning-of-Open-Sourced-LLMs PublicFine tuning open sourced LLMs like llama, gemma, qwen, etc using QLoRA technique using huggingface ecosystem of libraries
Jupyter Notebook
-
Language-Modeling-Zero-to-Hero
Language-Modeling-Zero-to-Hero PublicLearning and implementing language modeling from scratch in PyTorch. Starting from bag-of-words models all the way upto Transformers (LLMs)!
Jupyter Notebook 1
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.

