nitishpandey04

Follow

🎯

Focusing

Nitish Pandey nitishpandey04

🎯

Focusing

Follow

Engineer working on LLMs, RL. Trying to be useful. Aim is to contribute meaningfully to development of AGI.

7 followers · 52 following

Achievements

Achievements

Pinned Loading

microLLM microLLM Public

small language models research

Python 1
rl-baselines rl-baselines Public

Baseline implementations of popular RL algorithms in pytorch.

Python
classic-control-rl classic-control-rl Public

Train agents for classic control reinforcement learning environments

Jupyter Notebook
nanochat nanochat Public

Forked from karpathy/nanochat

The best ChatGPT that $100 can buy.

Python
QLoRA-Fine-Tuning-of-Open-Sourced-LLMs QLoRA-Fine-Tuning-of-Open-Sourced-LLMs Public

Fine tuning open sourced LLMs like llama, gemma, qwen, etc using QLoRA technique using huggingface ecosystem of libraries

Jupyter Notebook
Language-Modeling-Zero-to-Hero Language-Modeling-Zero-to-Hero Public

Learning and implementing language modeling from scratch in PyTorch. Starting from bag-of-words models all the way upto Transformers (LLMs)!

Jupyter Notebook 1