-
LLM-quest Public
Verbose implementations of LLMs architectures, techniques and research papers from scratch. DeepSeek, Qwen3..., RLHF, MoE, Multimodal...
-
trl Public
Forked from huggingface/trlTrain transformer language models with reinforcement learning.
Python Apache License 2.0 UpdatedJan 9, 2026 -
number-token-loss Public
Forked from ai4sd/number-token-lossPyPI package for number token loss
Python MIT License UpdatedDec 18, 2025 -
transformers Public
Forked from huggingface/transformers🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
Python Apache License 2.0 UpdatedDec 17, 2025 -
GRPO-classic-RL Public
Open-source implementation/adaptation of DeepSeek GRPO applied to Reinforcement Learning control problems. Example on LunarLander-V3.
Jupyter Notebook MIT License UpdatedDec 6, 2025 -
rlhf-book Public
Forked from natolambert/rlhf-bookTextbook on reinforcement learning from human feedback
TeX MIT License UpdatedDec 3, 2025 -
LLMs-from-scratch Public
Forked from rasbt/LLMs-from-scratchImplement a ChatGPT-like LLM in PyTorch from scratch, step by step
-
Moonlight Public
Forked from MoonshotAI/MoonlightMuon is Scalable for LLM Training
MIT License UpdatedJul 27, 2025 -
ffn-from-scratch Public
Feed Forward Neural Network (FFN) from Scratch. In pure Python and NumPy. One derivative at a time.
-
bdo-enhancing-model Public
Probabilistic Modeling of Black Desert Online's Enhancement System. A proof of concept predicting outcomes to derive optimal profitability strategies.
Jupyter Notebook MIT License UpdatedApr 30, 2025 -
vllm Public
Forked from vllm-project/vllmA high-throughput and memory-efficient inference and serving engine for LLMs
Python Apache License 2.0 UpdatedApr 29, 2025 -
aaamlp-enhanced-pdf Public
Approaching (Almost) Any Machine Learning Problem (AAAMLP) PDF from @abhishekkrthakur with outline, cover, notes
-
mlp Public
Forked from lukasugar/mlpThe Multilayer Perceptron Language Model
Python UpdatedAug 9, 2024