-
modded-nanogpt Public
Forked from KellerJordan/modded-nanogptNanoGPT (124M) in 3 minutes
Python MIT License UpdatedOct 15, 2025 -
nanochat Public
Forked from karpathy/nanochatThe best ChatGPT that $100 can buy.
Python UpdatedOct 14, 2025 -
book-trm Public
Forked from SourceShift/book-trmBuilding Tiny Recursive Models from Scratch
Python UpdatedOct 9, 2025 -
SVD-LLM Public
Forked from AIoT-MLSys-Lab/SVD-LLM[ICLR 2025🔥] SVD-LLM & [NAACL 2025🔥] SVD-LLM V2
Python Apache License 2.0 UpdatedAug 28, 2025 -
STAR Public
Forked from Agora-Lab-AI/STARImplementation of the paper from Liquid AI
Python MIT License UpdatedAug 25, 2025 -
reasoning-gym Public
Forked from open-thought/reasoning-gymprocedural reasoning datasets
Python Apache License 2.0 UpdatedAug 18, 2025 -
Hyena-Y Public
Forked from The-Swarm-Corporation/Hyena-YA PyTorch implementation of the Hyena-Y model, a convolution-based multi-hybrid architecture optimized for edge devices.
Python MIT License UpdatedAug 18, 2025 -
cartridges Public
Forked from HazyResearch/cartridgesStoring long contexts in tiny caches with self-study
Python Apache License 2.0 UpdatedAug 17, 2025 -
captum Public
Forked from meta-pytorch/captumModel interpretability and understanding for PyTorch
Python BSD 3-Clause "New" or "Revised" License UpdatedAug 14, 2025 -
speechbrain Public
Forked from speechbrain/speechbrainA PyTorch-based Speech Toolkit
Python Apache License 2.0 UpdatedAug 13, 2025 -
R-Zero Public
Forked from Chengsong-Huang/R-Zerocodes for R-Zero: Self-Evolving Reasoning LLM from Zero Data (https://www.arxiv.org/pdf/2508.05004)
Python UpdatedAug 8, 2025 -
-
ASI-Arch Public
Forked from GAIR-NLP/ASI-ArchAlphaGo Moment for Model Architecture Discovery.
Python Apache License 2.0 UpdatedAug 4, 2025 -
persona_vectors Public
Forked from safety-research/persona_vectorsPersona Vectors: Monitoring and Controlling Character Traits in Language Models
Python UpdatedJul 30, 2025 -
HRM Public
Forked from sapientinc/HRMHierarchical Reasoning Model Official Release
Python Apache License 2.0 UpdatedJul 29, 2025 -
Midm-2.0 Public
Forked from K-intelligence-Midm/Midm-2.0Official repository for Mi:dm 2.0, the large language model developed by KT.
Jupyter Notebook MIT License UpdatedJul 29, 2025 -
HighNoonLLM Public
HighNoon LLM uses Hierarchical Spatial Neural Memory (HSMN) to process language like humans, organizing text into a tree for efficiency. It cuts computing needs by 78x, excelling in summarization, …
-
RAM Public
Forked from facebookresearch/RAMA framework to study AI models in Reasoning, Alignment, and use of Memory (RAM).
Python MIT License UpdatedJul 25, 2025 -
difflogic_nca_logic_synth Public
Forked from fefespn/difflogic_nca_logic_synthcellular automata simulation using difflogic neural network and synthesize a digital logic circuit using difflogic.
Python MIT License UpdatedJul 24, 2025 -
mixture_of_recursions Public
Forked from raymin0223/mixture_of_recursionsMixture-of-Recursions: Learning Dynamic Recursive Depths for Adaptive Token-Level Computation
Python Apache License 2.0 UpdatedJul 22, 2025 -
audio-flamingo Public
Forked from NVIDIA/audio-flamingoPyTorch implementation of Audio Flamingo: Series of Advanced Audio Understanding Language Models
UpdatedJul 16, 2025 -
EXAONE-4.0 Public
Forked from LG-AI-EXAONE/EXAONE-4.0Official repository for EXAONE 4.0 built by LG AI Research
Other UpdatedJul 15, 2025 -
ArchScale Public
Forked from microsoft/ArchScaleSimple & Scalable Pretraining for Neural Architecture Research
Python MIT License UpdatedJul 14, 2025 -
autogen Public
Forked from microsoft/autogenA programming framework for agentic AI 🤖 PyPi: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/autogen-officehour
Python Creative Commons Attribution 4.0 International UpdatedJul 13, 2025 -
Kimi-K2 Public
Forked from MoonshotAI/Kimi-K2Kimi K2 is the large language model series developed by Moonshot AI team
Other UpdatedJul 12, 2025 -
nannyml Public
Forked from NannyML/nannymlnannyml: post-deployment data science in python
Python Apache License 2.0 UpdatedJul 12, 2025 -
ktransformers Public
Forked from kvcache-ai/ktransformersA Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations
Python Apache License 2.0 UpdatedJul 12, 2025 -
Muon Public
Forked from KellerJordan/MuonMuon is an optimizer for hidden layers in neural networks
Python MIT License UpdatedJul 12, 2025 -
EasyEdit Public
Forked from zjunlp/EasyEdit[ACL 2024] An Easy-to-use Knowledge Editing Framework for LLMs.
Jupyter Notebook MIT License UpdatedJul 11, 2025 -
mem0 Public
Forked from mem0ai/mem0Memory for AI Agents; Announcing OpenMemory MCP - local and secure memory management.
Python Apache License 2.0 UpdatedJul 11, 2025