Starred repositories
verl: Volcano Engine Reinforcement Learning for LLMs
Master programming by recreating your favorite technologies from scratch.
An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
An industrial deep learning framework for high-dimension sparse data
Pytorch domain library for recommendation systems
Recommendation Algorithm大规模推荐算法库,包含推荐系统经典及最新算法LR、Wide&Deep、DSSM、TDM、MIND、Word2Vec、Bert4Rec、DeepWalk、SSR、AITM,DSIN,SIGN,IPREC、GRU4Rec、Youtube_dnn、NCF、GNN、FM、FFM、DeepFM、DCN、DIN、DIEN、DLRM、MMOE、PLE、ESM…
整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。
21 Lessons, Get Started Building with Generative AI
Evolving Cheatsheets for computer languages/software/OS like Python, Numpy, Pandas, Java, Linux, Terminal, Gentoo
A collection of tutorials on state-of-the-art computer vision models and techniques. Explore everything from foundational architectures like ResNet to cutting-edge models like YOLO11, RT-DETR, SAM …
Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"
《李宏毅深度学习教程》(李宏毅老师推荐👍,苹果书🍎),PDF下载地址:https://github.com/datawhalechina/leedl-tutorial/releases
Modeling, training, eval, and inference code for OLMo
Curated list of project-based tutorials
Data Engineering Zoomcamp is a free 9-week course on building production-ready data pipelines. The next cohort starts in January 2026. Join the course here 👇🏼
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
A natural language interface for computers