Skip to content
View LeeWant's full-sized avatar

Block or report LeeWant

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

LLM inference in C/C++

C++ 92,148 14,280 Updated Dec 29, 2025

Supercharge Your LLM with the Fastest KV Cache Layer

Python 6,465 820 Updated Dec 29, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 66,403 12,256 Updated Dec 29, 2025

Open Machine Learning Compiler Framework

Python 12,968 3,751 Updated Dec 29, 2025

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 22,047 3,890 Updated Dec 29, 2025

📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉

Cuda 9,092 897 Updated Dec 24, 2025

MNN is a blazing fast, lightweight deep learning framework, battle-tested by business-critical use cases in Alibaba. Full multimodal LLM Android App:[MNN-LLM-Android](./apps/Android/MnnLlmChat/READ…

C++ 13,804 2,148 Updated Dec 26, 2025

Universal LLM Deployment Engine with ML Compilation

Python 21,800 1,893 Updated Dec 24, 2025

Tile-Based Runtime for Ultra-Low-Latency LLM Inference

Python 494 23 Updated Dec 23, 2025

High-performance inference framework for large language models, focusing on efficiency, flexibility, and availability.

Python 1,362 92 Updated Dec 27, 2025

AISystem 主要是指AI系统,包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术

Jupyter Notebook 15,960 2,282 Updated Sep 3, 2025

AIInfra(AI 基础设施)指AI系统从底层芯片等硬件,到上层软件栈支持AI大模型训练和推理。

Jupyter Notebook 5,553 767 Updated Dec 22, 2025

High-speed Large Language Model Serving for Local Deployment

C++ 8,504 467 Updated Aug 2, 2025

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

71,724 8,220 Updated Dec 22, 2025

MiniCPM4 & MiniCPM4.1: Ultra-Efficient LLMs on End Devices, achieving 3+ generation speedup on reasoning tasks

Jupyter Notebook 8,481 527 Updated Oct 8, 2025

A case study of quantitative modeling for beginners.

Jupyter Notebook 6 Updated Dec 2, 2025

LLM model quantization (compression) toolkit with hw acceleration support for Nvidia CUDA, AMD ROCm, Intel XPU and Intel/AMD/Apple CPU via HF, vLLM, and SGLang.

Python 947 143 Updated Dec 29, 2025

《开源大模型食用指南》针对中国宝宝量身打造的基于Linux环境快速微调(全参数/Lora)、部署国内外开源大模型(LLM)/多模态大模型(MLLM)教程

Jupyter Notebook 26,973 2,700 Updated Dec 20, 2025

LLMs-from-scratch项目中文翻译

Jupyter Notebook 2,148 350 Updated Oct 15, 2025

🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!

Python 36,285 4,289 Updated Dec 28, 2025
Jupyter Notebook 676 167 Updated Dec 18, 2024

A Library for Advanced Deep Time Series Models for General Time Series Analysis.

Python 11,149 1,775 Updated Dec 19, 2025

SpringBoot+Thymeleaf+MyBatis制作的校园事务管理系统

JavaScript 14 4 Updated Apr 10, 2025

Time series Timeseries Deep Learning Machine Learning Python Pytorch fastai | State-of-the-art Deep Learning library for Time Series and Sequences in Pytorch / fastai

Jupyter Notebook 5,943 710 Updated Jul 29, 2025

This repository is transfered from the personal account of Dr. Zhognwei Deng (Michael Teng)

Python 89 7 Updated Mar 31, 2025

An Autonomous LLM Agent for Complex Task Solving

Python 8,484 892 Updated Aug 12, 2024

研究生数学建模,本科生数学建模、数学建模竞赛优秀论文,数学建模算法,LaTeX论文模板,算法思维导图,参考书籍,Matlab软件教程,PPT

TeX 9,769 2,226 Updated Sep 20, 2025

✔️李沐 【动手学深度学习】课程学习笔记:使用pycharm编程,基于pytorch框架实现。

Python 2,926 505 Updated Sep 11, 2023

collect all awesome about IT

524 171 Updated Oct 19, 2022

Java 学习&面试指南(Go、Python 后端面试通用,计算机基础面试总结)。准备后端技术面试,首选 JavaGuide!

Java 153,239 46,098 Updated Dec 24, 2025
Next