Skip to content
View alvinshao0313's full-sized avatar
  • Nanjing University of Science and Technology(NJUST)
  • Automation Building, No. 95, Zhongguancun East Road, Haidian District, Beijing

Highlights

  • Pro

Block or report alvinshao0313

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

The official GitHub repo for the survey paper "A Survey on Diffusion Language Models".

361 15 Updated Oct 20, 2025

[IJCAI'23] The official Github page of the paper "Diffusion Models for Non-autoregressive Text Generation: A Survey".

60 5 Updated May 24, 2024

Official PyTorch implementation for "Large Language Diffusion Models"

Python 3,082 206 Updated Oct 21, 2025

Official Repository for NeurIPS 2025 Paper: Next Semantic Scale Prediction via Hierarchical Diffusion Language Models

Python 20 Updated Oct 13, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 19,088 3,126 Updated Oct 22, 2025
Python 85 13 Updated Sep 26, 2025

Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

Python 2,123 264 Updated Oct 22, 2025

[ACL 2024] A novel QAT with Self-Distillation framework to enhance ultra low-bit LLMs.

Python 123 17 Updated May 16, 2024

OCR software, free and offline. 开源、免费的离线OCR软件。支持截屏/批量导入图片,PDF文档识别,排除水印/页眉页脚,扫描/生成二维码。内置多国语言库。

Python 38,844 3,845 Updated May 31, 2025

PyTorch emulation library for Microscaling (MX)-compatible data formats

Python 307 41 Updated Jun 18, 2025
Cuda 2 1 Updated Jun 9, 2025

The official implementation of the EMNLP 2023 paper LLM-FP4

Python 217 21 Updated Dec 15, 2023

PyTorch native quantization and sparsity for training and inference

Python 2,438 349 Updated Oct 22, 2025

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…

C++ 11,917 1,809 Updated Oct 22, 2025

Learn visual autoregressive

Python 1 Updated Jul 22, 2025

LLM Inference with Microscaling Format

Python 31 4 Updated Nov 12, 2024

OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset

7,526 405 Updated Jul 16, 2023
Shell 13 1 Updated Sep 24, 2025

Free ChatGPT&DeepSeek API Key,免费ChatGPT&DeepSeek API。免费接入DeepSeek API和GPT4 API,支持 gpt | deepseek | claude | gemini | grok 等排名靠前的常用大模型。

Python 33,057 2,349 Updated Oct 10, 2025

🔥 公益免费的ChatGPT API,Free ChatGPT API,GPT4 API,可直连,无需代理,使用标准 OpenAI APIKEY 格式访问 ChatGPT,可搭配ChatGPT-next-web、ChatGPT-Midjourney、Lobe-chat、Botgem、FastGPT、沉浸式翻译等项目使用

5,452 519 Updated Jul 28, 2025

s1: Simple test-time scaling

Python 6,576 766 Updated Jun 25, 2025

Fully open reproduction of DeepSeek-R1

Python 25,565 2,396 Updated Sep 8, 2025

From Chain-of-Thought prompting to OpenAI o1 and DeepSeek-R1 🍓

3,388 200 Updated May 7, 2025

A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations

Python 15,196 1,094 Updated Oct 12, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 60,647 10,700 Updated Oct 21, 2025

An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.

Python 4,968 524 Updated Apr 11, 2025

[ICLR2025]: OSTQuant: Refining Large Language Model Quantization with Orthogonal and Scaling Transformations for Better Distribution Fitting

Python 80 5 Updated Apr 8, 2025

run DeepSeek-R1 GGUFs on KTransformers

Python 253 16 Updated Mar 3, 2025

LLM model quantization (compression) toolkit with hw acceleration support for Nvidia CUDA, AMD ROCm, Intel XPU and Intel/AMD/Apple CPU via HF, vLLM, and SGLang.

Python 841 121 Updated Oct 22, 2025

Awesome Mobile LLMs

256 14 Updated Oct 19, 2025
Next