Shao Yuantian alvinshao0313

Nanjing University of Science and Technology / major in Computer Science and Technology

3 followers · 0 following

Nanjing University of Science and Technology (NJUST)
Automation Building, No. 95, Zhongguancun East Road, Haidian District, Beijing
https://www.njust.edu.cn/

Stars

VILA-Lab / Awesome-DLMs

The official GitHub repo for the survey paper "A Survey on Diffusion Language Models".

361 15 Updated Oct 20, 2025

RUCAIBox / Awesome-Text-Diffusion-Models

Forked from AoiDragon/Awesome-Text-Diffusion-Models

[IJCAI'23] The official GitHub page of the paper "Diffusion Models for Non-autoregressive Text Generation: A Survey".

60 5 Updated May 24, 2024

ML-GSAI / LLaDA

Official PyTorch implementation for "Large Language Diffusion Models"

Python 3,082 206 Updated Oct 21, 2025

zhouc20 / HDLM

Official Repository for NeurIPS 2025 Paper: Next Semantic Scale Prediction via Hierarchical Diffusion Language Models

Python 20 Updated Oct 13, 2025

sgl-project / sglang

SGLang is a fast serving framework for large language models and vision language models.

Python 19,088 3,126 Updated Oct 22, 2025

amd / Quark

Python 85 13 Updated Sep 26, 2025

vllm-project / llm-compressor

Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

Python 2,123 264 Updated Oct 22, 2025

DD-DuDa / BitDistiller

[ACL 2024] A novel QAT with Self-Distillation framework to enhance ultra low-bit LLMs.

Python 123 17 Updated May 16, 2024

hiroi-sora / Umi-OCR

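Free, open-source, offline OCR software. Supports screenshot capture and batch image import, PDF document recognition, exclusion of watermarks and headers/footers, and QR code scanning and generation. Includes built-in multilingual libraries.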

Python 38,844 3,845 Updated May 31, 2025

microsoft / microxcaling

PyTorch emulation library for Microscaling (MX)-compatible data formats

Python 307 41 Updated Jun 18, 2025

alexarmbr / nvfp4_linear

Cuda 2 1 Updated Jun 9, 2025

nbasyl / LLM-FP4

The official implementation of the EMNLP 2023 paper LLM-FP4

Python 217 21 Updated Dec 15, 2023

pytorch / ao

PyTorch native quantization and sparsity for training and inference

Python 2,438 349 Updated Oct 22, 2025

NVIDIA / TensorRT-LLM

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…

C++ 11,917 1,809 Updated Oct 22, 2025

Weepingchestnut / Lvar

Learn visual autoregressive

Python 1 Updated Jul 22, 2025

aiha-lab / MX-QLLM

LLM Inference with Microscaling Format

Python 31 4 Updated Nov 12, 2024

openlm-research / open_llama

OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset

7,526 405 Updated Jul 16, 2023

ziwenhahaha / scripts

Shell 13 1 Updated Sep 24, 2025

chatanywhere / GPT_API_free

Free ChatGPT & DeepSeek API key. Free access to the DeepSeek API and GPT4 API, with support for top-ranked, commonly used large models such as gpt | deepseek | claude | gemini | grok.

Python 33,057 2,349 Updated Oct 10, 2025

popjane / free_chatgpt_api

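🔥 Free, public-interest ChatGPT API and GPT4 API. Connects directly with no proxy required, accesses ChatGPT using the standard OpenAI APIKEY format, and works with projects such as ChatGPT-next-web, ChatGPT-Midjourney, Lobe-chat, Botgem, FastGPT, and Immersive Translate.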

5,452 519 Updated Jul 28, 2025

simplescaling / s1

s1: Simple test-time scaling

Python 6,576 766 Updated Jun 25, 2025

huggingface / open-r1

Fully open reproduction of DeepSeek-R1

Python 25,565 2,396 Updated Sep 8, 2025

atfortes / Awesome-LLM-Reasoning

From Chain-of-Thought prompting to OpenAI o1 and DeepSeek-R1 🍓

3,388 200 Updated May 7, 2025

kvcache-ai / ktransformers

A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations

Python 15,196 1,094 Updated Oct 12, 2025

vllm-project / vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 60,647 10,700 Updated Oct 21, 2025

AutoGPTQ / AutoGPTQ

An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.

Python 4,968 524 Updated Apr 11, 2025

BrotherHappy / OSTQuant

[ICLR2025]: OSTQuant: Refining Large Language Model Quantization with Orthogonal and Scaling Transformations for Better Distribution Fitting

Python 80 5 Updated Apr 8, 2025

ubergarm / r1-ktransformers-guide

run DeepSeek-R1 GGUFs on KTransformers

Python 253 16 Updated Mar 3, 2025

ModelCloud / GPTQModel

LLM model quantization (compression) toolkit with hw acceleration support for Nvidia CUDA, AMD ROCm, Intel XPU and Intel/AMD/Apple CPU via HF, vLLM, and SGLang.

Python 841 121 Updated Oct 22, 2025

stevelaskaridis / awesome-mobile-llm

Awesome Mobile LLMs

256 14 Updated Oct 19, 2025

Lists (8)

✨ Inspiration

LLM

Out Wall

paper list

pypi

Remote Sensing LM

Umixing

VLM
