  • Nanjing University of Science and Technology(NJUST)
  • Automation Building, No. 95, Zhongguancun East Road, Haidian District, Beijing

Curated list of recent visual autoregressive (VAR) modeling works

31 Updated Mar 17, 2025

A curated list of resources on Visual AutoRegressive Modeling, an approach that lets GPT-style AR models surpass diffusion transformers in image generation.

37 Updated Mar 2, 2025

This repository collects low-bit quantization papers published at top conferences from 2020 to 2025.

67 2 Updated Sep 24, 2025

Quantization Meets dLLMs: A Systematic Study of Post-training Quantization for Diffusion LLMs

Python 26 Updated Oct 15, 2025

The official GitHub repo for the survey paper "A Survey on Diffusion Language Models".

438 19 Updated Oct 28, 2025

[IJCAI'23] The official GitHub page of the paper "Diffusion Models for Non-autoregressive Text Generation: A Survey".

60 5 Updated May 24, 2024

Official PyTorch implementation for "Large Language Diffusion Models"

Python 3,197 217 Updated Nov 8, 2025

Official Repository for NeurIPS 2025 Paper: Next Semantic Scale Prediction via Hierarchical Diffusion Language Models

Python 22 Updated Oct 13, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 20,090 3,323 Updated Nov 10, 2025
Python 92 15 Updated Sep 26, 2025

Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

Python 2,215 282 Updated Nov 10, 2025

[ACL 2024] A novel QAT with Self-Distillation framework to enhance ultra low-bit LLMs.

Python 126 17 Updated May 16, 2024

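Free, open-source, offline OCR software. Supports screenshots and batch image import, PDF document recognition, exclusion of watermarks and headers/footers, and scanning/generating QR codes. Includes built-in multi-language libraries.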

Python 39,665 3,923 Updated May 31, 2025

PyTorch emulation library for Microscaling (MX)-compatible data formats

Python 312 41 Updated Jun 18, 2025
Cuda 2 1 Updated Jun 9, 2025

The official implementation of the EMNLP 2023 paper LLM-FP4

Python 217 21 Updated Dec 15, 2023

PyTorch native quantization and sparsity for training and inference

Python 2,499 364 Updated Nov 10, 2025

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…

C++ 12,090 1,852 Updated Nov 10, 2025

Learn visual autoregressive

Python 1 Updated Jul 22, 2025

LLM Inference with Microscaling Format

Python 32 4 Updated Nov 12, 2024

OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset

7,525 405 Updated Jul 16, 2023
Shell 13 1 Updated Sep 24, 2025

Free ChatGPT & DeepSeek API key. Free access to the DeepSeek API and GPT4 API, with support for top-ranked, commonly used large models such as gpt | deepseek | claude | gemini | grok.

Python 34,294 2,440 Updated Oct 10, 2025

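🔥 Free, public-interest ChatGPT API and GPT4 API. Connects directly with no proxy required, accesses ChatGPT using the standard OpenAI API key format, and works with projects such as ChatGPT-next-web, ChatGPT-Midjourney, Lobe-chat, Botgem, FastGPT, and Immersive Translate.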

5,521 528 Updated Jul 28, 2025

s1: Simple test-time scaling

Python 6,592 762 Updated Jun 25, 2025

Fully open reproduction of DeepSeek-R1

Python 25,624 2,400 Updated Sep 8, 2025

From Chain-of-Thought prompting to OpenAI o1 and DeepSeek-R1 🍓

3,415 200 Updated May 7, 2025

A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations

Python 15,549 1,125 Updated Nov 10, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 62,661 11,162 Updated Nov 10, 2025

An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm.

Python 4,987 524 Updated Apr 11, 2025