-
Nanjing University of Science and Technology(NJUST)
- Automation Building, No. 95, Zhongguancun East Road, Haidian District, Beijing
- https://www.njust.edu.cn/
Highlights
- Pro
Lists (8)
Sort Name ascending (A-Z)
Stars
The official GitHub repo for the survey paper "A Survey on Diffusion Language Models".
[IJCAI'23] The official Github page of the paper "Diffusion Models for Non-autoregressive Text Generation: A Survey".
Official PyTorch implementation for "Large Language Diffusion Models"
Official Repository for NeurIPS 2025 Paper: Next Semantic Scale Prediction via Hierarchical Diffusion Language Models
SGLang is a fast serving framework for large language models and vision language models.
Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM
[ACL 2024] A novel QAT with Self-Distillation framework to enhance ultra low-bit LLMs.
OCR software, free and offline. 开源、免费的离线OCR软件。支持截屏/批量导入图片,PDF文档识别,排除水印/页眉页脚,扫描/生成二维码。内置多国语言库。
PyTorch emulation library for Microscaling (MX)-compatible data formats
The official implementation of the EMNLP 2023 paper LLM-FP4
PyTorch native quantization and sparsity for training and inference
TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…
OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset
Free ChatGPT&DeepSeek API Key,免费ChatGPT&DeepSeek API。免费接入DeepSeek API和GPT4 API,支持 gpt | deepseek | claude | gemini | grok 等排名靠前的常用大模型。
🔥 公益免费的ChatGPT API,Free ChatGPT API,GPT4 API,可直连,无需代理,使用标准 OpenAI APIKEY 格式访问 ChatGPT,可搭配ChatGPT-next-web、ChatGPT-Midjourney、Lobe-chat、Botgem、FastGPT、沉浸式翻译等项目使用
Fully open reproduction of DeepSeek-R1
From Chain-of-Thought prompting to OpenAI o1 and DeepSeek-R1 🍓
A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations
A high-throughput and memory-efficient inference and serving engine for LLMs
An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.
[ICLR2025]: OSTQuant: Refining Large Language Model Quantization with Orthogonal and Scaling Transformations for Better Distribution Fitting
run DeepSeek-R1 GGUFs on KTransformers
LLM model quantization (compression) toolkit with hw acceleration support for Nvidia CUDA, AMD ROCm, Intel XPU and Intel/AMD/Apple CPU via HF, vLLM, and SGLang.