- Shanghai, China
- http://boyuandeng.me
- @boyuandeng
Stars
Code repo for the paper "LLM-QAT: Data-Free Quantization Aware Training for Large Language Models"
A 13B large language model developed by Baichuan Intelligent Technology
Official github repo for C-Eval, a Chinese evaluation suite for foundation models [NeurIPS 2023]
Improved build system generator for CPython C, C++, Cython and Fortran extensions
A large-scale 7B pretrained language model developed by BaiChuan-Inc.
OpenLLaMA, a permissively licensed open source reproduction of Meta AI’s LLaMA 7B trained on the RedPajama dataset
A high-throughput and memory-efficient inference and serving engine for LLMs (see the usage sketch after this list)
Unsupervised text tokenizer for Neural Network-based text generation (see the usage sketch after this list).
Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI-compatible API endpoints in the cloud.
[MICRO'23, MLSys'22] TorchSparse: Efficient Training and Inference Framework for Sparse Convolution on GPUs.
The blessed package to manage your versions from SCM tags
⚡ Langchain apps in production using Jina & FastAPI
The RedPajama-Data repository contains code for preparing large datasets for training large language models.
Explore and understand your training and validation data.
[MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration
PromptCBLUE: a large-scale instruction-tuning dataset for multi-task and few-shot learning in the medical domain in Chinese
Repository accompanying "An Open Dataset and Model for Language Identification" (Burchell et al., 2023)
HuatuoGPT, Towards Taming Language Models To Be a Doctor (an open medical GPT)
Chinese LLaMA & Alpaca large language models, with local CPU/GPU training and deployment (Chinese LLaMA & Alpaca LLMs)
🩺 The first Chinese multimodal medical large model that can read chest X-rays (chest radiograph summarization)
A next-generation Python CMake adaptor and Python API for plugins
A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper, Ada and Blackwell GPUs, to provide better performance with lower memory…
An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
Class notes for the course "Long Term Memory in AI - Vector Search and Databases" COS 597A @ Princeton Fall 2023
Chinese and English multimodal conversational language model
Concurrently chat with ChatGPT, Bing Chat, Bard, Alpaca, Vicuna, Claude, ChatGLM, MOSS, iFLYTEK Spark (讯飞星火), ERNIE Bot (文心一言) and more, to discover the best answers
A guidance language for controlling large language models.
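
A minimal sketch of the vLLM serving engine entry above, assuming the `vllm` Python package is installed; the model id, prompt, and sampling settings are illustrative placeholders, not anything prescribed by this list:

```python
# Offline batched inference with vLLM (a sketch; the model id is an example).
from vllm import LLM, SamplingParams

prompts = ["Summarize quantization-aware training in one sentence."]
params = SamplingParams(temperature=0.8, top_p=0.95, max_tokens=64)

llm = LLM(model="openlm-research/open_llama_7b")  # any HF-compatible model id
outputs = llm.generate(prompts, params)

for out in outputs:
    # Each result carries the original prompt and one or more completions.
    print(out.prompt, "->", out.outputs[0].text)
```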
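And a minimal sketch of the SentencePiece tokenizer entry above, assuming a plain-text corpus at a hypothetical path `corpus.txt`; the vocabulary size and model prefix are placeholders:

```python
# Train and apply an unsupervised subword tokenizer with SentencePiece.
import sentencepiece as spm

# Learn a subword model directly from raw text (no pre-tokenization needed).
spm.SentencePieceTrainer.train(
    input="corpus.txt", model_prefix="example_sp", vocab_size=8000
)

sp = spm.SentencePieceProcessor(model_file="example_sp.model")
print(sp.encode("Hello world", out_type=str))  # subword pieces
print(sp.encode("Hello world"))                # integer token ids
print(sp.decode(sp.encode("Hello world")))     # round-trip back to text
```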