High-performance safetensors model loader

Python 75 14 Updated Nov 19, 2025

LeaderWorkerSet: An API for deploying a group of pods as a unit of replication

Go 617 117 Updated Nov 24, 2025

📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉

Cuda 8,592 845 Updated Nov 6, 2025

Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

Python 2,270 292 Updated Nov 25, 2025

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…

C++ 12,228 1,894 Updated Nov 25, 2025

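Abu quantitative trading system (stocks, options, futures, Bitcoin, machine learning): an open-source, Python-based framework for quantitative trading and quantitative investment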

Python 15,416 4,284 Updated Mar 11, 2025

A Python-based open-source framework for developing quantitative trading platforms

Python 34,123 10,412 Updated Nov 2, 2025

Supercharge Your LLM with the Fastest KV Cache Layer

Python 6,201 748 Updated Nov 25, 2025

vLLM’s reference system for K8S-native cluster-wide deployment with community-driven performance optimization

Python 1,978 326 Updated Nov 21, 2025

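AIInfra (AI infrastructure) covers the AI system stack, from underlying hardware such as chips up to the upper software stack, that supports training and inference of large AI models.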

Jupyter Notebook 5,189 722 Updated Nov 21, 2025

Achieve state of the art inference performance with modern accelerators on Kubernetes

Shell 2,096 251 Updated Nov 25, 2025

Injecting Adrenaline into LLM Serving: Boosting Resource Utilization and Throughput via Attention Disaggregation

Python 38 Updated Nov 10, 2025

A Datacenter Scale Distributed Inference Serving Framework

Rust 5,541 708 Updated Nov 25, 2025

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 4,311 438 Updated Nov 25, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 63,926 11,519 Updated Nov 25, 2025

Costrict - strict AI coder for enterprises, quality first, including AI Agent, AI CodeReview, AI Completion.

TypeScript 3,061 119 Updated Nov 25, 2025

20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.

Python 12,973 1,364 Updated Nov 25, 2025

Mental health LLM (LLM x Mental Health): pre- and post-training, dataset, evaluation, deployment, and RAG, with InternLM / Qwen / Baichuan / DeepSeek / Mixtral / LLama / GLM series models

Python 1,635 207 Updated Aug 19, 2025

[ICML 2022 / ICLR 2024] Source code for our papers "Plug & Play Attacks: Towards Robust and Flexible Model Inversion Attacks" and "Be Careful What You Smooth For".

Jupyter Notebook 45 13 Updated Jul 18, 2025

A tutorial of how to integrate Stripe Payments with Django

Python 102 48 Updated Aug 20, 2021

Admin backend for a message push platform

CSS 3 Updated Apr 9, 2023

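FastAPI + Vue3 admin system with a decoupled front end and back end, including a PC client and a WeChat mini program client. The API uses FastAPI + Pydantic + SQLAlchemy 2.0 + MySQL; the PC client uses Vue3 + TypeScript + Vite + Element Plus; the mini program uses Uni-APP + uview ui. Features include asynchronous storage, RBAC permission management, scheduled tasks, and department management.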

Vue 782 189 Updated Mar 25, 2025

Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.

Python 49,405 4,103 Updated Nov 21, 2025

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.

Python 31,739 6,533 Updated Nov 25, 2025

An Improved Langchain RAG Tutorial (v2) with local LLMs, database updates, and testing.

Python 909 591 Updated Aug 3, 2024

[NeurIPS 2022] Denoising Diffusion Restoration Models -- Official Code Repository

Python 655 64 Updated Oct 10, 2022

[ECCV 2024] codes of DiffBIR: Towards Blind Image Restoration with Generative Diffusion Prior

Python 3,951 346 Updated Jul 29, 2025

A latent text-to-image diffusion model

Jupyter Notebook 71,875 10,525 Updated Jun 18, 2024

Together Mixture-Of-Agents (MoA) – 65.1% on AlpacaEval with OSS models

Python 2,836 376 Updated Jan 7, 2025

llama3 implementation one matrix multiplication at a time

Jupyter Notebook 15,186 1,292 Updated May 23, 2024