Skip to content
View gxh1124's full-sized avatar

Block or report gxh1124

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Nano vLLM

Python 10,835 1,403 Updated Nov 3, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 67,794 12,666 Updated Jan 19, 2026
TypeScript 516 32 Updated Jan 16, 2026

LangChain4j is an open-source Java library that simplifies the integration of LLMs into Java applications through a unified API, providing access to popular LLMs and vector databases. It makes impl…

Java 10,431 1,910 Updated Jan 16, 2026

Spec-driven development (SDD) for AI coding assistants.

TypeScript 18,079 1,232 Updated Jan 19, 2026

Open-source search and retrieval database for AI applications.

Rust 25,586 2,005 Updated Jan 18, 2026

Run frontier LLMs and VLMs with day-0 model support across GPU, NPU, and CPU, with comprehensive runtime coverage for PC (Python/C++), mobile (Android & iOS), and Linux/IoT (Arm64 & x86 Docker). Su…

Go 7,530 942 Updated Jan 16, 2026

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 66,045 8,025 Updated Jan 17, 2026

DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding

Python 5,189 1,810 Updated Feb 26, 2025

Integrate the DeepSeek API into popular softwares

35,137 3,932 Updated Sep 25, 2025

Kimi K2 is the large language model series developed by Moonshot AI team

9,842 729 Updated Nov 7, 2025

Qlib is an AI-oriented Quant investment platform that aims to use AI tech to empower Quant Research, from exploring ideas to implementing productions. Qlib supports diverse ML modeling paradigms, i…

Python 35,766 5,568 Updated Jan 19, 2026

Stable Diffusion web UI

Python 2,310 233 Updated Dec 31, 2025

FlashMLA: Efficient Multi-head Latent Attention Kernels

C++ 11,983 937 Updated Jan 16, 2026
Python 4,591 373 Updated Dec 19, 2025

Vulkan compute tool for testing video memory stability

Rust 583 31 Updated Dec 22, 2025

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Python 19,151 2,143 Updated Jan 19, 2026

Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models.

Go 159,841 14,186 Updated Jan 19, 2026

LLM inference in C/C++

C++ 93,260 14,529 Updated Jan 18, 2026

Janus-Series: Unified Multimodal Understanding and Generation Models

Python 17,665 2,235 Updated Feb 1, 2025

Fully open reproduction of DeepSeek-R1

Python 25,829 2,411 Updated Nov 24, 2025

Distribute and run LLMs with a single file.

C 23,632 1,259 Updated Jan 16, 2026
Python 136 6 Updated May 15, 2024

Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis; ICLR 2024 Spotlight; Official code

Python 1,088 132 Updated Oct 18, 2024

GeneFace++: Generalized and Stable Real-Time 3D Talking Face Generation; Official Code

Python 1,800 258 Updated Oct 18, 2024
Next