Starred repositories (lianxintao)

Out-of-the-box DeepSeek OCR document parsing Web Studio

TypeScript 490 79 Updated Oct 27, 2025

A learning project for building local knowledge bases from PDFs using LangChain, supporting multiple LLMs (DeepSeek, OpenAI). Features include PDF processing, knowledge graph construction, and natu…

Python 211 31 Updated Jan 30, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 20,402 3,521 Updated Nov 26, 2025

Evaluating Large Language Models for CUDA Code Generation. ComputeEval is a framework designed to generate and evaluate CUDA code from Large Language Models.

Python 76 15 Updated Nov 21, 2025

Tyk Open Source API Gateway written in Go, supporting REST, GraphQL, TCP and gRPC protocols

Go 10,510 1,145 Updated Nov 26, 2025

GPU cluster manager for optimized AI model deployment

Python 4,075 409 Updated Nov 26, 2025

Supercharge Your LLM with the Fastest KV Cache Layer

Python 6,206 750 Updated Nov 26, 2025

Local Deep Research achieves ~95% on SimpleQA benchmark (tested with GPT-4.1-mini). Supports local and cloud LLMs (Ollama, Google, Anthropic, ...). Searches 10+ sources - arXiv, PubMed, web, and yo…

Python 3,651 351 Updated Nov 26, 2025

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

7,931 286 Updated May 15, 2025

The ultimate LLM/AI application development framework in Golang.

Go 8,343 632 Updated Nov 26, 2025

Fully open reproduction of DeepSeek-R1

Python 25,680 2,401 Updated Nov 24, 2025

Simple RL training for reasoning

Python 3,795 281 Updated Aug 3, 2025

🤖FFPA: Extend FlashAttention-2 with Split-D, ~O(1) SRAM complexity for large headdim, 1.8x~3x↑🎉 vs SDPA EA.

Cuda 233 11 Updated Nov 18, 2025

📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉

Cuda 8,606 845 Updated Nov 6, 2025

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 267 10 Updated Oct 11, 2024

🤖 AI Gateway | AI Native API Gateway

Go 6,942 890 Updated Nov 26, 2025

Rethinking Student Productivity

TypeScript 12,365 795 Updated Oct 17, 2024

🎆Interactive Online Platform that Visualizes Algorithms from Code

JavaScript 48,202 7,541 Updated Jun 9, 2024

Craft AI-driven interface effortlessly🤖

TypeScript 3,904 905 Updated Nov 26, 2025

[ICLR2025 Spotlight] SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models

Python 3,381 199 Updated Nov 17, 2025

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python 11,328 1,136 Updated Nov 21, 2025

Mainly records knowledge and interview questions relevant to large language model (LLM) algorithm (application) engineers

HTML 11,024 1,118 Updated Apr 30, 2025

A large-scale simulation framework for LLM inference

Python 486 91 Updated Jul 25, 2025

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 4,317 439 Updated Nov 25, 2025

The AI Code Editor

31,738 2,122 Updated Nov 19, 2025

Fast, Accurate, Lightweight Python library to make State of the Art Embedding

Python 2,523 165 Updated Nov 25, 2025

Convert any text to a graph of knowledge. This can be used for Graph Augmented Generation or Knowledge Graph based QnA

Jupyter Notebook 2,042 382 Updated May 15, 2025

A low-latency & high-throughput serving engine for LLMs

Python 447 59 Updated Oct 16, 2025

MS-Agent: Lightweight Framework for Empowering Agents with Autonomous Exploration in Complex Task Scenarios

Python 3,637 412 Updated Nov 26, 2025