AI Native Data App Development framework with AWEL (Agentic Workflow Expression Language) and Agents

Python 17,594 2,457 Updated Nov 6, 2025

Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with …

Python 8,711 759 Updated Nov 7, 2025

Distributed reliable key-value store for the most critical data of a distributed system

Go 50,728 10,207 Updated Nov 8, 2025

The Triton TensorRT-LLM Backend

905 133 Updated Nov 7, 2025
Python 626 57 Updated Jul 31, 2024

Optimize QWen1.5 models with TensorRT-LLM

Python 17 3 Updated May 14, 2024

Retrieval and Retrieval-augmented LLMs

Python 10,801 805 Updated Oct 22, 2025

RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs

TypeScript 67,324 7,182 Updated Nov 7, 2025

Production-ready platform for agentic workflow development.

TypeScript 118,395 18,301 Updated Nov 7, 2025
Python 1,663 133 Updated Sep 22, 2025

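LLM API management and distribution system supporting mainstream models such as OpenAI, Azure, Anthropic Claude, Google Gemini, DeepSeek, ByteDance Doubao, ChatGLM, ERNIE Bot, iFlytek Spark, Tongyi Qianwen, 360 Zhinao, and Tencent Hunyuan, with unified API adaptation; can be used for key management and redistribution. Single executable, Docker image provided, one-click deployment, works out of the box. LLM API management & k…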

JavaScript 27,896 5,500 Updated Jul 18, 2025

FastGPT is a knowledge-based platform built on LLMs that offers a comprehensive suite of out-of-the-box capabilities such as data processing, RAG retrieval, and visual AI workflow orchestration, le…

TypeScript 26,227 6,746 Updated Nov 7, 2025

A modern download manager that supports all platforms. Built with Golang and Flutter.

Dart 21,577 1,492 Updated Oct 15, 2025

A re-implementation of Meta-Prompt in LangChain for building self-improving agents.

Jupyter Notebook 63 3 Updated Apr 16, 2023

Additional utils and helpers to extend TensorFlow when building recommendation systems, contributed and maintained by SIG Recommenders.

Cuda 626 143 Updated Sep 4, 2025

The official Python library for the OpenAI API

Python 29,204 4,407 Updated Nov 4, 2025

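fastllm is a high-performance large-model inference library with no backend dependencies. It supports both tensor-parallel inference for dense models and mixed-mode inference for MoE models; any GPU with 10 GB or more of memory can run the full DeepSeek model. A dual-socket 9004/9005 server with a single GPU can deploy the original full-precision DeepSeek model at 20 tps for a single request stream; the INT4-quantized model reaches 30 tps for a single stream and 60+ tps with multiple concurrent streams.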

C++ 4,066 412 Updated Oct 28, 2025

✨ Light and Fast AI Assistant. Support: Web | iOS | MacOS | Android | Linux | Windows

TypeScript 86,374 60,855 Updated Oct 27, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 62,115 7,516 Updated Nov 6, 2025

Instruction-tuning tool for large language models (supports FlashAttention)

Python 178 12 Updated Jan 4, 2024

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python 19,672 1,642 Updated Sep 30, 2025

Visualizer for neural network, deep learning and machine learning models

JavaScript 31,754 3,021 Updated Nov 9, 2025

Langchain-Chatchat (formerly Langchain-ChatGLM): RAG and Agent applications built on Langchain and language models such as ChatGLM, Qwen, and Llama | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM, Qwen and…

Python 36,473 6,058 Updated Oct 30, 2025

A high performance memory-bound Go cache

Go 6,562 427 Updated Oct 5, 2025

Tech Enthusiast Weekly, published every Friday

78,587 3,701 Updated Nov 7, 2025

A scalable inference server for models optimized with OpenVINO™

C++ 787 231 Updated Nov 9, 2025

A cross-platform GUI and ETCD client

Vue 520 61 Updated Dec 14, 2022

A flexible, high-performance serving system for machine learning models

C++ 144 20 Updated Nov 24, 2021

The Triton Inference Server provides an optimized cloud and edge inferencing solution.

Python 9,998 1,664 Updated Nov 8, 2025

Connection pool for Go's gRPC client that supports connection reuse.

Go 214 49 Updated Jul 9, 2022