Stars
AI Native Data App Development framework with AWEL (Agentic Workflow Expression Language) and Agents
Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with …
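A minimal sketch of the "change a single line" pattern this describes: pointing the standard OpenAI Python client at a locally running, OpenAI-compatible Xinference endpoint. The base URL, port, and model name below are placeholders/assumptions, not documented defaults.

```python
# Sketch: reuse the official OpenAI client against a local Xinference server
# by swapping only the base_url. Endpoint and model name are assumptions.
from openai import OpenAI

client = OpenAI(
    base_url="http://localhost:9997/v1",  # hypothetical local Xinference endpoint
    api_key="not-used",                   # local servers typically ignore the key
)

response = client.chat.completions.create(
    model="qwen2-instruct",               # whichever model you launched in Xinference
    messages=[{"role": "user", "content": "Hello!"}],
)
print(response.choices[0].message.content)
```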
Distributed reliable key-value store for the most critical data of a distributed system
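For context, a hedged sketch of talking to such a key-value store from Python using the third-party python-etcd3 client (an assumption, not part of the etcd project itself); host and port are the usual defaults but depend on your deployment.

```python
# Sketch: read/write etcd from Python via the third-party python-etcd3 client
# (pip install etcd3). Host/port are assumptions about the deployment.
import etcd3

etcd = etcd3.client(host="localhost", port=2379)
etcd.put("/config/feature_flag", "enabled")   # write a key
value, metadata = etcd.get("/config/feature_flag")
print(value.decode())                          # -> "enabled"
```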
The Triton TensorRT-LLM Backend
Optimize Qwen1.5 models with TensorRT-LLM
Retrieval and Retrieval-augmented LLMs
RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context layer for LLMs
Production-ready platform for agentic workflow development.
LLM API management & distribution system supporting mainstream models such as OpenAI, Azure, Anthropic Claude, Google Gemini, DeepSeek, ByteDance Doubao, ChatGLM, 文心一言, 讯飞星火, 通义千问, 360 智脑, and Tencent Hunyuan, with unified API adaptation; usable for key management and secondary distribution. Ships as a single executable with a Docker image for one-click deployment, ready to use out of the box.
FastGPT is a knowledge-based platform built on LLMs that offers a comprehensive suite of out-of-the-box capabilities such as data processing, RAG retrieval, and visual AI workflow orchestration, le…
A modern download manager that supports all platforms. Built with Golang and Flutter.
A re-implementation of Meta-Prompt in LangChain for building self-improving agents.
Additional utils and helpers to extend TensorFlow when building recommendation systems, contributed and maintained by SIG Recommenders.
The official Python library for the OpenAI API
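A basic, hedged usage example with the current (v1-style) client; the API key is read from the environment and the model name is a placeholder.

```python
# Basic chat completion with the official OpenAI Python client.
# Assumes OPENAI_API_KEY is set; the model name is a placeholder.
from openai import OpenAI

client = OpenAI()  # picks up OPENAI_API_KEY automatically

completion = client.chat.completions.create(
    model="gpt-4o-mini",
    messages=[
        {"role": "system", "content": "You are a concise assistant."},
        {"role": "user", "content": "Summarize what a connection pool does."},
    ],
)
print(completion.choices[0].message.content)
```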
fastllm is a high-performance large-model inference library with no backend dependencies. It supports both tensor-parallel inference of dense models and mixed-mode inference of MoE models; any GPU with 10 GB+ of memory can run the full DeepSeek model. A dual-socket 9004/9005 server plus a single GPU can serve the original full-precision, full-size DeepSeek model at 20 tps with single concurrency; the INT4-quantized model reaches 30 tps with single concurrency and 60+ tps under concurrent load.
✨ Light and Fast AI Assistant. Support: Web | iOS | MacOS | Android | Linux | Windows
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
The official repo of Qwen (通义千问), the chat and pretrained large language models proposed by Alibaba Cloud.
Visualizer for neural network, deep learning and machine learning models
Langchain-Chatchat (formerly Langchain-ChatGLM): RAG and Agent applications built on Langchain and language models such as ChatGLM, Qwen, and Llama, for local knowledge-based LLM question answering.
A scalable inference server for models optimized with OpenVINO™
A flexible, high-performance serving system for machine learning models
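TensorFlow Serving exposes a REST predict endpoint alongside gRPC; a short sketch of calling it with `requests`, where the port, model name, and input shape are assumptions about how the server was started.

```python
# Sketch: call TensorFlow Serving's REST predict endpoint. Assumes the server
# was started with --rest_api_port=8501 and serves a model named "my_model";
# both names and the input shape are placeholders.
import requests

payload = {"instances": [[1.0, 2.0, 5.0]]}  # must match the model's input signature
resp = requests.post(
    "http://localhost:8501/v1/models/my_model:predict",
    json=payload,
    timeout=10,
)
resp.raise_for_status()
print(resp.json()["predictions"])
```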
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
Connection pool for Go's gRPC client that supports connection reuse.