NIVIC
HeFei
-
sglang Public
Forked from sgl-project/sglang
SGLang is a fast serving framework for large language models and vision language models.
-
Mooncake Public
Forked from kvcache-ai/Mooncake
Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
C++ · Apache License 2.0 · Updated Nov 28, 2025
-
FlashMLA Public
Forked from deepseek-ai/FlashMLA
FlashMLA: Efficient Multi-head Latent Attention Kernels
C++ · MIT License · Updated Nov 26, 2025
-
awesome-sglang Public
Forked from ShangmingCai/awesome-sglang
Make SGLang go brrr
Creative Commons Zero v1.0 Universal · Updated Nov 19, 2025
-
dify Public
Forked from langgenius/dify
Production-ready platform for agentic workflow development.
TypeScript · Other · Updated Oct 21, 2025
-
go-openai Public
Forked from sashabaranov/go-openai
OpenAI ChatGPT, GPT-3, GPT-4, DALL·E, Whisper API wrapper for Go
-
DeepEP Public
Forked from deepseek-ai/DeepEP
DeepEP: an efficient expert-parallel communication library
Cuda · MIT License · Updated May 1, 2025
-
splitwise-demos Public
Forked from coding-rw/splitwise-demos
Python · Apache License 2.0 · Updated Mar 5, 2025
-
lws Public
Forked from kubernetes-sigs/lws
LeaderWorkerSet: An API for deploying a group of pods as a unit of replication
Go · Apache License 2.0 · Updated Feb 14, 2025
-
ollama Public
Forked from ollama/ollama
Get up and running with Llama 3.2, Mistral, Gemma 2, and other large language models.
Go · MIT License · Updated Dec 5, 2024
-
one-api Public
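Forked from songquanpeng/one-api
OpenAI API management and distribution system supporting Azure, Anthropic Claude, Google PaLM 2 & Gemini, Zhipu ChatGLM, Baidu ERNIE Bot (Wenxin Yiyan), iFlytek Spark, Alibaba Tongyi Qianwen, 360 Zhinao, and Tencent Hunyuan. It can be used to manage and redistribute keys, ships as a single executable with a prebuilt Docker image, and deploys in one click, ready to use out of the box. OpenAI key management & redistributi…
JavaScript · MIT License · Updated Nov 30, 2024
-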
LLaMA-Factory Public
Forked from hiyouga/LlamaFactory
Unify Efficient Fine-Tuning of 100+ LLMs
Python · Apache License 2.0 · Updated Sep 19, 2024
-
openai-kit Public
Forked from dylanshine/openai-kit
A community Swift package used to interact with the OpenAI API
Swift · MIT License · Updated Aug 26, 2024
-
bert_for_longer_texts Public
Forked from mim-solutions/bert_for_longer_texts
BERT classification model for processing texts longer than 512 tokens. Text is first divided into smaller chunks and after feeding them to BERT, intermediate results are pooled. The implementation …
Python · Other · Updated Jun 19, 2024
-
agency-swarm Public
Forked from VRSEN/agency-swarm
An opensource agent orchestration framework built on top of the latest OpenAI Assistants API.
Python · MIT License · Updated Apr 27, 2024
-
star-all-repos Public
Forked from SyMind/star-all-repos
🤩 Darling, just click twice and I can trick you out of all your Stars!
-
autogen Public
Forked from microsoft/autogen
Enable Next-Gen Large Language Model Applications. Join our Discord: https://discord.gg/pAbnFJrkgZ
-
local_function_calling Public
Forked from puppetm4st3r/local_function_calling
This repository contains a Python implementation that allows you to use gorilla-llm/gorilla-openfunctions-v2 LLM to perform function calling using the OpenAI protocol. It provides a way to extend t…
Python · MIT License · Updated Apr 7, 2024
-
gh-proxy Public
Forked from hunshcn/gh-proxy
An acceleration proxy for GitHub releases, archives, and project files
-
hansoldeco-starcoder2-finetune-15b Public
Forked from thstmddns/hansoldeco-starcoder2-finetune-15b
-
github-release Public
Forked from scikit-build/github-release
Manage github releases from the command line
-
argo-workflows Public
Forked from argoproj/argo-workflows
Workflow engine for Kubernetes
-
S-LoRA Public
Forked from S-LoRA/S-LoRA
S-LoRA: Serving Thousands of Concurrent LoRA Adapters
-
k8s-vgpu-scheduler Public
Forked from Project-HAMi/HAMi
OpenAIOS vGPU scheduler for Kubernetes originated from the OpenAIOS project to virtualize GPU device memory.
-
marp-cli Public
Forked from marp-team/marp-cli
A CLI interface for Marp and Marpit based converters