vllm Public
Forked from vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Python Apache License 2.0 Updated Nov 6, 2025
pingora Public
Forked from cloudflare/pingora
A library for building fast, reliable and evolvable network services.
Rust Apache License 2.0 Updated Jul 9, 2025
CUDALibrarySamples Public
Forked from NVIDIA/CUDALibrarySamples
CUDA Library Samples
Cuda Other Updated Jul 8, 2025
nano-vllm Public
Forked from GeeeekExplorer/nano-vllm
Nano vLLM
Python MIT License Updated Jun 18, 2025
DeepEP Public
Forked from deepseek-ai/DeepEP
DeepEP: an efficient expert-parallel communication library
Cuda MIT License Updated May 9, 2025
cudarc Public
Forked from coreylowman/cudarc
Safe Rust wrapper around the CUDA toolkit
Rust Apache License 2.0 Updated May 7, 2025
Megatron-LM Public
Forked from NVIDIA/Megatron-LM
Ongoing research training transformer models at scale
Python Other Updated Apr 24, 2025
skypilot Public
Forked from skypilot-org/skypilot
SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 12+ clouds). Get unified execution, cost savings, and high GPU availability via a simple interface.
Python Apache License 2.0 Updated Dec 12, 2024
deep-learning-containers Public
Forked from aws/deep-learning-containers
AWS Deep Learning Containers are pre-built Docker images that make it easier to run popular deep learning frameworks and tools on AWS.
Python Other Updated Dec 11, 2024
optimum-benchmark Public
Forked from huggingface/optimum-benchmark
🏋️ A unified multi-backend utility for benchmarking Transformers, Timm, PEFT, Diffusers and Sentence-Transformers with full support of Optimum's hardware optimizations & quantization schemes.
Python Apache License 2.0 Updated Nov 25, 2024
py-txi Public
Forked from IlyasMoutawwakil/py-txi
A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.
Python Apache License 2.0 Updated Nov 19, 2024
self-llm Public
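Forked from datawhalechina/self-llm
"A Usage Guide to Open-Source Large Models": quickly deploy open-source large models in a Linux environment; a deployment tutorial better suited to Chinese users
Jupyter Notebook Apache License 2.0 Updated Sep 23, 2024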
LLM-Dojo Public
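Forked from mst272/LLM-Dojo
Welcome to LLM-Dojo, an open-source place for learning about large models. It uses concise, readable code to build a model training framework (supporting mainstream models such as Qwen, Llama, GLM, and others), an RLHF framework (DPO/CPO/KTO/PPO), and other features. 👩🎓👨🎓
Python Updated Sep 20, 2024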
kueue Public
Forked from kubernetes-sigs/kueue
Kubernetes-native Job Queueing
Go Apache License 2.0 Updated Sep 19, 2024
Firefly Public
Forked from yangjianxin1/Firefly
Firefly: a large-model training tool that supports training Qwen2.5, Qwen2, Yi1.5, Phi-3, Llama3, Gemma, MiniCPM, Yi, Deepseek, Orion, Xverse, Mixtral-8x7B, Zephyr, Mistral, Baichuan2, Llama2, Llama, Qwen, Baichuan, ChatGLM2, InternLM, Ziya2, Vicuna, Bloom, and other large models
Python Updated Sep 19, 2024
lws Public
Forked from kubernetes-sigs/lws
LeaderWorkerSet: An API for deploying a group of pods as a unit of replication
Go Apache License 2.0 Updated Sep 16, 2024
dify Public
Forked from langgenius/dify
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…
TypeScript Other Updated Aug 29, 2024
github-workflow-as-kube Public
Forked from kerthcet/github-workflow-as-kube
Following the same workflows as Kubernetes.
Apache License 2.0 Updated Aug 16, 2024
llmaz Public
Forked from InftyAI/llmaz
☸️ Effortlessly operating LLMs on Kubernetes, e.g. Serving.
Go Apache License 2.0 Updated Aug 5, 2024
go-gin-boilerplate Public template
Forked from vsouza/go-gin-boilerplate
A starter project with Golang, Gin and DynamoDB
Go Updated Jun 26, 2024
tuning_playbook Public
Forked from google-research/tuning_playbook
A playbook for systematically maximizing the performance of deep learning models.
Other Updated Jun 18, 2024
mpi-operator Public
Forked from kubeflow/mpi-operator
Kubernetes Operator for MPI-based applications (distributed training, HPC, etc.)
Go Apache License 2.0 Updated May 1, 2024
gin-boilerplate Public
Forked from Massad/gin-boilerplate
The fastest way to deploy a RESTful API with the Gin framework, using a structured project that defaults to a PostgreSQL database and JWT authentication middleware stored in Redis
Go MIT License Updated Mar 13, 2024
kubekey Public
Forked from kubesphere/kubekey
Install Kubernetes/K3s only, both Kubernetes/K3s and KubeSphere, and related cloud-native add-ons; it supports all-in-one, multi-node, and HA 🔥 ⎈ 🐳
Go Apache License 2.0 Updated Jan 22, 2024
kube-scheduler-simulator Public
Forked from kubernetes-sigs/kube-scheduler-simulator
The simulator for the Kubernetes scheduler
Go Apache License 2.0 Updated Dec 1, 2023
operator Public
Forked from tektoncd/operator
Kubernetes operator to manage installation, updation and uninstallation of tektoncd projects (pipeline, …)
Go Apache License 2.0 Updated May 31, 2023
diffusers Public
Forked from huggingface/diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
Python Apache License 2.0 Updated Mar 6, 2023
memberlist Public
Forked from hashicorp/memberlist
Golang package for gossip based membership and failure detection
Go Mozilla Public License 2.0 Updated Feb 6, 2023