muma378

Xiao Yang muma378

Shanghai
14:55 (UTC +08:00)

Achievements

Stars

langfuse / langfuse

🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with OpenTelemetry, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23

TypeScript 18,201 1,753 Updated Nov 12, 2025

bentoml / llm-optimizer

Benchmark and optimize LLM inference across frameworks with ease

Python 130 13 Updated Sep 12, 2025

SwanHubX / SwanLab

⚡️SwanLab - an open-source, modern-design AI training tracking and visualization tool. Supports Cloud / Self-hosted use. Integrated with PyTorch / Transformers / LLaMA Factory / veRL/ Swift / Ultra…

Python 3,070 162 Updated Nov 9, 2025

gradio-app / gradio

Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!

Python 40,490 3,127 Updated Nov 12, 2025

xming521 / WeClone

🚀 One-stop solution for creating your digital avatar from chat history 💡 Fine-tune LLMs with your chat logs to capture your unique style, then bind to a chatbot to bring your digital self to life. …

Python 15,730 1,249 Updated Nov 3, 2025

hiyouga / LLaMA-Factory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 62,287 7,537 Updated Nov 12, 2025

Infrasys-AI / AISystem

AISystem 主要是指AI系统，包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术

Jupyter Notebook 15,576 2,236 Updated Sep 3, 2025

karpathy / nanochat

The best ChatGPT that $100 can buy.

Python 36,439 4,367 Updated Nov 5, 2025

ModelEngine-Group / unified-cache-management

Persist and reuse KV Cache to speedup your LLM.

Python 118 38 Updated Nov 12, 2025

open-metadata / OpenMetadata

OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team co…

TypeScript 7,947 1,515 Updated Nov 12, 2025

jinbooooom / ai-infra-hpc

hpc 教程，包含集合通信(mpi、nccl)、cuda 编程、向量化 SIMD、RDMA 通信等

Cuda 60 11 Updated Oct 9, 2025

vllm-project / semantic-router

Intelligent Router for Mixture-of-Models

Rust 2,226 288 Updated Nov 12, 2025

NVIDIA / nccl-tests

NCCL Tests

Cuda 1,330 329 Updated Nov 3, 2025

nestybox / sysbox

An open-source, next-generation "runc" that empowers rootless containers to run workloads such as Systemd, Docker, Kubernetes, just like VMs.

Shell 3,299 189 Updated Nov 3, 2025

moeru-ai / airi

💖🧸 Self hosted, you owned Grok Companion, a container of souls of waifu, cyber livings to bring them into our worlds, wishing to achieve Neuro-sama's altitude. Capable of realtime voice chat, Minec…

Vue 15,580 1,405 Updated Nov 8, 2025

huggingface / picotron

Minimalistic 4D-parallelism distributed training framework for education purpose

Python 1,891 145 Updated Aug 26, 2025

deepseek-ai / FlashMLA

FlashMLA: Efficient Multi-head Latent Attention Kernels

C++ 11,858 899 Updated Sep 30, 2025

deepseek-ai / DeepGEMM

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 5,876 739 Updated Oct 15, 2025

agentgateway / agentgateway

Next Generation Agentic Proxy for AI Agents and MCP servers

Rust 1,280 189 Updated Nov 8, 2025

NVIDIA / cutlass

CUDA Templates and Python DSLs for High-Performance Linear Algebra

C++ 8,763 1,523 Updated Nov 10, 2025

mlcommons / inference

Reference implementations of MLPerf® inference benchmarks

Python 1,483 589 Updated Nov 11, 2025

openai / codex

Lightweight coding agent that runs in your terminal

Rust 50,291 6,253 Updated Nov 12, 2025

jd-opensource / OxyGent

Multi-agent collaboration framework

Python 1,668 238 Updated Nov 11, 2025

MemTensor / MemOS

Build memory-native AI agents with Memory OS — an open-source framework for long-term memory, retrieval, and adaptive learning in large language models. Agent Memory | Memory System | Memory Manage…

Python 2,959 258 Updated Nov 11, 2025

modelscope / evalscope

A streamlined and customizable framework for efficient large model (LLM, VLM, AIGC) evaluation and performance benchmarking.

Python 1,927 220 Updated Nov 11, 2025

harry0703 / MoneyPrinterTurbo

利用AI大模型，一键生成高清短视频 Generate short videos with one click using AI LLM.

Python 47,683 6,666 Updated Jun 11, 2025

llm-d / llm-d

Achieve state of the art inference performance with modern accelerators on Kubernetes

Shell 2,025 229 Updated Nov 11, 2025

mlabonne / llm-datasets

Curated list of datasets and tools for post-training.

3,894 321 Updated Nov 10, 2025

Zjh-819 / LLMDataHub

A quick guide (especially) for trending instruction finetuning datasets

3,304 222 Updated Nov 28, 2023

argilla-io / distilabel

Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.

Python 2,927 221 Updated Nov 10, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Xiao Yang muma378

Achievements

Achievements

Block or report muma378

Stars

langfuse / langfuse

bentoml / llm-optimizer

SwanHubX / SwanLab

gradio-app / gradio

xming521 / WeClone

hiyouga / LLaMA-Factory

Infrasys-AI / AISystem

karpathy / nanochat

ModelEngine-Group / unified-cache-management

open-metadata / OpenMetadata

jinbooooom / ai-infra-hpc

vllm-project / semantic-router

NVIDIA / nccl-tests

nestybox / sysbox

moeru-ai / airi

huggingface / picotron

deepseek-ai / FlashMLA

deepseek-ai / DeepGEMM

agentgateway / agentgateway

NVIDIA / cutlass

mlcommons / inference

openai / codex

jd-opensource / OxyGent

MemTensor / MemOS

modelscope / evalscope

harry0703 / MoneyPrinterTurbo

llm-d / llm-d

mlabonne / llm-datasets

Zjh-819 / LLMDataHub

argilla-io / distilabel