Skip to content
View muma378's full-sized avatar
  • Shanghai
  • 14:55 (UTC +08:00)

Block or report muma378

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with OpenTelemetry, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23

TypeScript 18,201 1,753 Updated Nov 12, 2025

Benchmark and optimize LLM inference across frameworks with ease

Python 130 13 Updated Sep 12, 2025

⚡️SwanLab - an open-source, modern-design AI training tracking and visualization tool. Supports Cloud / Self-hosted use. Integrated with PyTorch / Transformers / LLaMA Factory / veRL/ Swift / Ultra…

Python 3,070 162 Updated Nov 9, 2025

Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!

Python 40,490 3,127 Updated Nov 12, 2025

🚀 One-stop solution for creating your digital avatar from chat history 💡 Fine-tune LLMs with your chat logs to capture your unique style, then bind to a chatbot to bring your digital self to life. …

Python 15,730 1,249 Updated Nov 3, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 62,287 7,537 Updated Nov 12, 2025

AISystem 主要是指AI系统,包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术

Jupyter Notebook 15,576 2,236 Updated Sep 3, 2025

The best ChatGPT that $100 can buy.

Python 36,439 4,367 Updated Nov 5, 2025

Persist and reuse KV Cache to speedup your LLM.

Python 118 38 Updated Nov 12, 2025

OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team co…

TypeScript 7,947 1,515 Updated Nov 12, 2025

hpc 教程,包含集合通信(mpi、nccl)、cuda 编程、向量化 SIMD、RDMA 通信等

Cuda 60 11 Updated Oct 9, 2025

Intelligent Router for Mixture-of-Models

Rust 2,226 288 Updated Nov 12, 2025

NCCL Tests

Cuda 1,330 329 Updated Nov 3, 2025

An open-source, next-generation "runc" that empowers rootless containers to run workloads such as Systemd, Docker, Kubernetes, just like VMs.

Shell 3,299 189 Updated Nov 3, 2025

💖🧸 Self hosted, you owned Grok Companion, a container of souls of waifu, cyber livings to bring them into our worlds, wishing to achieve Neuro-sama's altitude. Capable of realtime voice chat, Minec…

Vue 15,580 1,405 Updated Nov 8, 2025

Minimalistic 4D-parallelism distributed training framework for education purpose

Python 1,891 145 Updated Aug 26, 2025

FlashMLA: Efficient Multi-head Latent Attention Kernels

C++ 11,858 899 Updated Sep 30, 2025

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 5,876 739 Updated Oct 15, 2025

Next Generation Agentic Proxy for AI Agents and MCP servers

Rust 1,280 189 Updated Nov 8, 2025

CUDA Templates and Python DSLs for High-Performance Linear Algebra

C++ 8,763 1,523 Updated Nov 10, 2025

Reference implementations of MLPerf® inference benchmarks

Python 1,483 589 Updated Nov 11, 2025

Lightweight coding agent that runs in your terminal

Rust 50,291 6,253 Updated Nov 12, 2025

Multi-agent collaboration framework

Python 1,668 238 Updated Nov 11, 2025

Build memory-native AI agents with Memory OS — an open-source framework for long-term memory, retrieval, and adaptive learning in large language models. Agent Memory | Memory System | Memory Manage…

Python 2,959 258 Updated Nov 11, 2025

A streamlined and customizable framework for efficient large model (LLM, VLM, AIGC) evaluation and performance benchmarking.

Python 1,927 220 Updated Nov 11, 2025

利用AI大模型,一键生成高清短视频 Generate short videos with one click using AI LLM.

Python 47,683 6,666 Updated Jun 11, 2025

Achieve state of the art inference performance with modern accelerators on Kubernetes

Shell 2,025 229 Updated Nov 11, 2025

Curated list of datasets and tools for post-training.

3,894 321 Updated Nov 10, 2025

A quick guide (especially) for trending instruction finetuning datasets

3,304 222 Updated Nov 28, 2023

Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.

Python 2,927 221 Updated Nov 10, 2025
Next