- Shanghai
-
14:55
(UTC +08:00)
Stars
🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with OpenTelemetry, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23
Benchmark and optimize LLM inference across frameworks with ease
⚡️SwanLab - an open-source, modern-design AI training tracking and visualization tool. Supports Cloud / Self-hosted use. Integrated with PyTorch / Transformers / LLaMA Factory / veRL/ Swift / Ultra…
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
🚀 One-stop solution for creating your digital avatar from chat history 💡 Fine-tune LLMs with your chat logs to capture your unique style, then bind to a chatbot to bring your digital self to life. …
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
AISystem 主要是指AI系统,包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术
Persist and reuse KV Cache to speedup your LLM.
OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team co…
hpc 教程,包含集合通信(mpi、nccl)、cuda 编程、向量化 SIMD、RDMA 通信等
Intelligent Router for Mixture-of-Models
An open-source, next-generation "runc" that empowers rootless containers to run workloads such as Systemd, Docker, Kubernetes, just like VMs.
💖🧸 Self hosted, you owned Grok Companion, a container of souls of waifu, cyber livings to bring them into our worlds, wishing to achieve Neuro-sama's altitude. Capable of realtime voice chat, Minec…
Minimalistic 4D-parallelism distributed training framework for education purpose
FlashMLA: Efficient Multi-head Latent Attention Kernels
DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
Next Generation Agentic Proxy for AI Agents and MCP servers
CUDA Templates and Python DSLs for High-Performance Linear Algebra
Reference implementations of MLPerf® inference benchmarks
Lightweight coding agent that runs in your terminal
Build memory-native AI agents with Memory OS — an open-source framework for long-term memory, retrieval, and adaptive learning in large language models. Agent Memory | Memory System | Memory Manage…
A streamlined and customizable framework for efficient large model (LLM, VLM, AIGC) evaluation and performance benchmarking.
利用AI大模型,一键生成高清短视频 Generate short videos with one click using AI LLM.
Achieve state of the art inference performance with modern accelerators on Kubernetes
Curated list of datasets and tools for post-training.
A quick guide (especially) for trending instruction finetuning datasets
Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.