Highlights
- Pro
-
sglang Public
Forked from sgl-project/sglangSGLang is a fast serving framework for large language models and vision language models.
-
slime Public
Forked from THUDM/slimeslime is a LLM post-training framework aiming at scaling RL.
Python Apache License 2.0 UpdatedNov 10, 2025 -
sentry-dart Public
Forked from getsentry/sentry-dartSentry SDK for Dart and Flutter
-
flutter_rust_bridge Public
Flutter/Dart <-> Rust binding generator, feature-rich, but seamless and simple.
-
flashinfer Public
Forked from flashinfer-ai/flashinferFlashInfer: Kernel Library for LLM Serving
-
flutter_convenient_test Public
Write and debug tests easily, with full action history, time travel, screenshots, rapid re-execution, video records, interactivity, isolation and more
-
torch_memory_saver Public
Allow torch tensor memory to be released and resumed later
-
DeepEP Public
Forked from deepseek-ai/DeepEPDeepEP: an efficient expert-parallel communication library
-
transformers Public
Forked from huggingface/transformers🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Python Apache License 2.0 UpdatedOct 23, 2025 -
TransformerEngine Public
Forked from NVIDIA/TransformerEngineA library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper, Ada and Blackwell GPUs, to provide better performance with lower memory…
Python Apache License 2.0 UpdatedOct 20, 2025 -
triton Public
Forked from triton-lang/tritonDevelopment repository for the Triton language and compiler
MLIR MIT License UpdatedOct 20, 2025 -
SpecForge Public
Forked from sgl-project/SpecForgeTrain speculative decoding models effortlessly and port them smoothly to SGLang serving.
-
flutter_smooth Public
Achieve ~60 FPS, no matter how heavy the tree is to build/layout
-
LongBench Public
Forked from Fridge003/LongBenchLongBench v2 and LongBench (ACL 25'&24')
Python MIT License UpdatedSep 27, 2025 -
-
torch_utils Public
Utility scripts for PyTorch (e.g. Make Perfetto show some disappearing kernels, Memory profiler that understands more low-level allocations such as NCCL, ...)
-
flutter_portal Public
Evolved Overlay/OverlayEntry - declarative not imperative, intuitive-context, and easy-alignment
-
cutlass Public
Forked from NVIDIA/cutlassCUDA Templates for Linear Algebra Subroutines
C++ Other UpdatedAug 26, 2025 -
DeepGEMM Public
Forked from deepseek-ai/DeepGEMMDeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
Cuda MIT License UpdatedAug 13, 2025 -
dart_interactive Public
REPL (interactive shell) for Dart, supporting 3rd party packages, hot reload, and full grammar
-
gpt-oss Public
Forked from openai/gpt-ossgpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI
Python Apache License 2.0 UpdatedAug 10, 2025 -
-
NeMo-Skills Public
Forked from NVIDIA-NeMo/SkillsA project to improve skills of large language models
Python Apache License 2.0 UpdatedJul 30, 2025 -
rl_visualizer Public
Visualize and post-hoc analyze RL training for debugging and understanding
-
kimina-lean-server Public template
Forked from project-numina/kimina-lean-serverKimina Lean server
Python MIT License UpdatedJul 19, 2025 -
verl Public
Forked from volcengine/verlveRL: Volcano Engine Reinforcement Learning for LLM
Python Apache License 2.0 UpdatedJul 13, 2025 -
indicatif-log-bridge Public
Forked from djugei/indicatif-log-bridgebridges the log crate and indicatif to stop the progress bars and log lines from mixing up
-
debug-print Public
Forked from flashinfer-ai/debug-printDebug print operator for cudagraph debugging
-
TensorRT-LLM Public
Forked from NVIDIA/TensorRT-LLMTensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and support state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorR…
C++ Apache License 2.0 UpdatedJun 13, 2025 -
Mooncake Public
Forked from kvcache-ai/MooncakeMooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
C++ Apache License 2.0 UpdatedJun 13, 2025