Skip to content
View foreverpiano's full-sized avatar
  • Tsinghua University
  • Beijing, China
  • 02:48 (UTC +08:00)

Highlights

  • Pro

Block or report foreverpiano

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

The absolute trainer to light up AI agents.

Python 1,814 160 Updated Oct 24, 2025

A selective knowledge distillation algorithm for efficient speculative decoders

12 Updated Oct 24, 2025

A scalable asynchronous reinforcement learning implementation with in-flight weight updates.

Python 259 21 Updated Oct 21, 2025

typora-0.11.18 (last free version)

176 48 Updated Feb 18, 2024

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 3,880 291 Updated Oct 24, 2025

一个让 AI 模型在真实市场中进行实盘交易与对抗的实验平台。目标是通过不断迭代,让智能体真正学会在不确定市场中生存和盈利。

Python 233 51 Updated Oct 20, 2025

SkyRL: A Modular Full-stack RL Library for LLMs

Python 1,091 144 Updated Oct 24, 2025

Bridge Megatron-Core to Hugging Face/Reinforcement Learning

Python 142 26 Updated Oct 24, 2025

Allow torch tensor memory to be released and resumed later

Python 156 24 Updated Oct 24, 2025
Python 36 3 Updated Oct 21, 2025

RLP: Reinforcement as a Pretraining Objective

192 13 Updated Oct 5, 2025
Python 247 24 Updated Jul 27, 2025

Post-training with Tinker

Python 1,104 78 Updated Oct 21, 2025

[EMNLP 2025 Demo] PDF scientific paper translation with preserved formats - 基于 AI 完整保留排版的 PDF 文档全文双语翻译,支持 Google/DeepL/Ollama/OpenAI 等服务,提供 CLI/GUI/MCP/Docker/Zotero

Python 29,245 2,585 Updated Oct 20, 2025

chat log tool, easily use your own chat data. 聊天记录工具,轻松使用自己的聊天数据

9,086 1,829 Updated Oct 20, 2025

All-in-One Sandbox for AI Agents that combines Browser, Shell, File, MCP and VSCode Server in a single Docker container.

Python 726 71 Updated Oct 23, 2025

General technology for enabling AI capabilities w/ LLMs and MLLMs

Python 4,156 340 Updated Jun 30, 2025

Excalidraw-CN 是支持中文手写和多画布的 Excalidraw 白板工具。Excalidraw-CN is a whiteboard supporting Chinese hand draw font and multi-canvas based on Excalidraw.

TypeScript 2,242 280 Updated Jan 16, 2024

Generate interactive call graphs for various languages

TypeScript 1,240 57 Updated Aug 24, 2025
Go 63 1 Updated Sep 15, 2025

MCP for xiaohongshu.com

Go 6,312 904 Updated Oct 23, 2025

Code and weights for the paper "Cluster and Predict Latents Patches for Improved Masked Image Modeling"

Python 122 7 Updated Apr 10, 2025

Minimalistic large language model 3D-parallelism training

Python 2,270 251 Updated Sep 3, 2025

Fast, Flexible and Portable Structured Generation

C++ 1,322 93 Updated Oct 20, 2025

Implementation of a methodology that allows all sorts of user defined GPU kernel fusion, for non CUDA programmers.

C++ 25 2 Updated Oct 18, 2025

Trinity-RFT is a general-purpose, flexible and scalable framework designed for reinforcement fine-tuning (RFT) of large language models (LLM).

Python 374 39 Updated Oct 24, 2025

Towards a Unified View of Large Language Model Post-Training

Python 167 8 Updated Sep 8, 2025

Checkpoint-engine is a simple middleware to update model weights in LLM inference engines

Python 784 58 Updated Oct 20, 2025

Tile-based language built for AI computation across all scales

C++ 71 2 Updated Oct 24, 2025
Next