KaiLv69

A python module to repair invalid JSON from LLMs

Python 3,946 154 Updated Nov 11, 2025

Official implementation for our paper: Repurposing AlphaFold3-like Protein Folding Models for Antibody Sequence and Structure Co-design

Python 5 1 Updated Oct 27, 2025

[NIPS 2025] Mixing Expert Knowledge: Bring Human Thoughts Back to The Game of Go. Our model was originally named InternThinker-Go and is called LoGos in our paper.

Python 4 1 Updated Oct 15, 2025

Code for the paper "BAPO: Stabilizing Off-Policy Reinforcement Learning for LLMs via Balanced Policy Optimization with Adaptive Clipping" by Zhiheng Xi et al.

Python 78 4 Updated Oct 25, 2025

Official Implementation of FastMCTS: A Simple Sampling Strategy for Data Synthesis

Python 108 12 Updated Jul 2, 2025

Pre-trained, Scalable, High-performance Reward Models via Policy Discriminative Learning.

Python 160 2 Updated Sep 23, 2025

Embodied-Planner-R1: Unleashing Embodied Task Planning Ability in LLMs via Reinforcement Learning

Python 20 1 Updated Sep 22, 2025

LongLLaDA: Unlocking Long Context Capabilities in Diffusion LLMs

Python 44 5 Updated Jul 23, 2025

[ICLR 2025] ReAttention, a training-free approach to break the maximum context length in length extrapolation

Python 13 Updated Oct 6, 2025

MOSS-TTSD is a spoken dialogue generation model that enables expressive dialogue speech synthesis in both Chinese and English, supporting zero-shot multi-speaker voice cloning, and long-form speech…

Python 1,024 88 Updated Nov 4, 2025

Scaling RL on advanced reasoning models

Python 630 39 Updated Oct 20, 2025
Python 9 1 Updated May 23, 2025
Python 322 24 Updated Aug 29, 2025

🚀 Efficient implementations of state-of-the-art linear attention models

Python 3,825 299 Updated Nov 12, 2025

An Open-source RL System from ByteDance Seed and Tsinghua AIR

Python 1,628 72 Updated May 11, 2025

Performant framework for training, analyzing and visualizing Sparse Autoencoders (SAEs) and their frontier variants.

Python 163 21 Updated Nov 12, 2025

[ICLR 2025] PEARL: Parallel Speculative Decoding with Adaptive Draft Length

Python 128 7 Updated Oct 29, 2025

DuoDecoding: Hardware-aware Heterogeneous Speculative Decoding with Dynamic Multi-Sequence Drafting

Python 17 3 Updated Mar 4, 2025

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

7,931 286 Updated May 15, 2025

A survey of long-context LLMs from four perspectives: architecture, infrastructure, training, and evaluation

60 1 Updated Mar 31, 2025

GPT-4o-level, real-time spoken dialogue system.

Python 360 28 Updated Jan 27, 2025

Spec-Bench: A Comprehensive Benchmark and Unified Evaluation Platform for Speculative Decoding (ACL 2024 Findings)

Python 331 41 Updated Apr 22, 2025

Mirage Persistent Kernel: Compiling LLMs into a MegaKernel

C++ 1,942 151 Updated Nov 12, 2025

A collection of benchmarks and datasets for evaluating LLMs.

527 32 Updated Jul 13, 2024

[NeurIPS 2024] Can Language Models Learn to Skip Steps?

Python 20 Updated Jan 25, 2025

[ICLR'25] Data and code for our paper "Why Does the Effective Context Length of LLMs Fall Short?"

Python 78 4 Updated Nov 25, 2024

O1 Replication Journey

2,003 63 Updated Jan 14, 2025

[Findings of EMNLP'2024] Unified Active Retrieval for Retrieval Augmented Generation

Python 23 Updated Sep 30, 2024