KaiLv69

Follow

Kai Lv KaiLv69

Follow

46 followers · 73 following

https://kailv69.github.io/

Achievements

Achievements

Lists (3)

Sort

LLM

Large language model

数据集

框架

Stars

MoonshotAI / Kimi-Linear

1,146 49 Updated Oct 31, 2025

mangiucugna / json_repair

A python module to repair invalid JSON from LLMs

Python 3,946 154 Updated Nov 11, 2025

yangnianzu0515 / MFDesign

Official implementation for our paper: Repurposing AlphaFold3-like Protein Folding Models for Antibody Sequence and Structure Co-design

Python 5 1 Updated Oct 27, 2025

Entarochuan / InternGo-LoGos

[NIPS 2025] Mixing Expert Knowledge: Bring Human Thoughts Back to The Game of Go. Our model is originally named InternThinker-Go, and called LoGos in our paper.

Python 4 1 Updated Oct 15, 2025

WooooDyy / BAPO

Codes for the paper "BAPO: Stabilizing Off-Policy Reinforcement Learning for LLMs via Balanced Policy Optimization with Adaptive Clipping" by Zhiheng Xi et al.

Python 78 4 Updated Oct 25, 2025

FlyingDutchman26 / FastMCTS

Official Implementation of FastMCTS: A Simple Sampling Strategy for Data Synthesis

Python 108 12 Updated Jul 2, 2025

InternLM / POLAR

Pre-trained, Scalable, High-performance Reward Models via Policy Discriminative Learning.

Python 160 2 Updated Sep 23, 2025

OpenMOSS / Embodied-Planner-R1

Embodied-Planner-R1: Unleashing Embodied Task Planning Ability in LLMs via Reinforcement Learning

Python 20 1 Updated Sep 22, 2025

OpenMOSS / LongLLaDA

LongLLaDA: Unlocking Long Context Capabilities in Diffusion LLMs

Python 44 5 Updated Jul 23, 2025

OpenMOSS / ReAttention

[ICLR2025] ReAttention, a training-free approach to break the maximum context length in length extrapolation

Python 13 Updated Oct 6, 2025

OpenMOSS / MOSS-TTSD

MOSS-TTSD is a spoken dialogue generation model that enables expressive dialogue speech synthesis in both Chinese and English, supporting zero-shot multi-speaker voice cloning, and long-form speech…

Python 1,024 88 Updated Nov 4, 2025

ChenxinAn-fdu / POLARIS

Scaling RL on advanced reasoning models

Python 630 39 Updated Oct 20, 2025

ayyyq / llm-retraction

Python 9 1 Updated May 23, 2025

InternLM / InternBootcamp

Python 322 24 Updated Aug 29, 2025

sustcsonglin / linear-attention-and-beyond-slides

TeX 95 2 Updated Feb 25, 2025

fla-org / flash-linear-attention

🚀 Efficient implementations of state-of-the-art linear attention models

Python 3,825 299 Updated Nov 12, 2025

BytedTsinghua-SIA / DAPO

An Open-source RL System from ByteDance Seed and Tsinghua AIR

Python 1,628 72 Updated May 11, 2025

OpenMOSS / Language-Model-SAEs

Performant framework for training, analyzing and visualizing Sparse Autoencoders (SAEs) and their frontier variants.

Python 163 21 Updated Nov 12, 2025

smart-lty / ParallelSpeculativeDecoding

[ICLR 2025] PEARL: Parallel Speculative Decoding with Adaptive Draft Length

Python 128 7 Updated Oct 29, 2025

KaiLv69 / DuoDecoding

DuoDecoding: Hardware-aware Heterogeneous Speculative Decoding with Dynamic Multi-Sequence Drafting

Python 17 3 Updated Mar 4, 2025

deepseek-ai / open-infra-index

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

7,931 286 Updated May 15, 2025

OpenMOSS / Thus-Spake-Long-Context-LLM

a survey of long-context LLMs from four perspectives, architecture, infrastructure, training, and evaluation

60 1 Updated Mar 31, 2025

OpenMOSS / SpeechGPT-2.0-preview

GPT-4o-level, real-time spoken dialogue system.

Python 360 28 Updated Jan 27, 2025

hemingkx / Spec-Bench

Spec-Bench: A Comprehensive Benchmark and Unified Evaluation Platform for Speculative Decoding (ACL 2024 Findings)

Python 331 41 Updated Apr 22, 2025

mirage-project / mirage

Mirage Persistent Kernel: Compiling LLMs into a MegaKernel

C++ 1,942 151 Updated Nov 12, 2025

leobeeson / llm_benchmarks

A collection of benchmarks and datasets for evaluating LLM.

527 32 Updated Jul 13, 2024

tengxiaoliu / LM_skip

[NeurIPS 2024] Can Language Models Learn to Skip Steps?

Python 20 Updated Jan 25, 2025

HKUNLP / STRING

[ICLR'25] Data and code for our paper "Why Does the Effective Context Length of LLMs Fall Short?"

Python 78 4 Updated Nov 25, 2024

GAIR-NLP / O1-Journey

O1 Replication Journey

2,003 63 Updated Jan 14, 2025

xiami2019 / UAR

[Findings of EMNLP'2024] Unified Active Retrieval for Retrieval Augmented Generation

Python 23 Updated Sep 30, 2024