  • Beijing
  • 06:20 (UTC +08:00)

Starred repositories

An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models

Python 2,398 164 Updated Nov 28, 2025

Democratizing Reinforcement Learning for LLMs

Jupyter Notebook 4,784 452 Updated Nov 27, 2025

slime is an LLM post-training framework for RL Scaling.

Python 2,615 288 Updated Nov 28, 2025

Bridge Megatron-Core to Hugging Face/Reinforcement Learning

Python 164 34 Updated Nov 27, 2025

Tools for merging pretrained large language models.

Python 6,495 638 Updated Nov 27, 2025

Official Repo for SvS: A Self-play with Variational Problem Synthesis strategy for RLVR training

Python 44 3 Updated Aug 25, 2025
Python 842 45 Updated Sep 15, 2025

[NeurIPS 2025 Spotlight] Reasoning Environments for Reinforcement Learning with Verifiable Rewards

Python 1,239 102 Updated Nov 13, 2025

An Open-Source Large-Scale Reinforcement Learning Project for Search Agents

Python 502 31 Updated Nov 26, 2025

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMa2, Qwen, GLM, Claude, etc.) over 100+ datasets.

Python 6,362 693 Updated Nov 28, 2025

Pip compatible CodeBLEU metric implementation available for linux/macos/win

Python 126 26 Updated Mar 31, 2025

⚡️ Express inspired web framework written in Go

Go 38,571 1,913 Updated Nov 28, 2025

Multi-SWE-bench: A Multilingual Benchmark for Issue Resolving

Python 286 37 Updated Nov 25, 2025
Python 43 7 Updated Oct 28, 2025

My learning notes/codes for ML SYS.

Python 4,294 259 Updated Nov 25, 2025

SkyRL: A Modular Full-stack RL Library for LLMs

Python 1,281 187 Updated Nov 27, 2025

🙌 OpenHands: Code Less, Make More

Python 65,279 7,968 Updated Nov 28, 2025

DiffuCoder: Understanding and Improving Masked Diffusion Models for Code Generation

Python 771 47 Updated Jul 9, 2025

A blazingly fast JSON serializing & deserializing library

Go 8,876 423 Updated Nov 20, 2025
Python 771 67 Updated Jun 26, 2025

[NeurIPS 2025 Spotlight] Co-Evolving LLM Coder and Unit Tester via Reinforcement Learning

Python 134 13 Updated Sep 19, 2025

A project to improve skills of large language models

Python 626 116 Updated Nov 28, 2025

InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥

Python 11,868 873 Updated Jul 18, 2024

A live stream development of RL tuning for LLM agents

Python 3,632 506 Updated Oct 8, 2025

Lightweight coding agent that runs in your terminal

Rust 51,427 6,511 Updated Nov 28, 2025
Python 315 16 Updated May 24, 2025

This repository contains the official implementation of Scale-Distribution Decoupling: Enabling Stable and Effective Training of Large Language Models

Python 7 Updated Apr 1, 2025

An Open Large Reasoning Model for Real-World Solutions

Python 1,528 80 Updated May 30, 2025

A PyTorch native platform for training generative AI models

Python 4,768 615 Updated Nov 25, 2025

🔥 A minimal training framework for scaling FLA models

Python 311 48 Updated Nov 15, 2025