Skip to content
View nzw0301's full-sized avatar

Organizations

@apache @optuna

Block or report nzw0301

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Async RL Training at Scale

Python 755 129 Updated Nov 10, 2025

Kernel sources for https://huggingface.co/kernels-community

C++ 31 10 Updated Nov 7, 2025

A scalable asynchronous reinforcement learning implementation with in-flight weight updates.

Python 294 28 Updated Nov 8, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 15,269 2,456 Updated Nov 10, 2025
Python 314 29 Updated Jul 25, 2024

A framework to study AI models in Reasoning, Alignment, and use of Memory (RAM).

Python 296 32 Updated Nov 8, 2025

Train transformer language models with reinforcement learning.

Python 16,231 2,285 Updated Nov 8, 2025

A benchmark to evaluate search-augmented LLMs

Python 14 2 Updated Aug 28, 2025

Scalable data pre processing and curation toolkit for LLMs

Python 1,205 187 Updated Nov 7, 2025
Python 1 Updated Oct 15, 2025

Reference PyTorch implementation and models for DINOv3

Jupyter Notebook 8,193 554 Updated Nov 3, 2025

Tool for generating high quality Synthetic datasets

Python 1,372 192 Updated Oct 28, 2025
Python 115 19 Updated Dec 12, 2024

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 19,141 1,918 Updated Nov 1, 2025

Simple and efficient DeepSeek V3 SFT using pipeline parallel and expert parallel, with both FP8 and BF16 trainings

Python 91 17 Updated Jul 27, 2025
Python 150 7 Updated Aug 18, 2025

Jan is an open source alternative to ChatGPT that runs 100% offline on your computer.

TypeScript 39,209 2,374 Updated Nov 10, 2025
Python 87 11 Updated Oct 22, 2025

Implementation for our COLM paper "Off-Policy Corrected Reward Modeling for RLHF"

Python 7 Updated Jul 23, 2025

An action for automatically labelling pull requests

TypeScript 2,339 459 Updated Sep 30, 2025

Reasoning-based Evaluation and Ranking of Translations.

Python 18 3 Updated Jul 18, 2025

[ICML 2025] Roll the dice & look before you leap: Going beyond the creative limits of next-token prediction

Python 73 7 Updated May 26, 2025

A lightweight, local-first, and 🆓 experiment tracking library from Hugging Face 🤗

Python 1,068 66 Updated Nov 7, 2025

A simple, performant and scalable Jax LLM!

Python 1,976 420 Updated Nov 10, 2025

open-source coding LLM for software engineering tasks

Python 1,029 118 Updated Sep 30, 2025

The Optuna MCP Server is a Model Context Protocol (MCP) server to interact with Optuna APIs.

Python 65 21 Updated Nov 10, 2025

ICML 2024 - Official Repository for EXO: Towards Efficient Exact Optimization of Language Model Alignment

Python 57 1 Updated Jun 16, 2024
Python 296 14 Updated Oct 18, 2025
Next