
Organizations

@apache @optuna


Modular, scalable library to train ML models

Python 191 19 Updated Jan 16, 2026

Accelerating MoE with IO and Tile-aware Optimizations

Python 551 44 Updated Jan 19, 2026

MSLK (Meta Superintelligence Labs Kernels) is a collection of PyTorch GPU operator libraries that are designed and optimized for GenAI training and inference, such as FP8 row-wise quantization and …

Python 35 16 Updated Jan 20, 2026

[Preprint] RLVE: Scaling Up Reinforcement Learning for Language Models with Adaptive Verifiable Environments

Python 174 18 Updated Jan 12, 2026

Async RL Training at Scale

Python 1,015 175 Updated Jan 20, 2026

Kernel sources for https://huggingface.co/kernels-community

C++ 46 15 Updated Jan 19, 2026

A scalable asynchronous reinforcement learning implementation with in-flight weight updates.

Python 347 34 Updated Jan 20, 2026

verl: Volcano Engine Reinforcement Learning for LLMs

Python 18,536 3,067 Updated Jan 20, 2026
Python 321 31 Updated Jul 25, 2024

A framework to study AI models in Reasoning, Alignment, and use of Memory (RAM).

Python 342 41 Updated Dec 16, 2025

Train transformer language models with reinforcement learning.

Python 17,064 2,431 Updated Jan 20, 2026

A benchmark to evaluate search-augmented LLMs

Python 17 2 Updated Aug 28, 2025

Scalable data preprocessing and curation toolkit for LLMs

Python 1,354 211 Updated Jan 19, 2026
Python 1 Updated Oct 15, 2025

Reference PyTorch implementation and models for DINOv3

Jupyter Notebook 9,354 701 Updated Nov 20, 2025

Tool for generating high-quality synthetic datasets

Python 1,471 206 Updated Oct 28, 2025
Python 127 22 Updated Dec 12, 2024

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 19,625 2,019 Updated Jan 13, 2026

Simple and efficient DeepSeek V3 SFT using pipeline parallelism and expert parallelism, with both FP8 and BF16 training

Python 114 18 Updated Jul 27, 2025
Python 161 7 Updated Aug 18, 2025

Jan is an open source alternative to ChatGPT that runs 100% offline on your computer.

TypeScript 40,116 2,491 Updated Jan 20, 2026
Python 98 18 Updated Jan 4, 2026

Implementation for our COLM paper "Off-Policy Corrected Reward Modeling for RLHF"

Python 7 Updated Jul 23, 2025

An action for automatically labelling pull requests

TypeScript 2,369 472 Updated Jan 13, 2026

Reasoning-based Evaluation and Ranking of Translations.

Python 18 4 Updated Jul 18, 2025

[ICML 2025] Roll the dice & look before you leap: Going beyond the creative limits of next-token prediction

Python 82 7 Updated May 26, 2025

A lightweight, local-first, and 🆓 experiment tracking library from Hugging Face 🤗

Python 1,223 94 Updated Jan 13, 2026

A simple, performant and scalable Jax LLM!

Python 2,100 452 Updated Jan 20, 2026