Skip to content
View nzw0301's full-sized avatar

Organizations

@apache @optuna

Block or report nzw0301

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Accelerating MoE with IO and Tile-aware Optimizations

Python 532 39 Updated Jan 5, 2026

MSLK (Meta Superintelligence Labs Kernels) is a collection of PyTorch GPU operator libraries that are designed and optimized for GenAI training and inference, such as FP8 row-wise quantization and …

Python 22 10 Updated Jan 10, 2026

[Preprint] RLVE: Scaling Up Reinforcement Learning for Language Models with Adaptive Verifiable Environments

Python 165 17 Updated Jan 7, 2026

Async RL Training at Scale

Python 986 173 Updated Jan 10, 2026

Kernel sources for https://huggingface.co/kernels-community

C++ 42 15 Updated Jan 8, 2026

A scalable asynchronous reinforcement learning implementation with in-flight weight updates.

Python 345 33 Updated Dec 23, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 18,197 2,995 Updated Jan 9, 2026
Python 320 31 Updated Jul 25, 2024

A framework to study AI models in Reasoning, Alignment, and use of Memory (RAM).

Python 341 41 Updated Dec 16, 2025

Train transformer language models with reinforcement learning.

Python 16,918 2,411 Updated Jan 9, 2026

A benchmark to evaluate search-augmented LLMs

Python 16 2 Updated Aug 28, 2025

Scalable data pre processing and curation toolkit for LLMs

Python 1,338 205 Updated Jan 10, 2026
Python 1 Updated Oct 15, 2025

Reference PyTorch implementation and models for DINOv3

Jupyter Notebook 9,236 684 Updated Nov 20, 2025

Tool for generating high quality Synthetic datasets

Python 1,455 204 Updated Oct 28, 2025
Python 119 22 Updated Dec 12, 2024

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 19,572 2,014 Updated Nov 1, 2025

Simple and efficient DeepSeek V3 SFT using pipeline parallel and expert parallel, with both FP8 and BF16 trainings

Python 112 18 Updated Jul 27, 2025
Python 160 7 Updated Aug 18, 2025

Jan is an open source alternative to ChatGPT that runs 100% offline on your computer.

TypeScript 40,013 2,472 Updated Jan 10, 2026
Python 97 18 Updated Jan 4, 2026

Implementation for our COLM paper "Off-Policy Corrected Reward Modeling for RLHF"

Python 7 Updated Jul 23, 2025

An action for automatically labelling pull requests

TypeScript 2,366 471 Updated Nov 17, 2025

Reasoning-based Evaluation and Ranking of Translations.

Python 18 4 Updated Jul 18, 2025

[ICML 2025] Roll the dice & look before you leap: Going beyond the creative limits of next-token prediction

Python 82 7 Updated May 26, 2025

A lightweight, local-first, and 🆓 experiment tracking library from Hugging Face 🤗

Python 1,204 90 Updated Jan 9, 2026

A simple, performant and scalable Jax LLM!

Python 2,080 447 Updated Jan 10, 2026

open-source coding LLM for software engineering tasks

Python 1,088 130 Updated Sep 30, 2025
Next