nzw0301

Kento Nozawa nzw0301

190 followers · 132 following

Preferred Networks, Inc.
Japan
14:36 (UTC +09:00)
nzw0301.github.io

Achievements

x4 x3 x3 x2

Achievements

x4 x3 x3 x2

Organizations

Lists (4)

Sort

Stars

PrimeIntellect-ai / prime-rl

Async RL Training at Scale

Python 755 129 Updated Nov 10, 2025

huggingface / kernels-community

Kernel sources for https://huggingface.co/kernels-community

C++ 31 10 Updated Nov 7, 2025

ServiceNow / PipelineRL

A scalable asynchronous reinforcement learning implementation with in-flight weight updates.

Python 294 28 Updated Nov 8, 2025

volcengine / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 15,269 2,456 Updated Nov 10, 2025

QwenLM / AutoIF

Python 314 29 Updated Jul 25, 2024

digital-go-jp / lawqa_jp

251 7 Updated Oct 31, 2025

facebookresearch / RAM

A framework to study AI models in Reasoning, Alignment, and use of Memory (RAM).

Python 296 32 Updated Nov 8, 2025

deepseek-ai / DeepSeek-V3.2-Exp

Python 970 69 Updated Oct 2, 2025

huggingface / trl

Train transformer language models with reinforcement learning.

Python 16,231 2,285 Updated Nov 8, 2025

reka-ai / research-eval

A benchmark to evaluate search-augmented LLMs

Python 14 2 Updated Aug 28, 2025

NVIDIA-NeMo / Curator

Scalable data pre processing and curation toolkit for LLMs

Python 1,205 187 Updated Nov 7, 2025

ke1337 / IFBench

Forked from allenai/IFBench

Python 1 Updated Oct 15, 2025

facebookresearch / dinov3

Reference PyTorch implementation and models for DINOv3

Jupyter Notebook 8,193 554 Updated Nov 3, 2025

meta-llama / synthetic-data-kit

Tool for generating high quality Synthetic datasets

Python 1,372 192 Updated Oct 28, 2025

google-research / metricx

Python 115 19 Updated Dec 12, 2024

openai / gpt-oss

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 19,141 1,918 Updated Nov 1, 2025

character-ai / pipelining-sft

Simple and efficient DeepSeek V3 SFT using pipeline parallel and expert parallel, with both FP8 and BF16 trainings

Python 91 17 Updated Jul 27, 2025

ByteDance-Seed / Seed-X-7B

Python 150 7 Updated Aug 18, 2025

janhq / jan

Jan is an open source alternative to ChatGPT that runs 100% offline on your computer.

TypeScript 39,209 2,374 Updated Nov 10, 2025

allenai / IFBench

Python 87 11 Updated Oct 22, 2025

JohannesAck / OffPolicyCorrectedRewardModeling

Implementation for our COLM paper "Off-Policy Corrected Reward Modeling for RLHF"

Python 7 Updated Jul 23, 2025

actions / labeler

An action for automatically labelling pull requests

TypeScript 2,339 459 Updated Sep 30, 2025

SakanaAI / TransEvalnia

Reasoning-based Evaluation and Ranking of Translations.

Python 18 3 Updated Jul 18, 2025

ChenWu98 / algorithmic-creativity

[ICML 2025] Roll the dice & look before you leap: Going beyond the creative limits of next-token prediction

Python 73 7 Updated May 26, 2025

gradio-app / trackio

A lightweight, local-first, and 🆓 experiment tracking library from Hugging Face 🤗

Python 1,068 66 Updated Nov 7, 2025

AI-Hypercomputer / maxtext

A simple, performant and scalable Jax LLM!

Python 1,976 420 Updated Nov 10, 2025

MoonshotAI / Kimi-Dev

open-source coding LLM for software engineering tasks

Python 1,029 118 Updated Sep 30, 2025

optuna / optuna-mcp

The Optuna MCP Server is a Model Context Protocol (MCP) server to interact with Optuna APIs.

Python 65 21 Updated Nov 10, 2025

haozheji / exact-optimization

ICML 2024 - Official Repository for EXO: Towards Efficient Exact Optimization of Language Model Alignment

Python 57 1 Updated Jun 16, 2024

pfnet / plamo-translate-cli

Python 296 14 Updated Oct 18, 2025

Kento Nozawa nzw0301

Organizations

Lists (4)

datasets

resources

self-sup

tools

Stars