
Official repository for EXAONE 4.0 built by LG AI Research

93 6 Updated Aug 4, 2025

Scalable RL solution for advanced reasoning of language models

Python 1,795 101 Updated Mar 18, 2025

Recipes to scale inference-time compute of open models

Python 1,124 131 Updated May 22, 2025

Official repository for EXAONE built by LG AI Research

181 13 Updated Aug 8, 2024

Official repository for EXAONE 3.5 built by LG AI Research

203 22 Updated Dec 16, 2024

Official implementation of the paper "Process Reward Model with Q-value Rankings"

Python 65 7 Updated Feb 5, 2025

Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.

Python 20,787 2,217 Updated Mar 11, 2025

The Enterprise-Grade Production-Ready Multi-Agent Orchestration Framework. Website: https://swarms.ai

Python 5,597 714 Updated Jan 15, 2026

"Improving Mathematical Reasoning with Process Supervision" by OPENAI

Python 114 11 Updated Jan 13, 2026

A flexible and efficient training framework for large-scale alignment tasks

Python 447 39 Updated Oct 23, 2025

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

6,878 372 Updated Dec 17, 2025

[ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning

Python 184 16 Updated Jun 25, 2025
Python 287 21 Updated Jul 15, 2024

ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)

Python 688 51 Updated Jan 20, 2025

Recipes to train reward models for RLHF.

Python 1,499 109 Updated Apr 24, 2025

The source code of our work "Prepacking: A Simple Method for Fast Prefilling and Increased Throughput in Large Language Models" [AISTATS 2025]

Jupyter Notebook 60 5 Updated Oct 11, 2024

A JAX research toolkit for building, editing, and visualizing neural networks.

Python 1,838 68 Updated Jun 22, 2025
Python 4,288 466 Updated Jul 31, 2025

Schedule-Free Optimization in PyTorch

Python 2,252 72 Updated May 21, 2025

Multi-domain reasoning benchmark for Korean language models

Python 201 38 Updated Oct 17, 2024

terashuf shuffles multi-terabyte text files using limited memory

C++ 228 15 Updated Feb 5, 2023

Large Context Attention

Python 762 52 Updated Oct 13, 2025

Ring attention implementation with flash attention

Python 963 93 Updated Sep 10, 2025

Public Inflection Benchmarks

68 2 Updated Mar 6, 2024

Evaluating language model responses using a reward model

Python 29 2 Updated Feb 23, 2024

DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models. 🤖💤

Python 1,088 55 Updated Feb 2, 2025

The official PyTorch implementation of Google's Gemma models

Python 5,594 573 Updated May 30, 2025

An open collection of implementation tips, tricks and resources for training large language models

Python 491 21 Updated Mar 8, 2023