nawnoes

😀

Seonghwan Kim nawnoes

😀

72 followers · 19 following

Seoul, Korea

Achievements

Lists (9)

Sort

Stars

LG-AI-EXAONE / EXAONE-4.0

Official repository for EXAONE 4.0 built by LG AI Research

93 6 Updated Aug 4, 2025

PRIME-RL / PRIME

Scalable RL solution for advanced reasoning of language models

Python 1,795 101 Updated Mar 18, 2025

huggingface / search-and-learn

Recipes to scale inference-time compute of open models

Python 1,124 131 Updated May 22, 2025

LG-AI-EXAONE / EXAONE-3.0

Official repository for EXAONE built by LG AI Research

181 13 Updated Aug 8, 2024

LG-AI-EXAONE / EXAONE-3.5

Official repository for EXAONE 3.5 built by LG AI Research

203 22 Updated Dec 16, 2024

WindyLee0822 / Process_Q_Model

official implementation of paper "Process Reward Model with Q-value Rankings"

Python 65 7 Updated Feb 5, 2025

openai / swarm

Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.

Python 20,787 2,217 Updated Mar 11, 2025

kyegomez / swarms

The Enterprise-Grade Production-Ready Multi-Agent Orchestration Framework. Website: https://swarms.ai

Python 5,597 714 Updated Jan 15, 2026

kyegomez / Lets-Verify-Step-by-Step

"Improving Mathematical Reasoning with Process Supervision" by OPENAI

Python 114 11 Updated Jan 13, 2026

alibaba / ChatLearn

A flexible and efficient training framework for large-scale alignment tasks

Python 447 39 Updated Oct 23, 2025

hijkzzz / Awesome-LLM-Strawberry

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

6,878 372 Updated Dec 17, 2025

tianyi-lab / Superfiltering

[ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning

Python 184 16 Updated Jun 25, 2025

deepseek-ai / DeepSeek-Prover-V1.5

Python 553 231 Updated Aug 16, 2024

google-deepmind / nanodo

Python 287 21 Updated Jul 15, 2024

THUDM / ReST-MCTS

ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)

Python 688 51 Updated Jan 20, 2025

RLHFlow / RLHF-Reward-Modeling

Recipes to train reward model for RLHF.

Python 1,499 109 Updated Apr 24, 2025

siyan-zhao / prepacking

The source code of our work "Prepacking: A Simple Method for Fast Prefilling and Increased Throughput in Large Language Models" [AISTATS 2025]

Jupyter Notebook 60 5 Updated Oct 11, 2024

google-deepmind / penzai

A JAX research toolkit for building, editing, and visualizing neural networks.

Python 1,838 68 Updated Jun 22, 2025

openai / simple-evals

Python 4,288 466 Updated Jul 31, 2025

facebookresearch / schedule_free

Schedule-Free Optimization in PyTorch

Python 2,252 72 Updated May 21, 2025

instructkr / LogicKor

한국어 언어모델 다분야 사고력 벤치마크

Python 201 38 Updated Oct 17, 2024

alexandres / terashuf

terashuf shuffles multi-terabyte text files using limited memory

C++ 228 15 Updated Feb 5, 2023

haoliuhl / ringattention

Large Context Attention

Python 762 52 Updated Oct 13, 2025

zhuzilin / ring-flash-attention

Ring attention implementation with flash attention

Python 963 93 Updated Sep 10, 2025

InflectionAI / Inflection-Benchmarks

Public Inflection Benchmarks

68 2 Updated Mar 6, 2024

openai / transformer-debugger

Python 4,111 238 Updated Jun 4, 2024

HeegyuKim / ko-rm-judge

Reward Model을 이용하여 언어모델의 답변을 평가하기

Python 29 2 Updated Feb 23, 2024

datadreamer-dev / DataDreamer

DataDreamer: Prompt. Generate Synthetic Data. Train & Align Models. 🤖💤

Python 1,088 55 Updated Feb 2, 2025

google / gemma_pytorch

The official PyTorch implementation of Google's Gemma models

Python 5,594 573 Updated May 30, 2025

huggingface / large_language_model_training_playbook

An open collection of implementation tips, tricks and resources for training large language models

Python 491 21 Updated Mar 8, 2023

Seonghwan Kim nawnoes

Lists (9)

🏃‍♂️ Crawl

Dataset

📔 Knowledge

📔 NLP

👣 Preprocess

🐎 Reinforcement&Meta Learning

🕵️‍♀️ Retrieval

🛠 Tool

💄 Transformers

Stars