eunkyeong stat-eklee

🏢

Working from Company

Research areas: NLP, Graph (Recommendation), RL

14 followers · 11 following

Stars

hkust-nlp / simpleRL-reason

Simple RL training for reasoning

Python 3,795 281 Updated Aug 3, 2025

zhaoxin94 / awesome-domain-adaptation

A collection of AWESOME things about domain adaptation

5,369 884 Updated Sep 10, 2025

AI-in-Health / MedLLMsPracticalGuide

[Nature Reviews Bioengineering🔥] Application of Large Language Models in Medicine. A curated list of practical guide resources of Medical LLMs (Medical LLMs Tree, Tables, and Papers)

1,707 144 Updated Sep 27, 2025

huggingface / trl

Train transformer language models with reinforcement learning.

Python 16,450 2,321 Updated Nov 27, 2025

GAIR-NLP / LIMO

[COLM 2025] LIMO: Less is More for Reasoning

Python 1,052 52 Updated Jul 30, 2025

mlabonne / llm-course

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

68,257 7,741 Updated Jun 4, 2025

deepseek-ai / DeepSeek-V3

Python 100,410 16,368 Updated Aug 28, 2025

daje0601 / Google_SCoRe

Paper Reproduction Google SCoRE(Training Language Models to Self-Correct via Reinforcement Learning)

Jupyter Notebook 142 23 Updated Sep 21, 2024

StanfordMIMI / clin-summ

Clinical text summarization by adapting large language models

Python 150 32 Updated Jul 31, 2024

meta-llama / llama3

The official Meta Llama 3 GitHub site

Python 29,107 3,489 Updated Jan 26, 2025

opendilab / awesome-RLHF

A curated list of reinforcement learning with human feedback resources (continually updated)

4,216 249 Updated Sep 19, 2025

litian96 / FedProx

Federated Optimization in Heterogeneous Networks (MLSys '20)

Python 703 165 Updated Mar 24, 2023

davidkim205 / kollm_evaluation

Evaluation of Korean language models using an in-house Korean evaluation dataset

Python 31 2 Updated May 31, 2024

EleutherAI / lm-evaluation-harness

A framework for few-shot evaluation of language models.

Python 10,772 2,879 Updated Nov 27, 2025

adap / flower

Flower: A Friendly Federated AI Framework

Python 6,450 1,102 Updated Nov 27, 2025

google-research-datasets / dices-dataset

This repository contains two datasets with multi-turn adversarial conversations generated by human agents interacting with a dialog model and rated for safety by two corresponding diverse rater pools.

29 5 Updated Jul 16, 2024