View stat-eklee's full-sized avatar
Python 4,192 453 Updated Jul 31, 2025

Simple RL training for reasoning

Python 3,795 281 Updated Aug 3, 2025

A collection of AWESOME things about domain adaptation

5,369 884 Updated Sep 10, 2025

[Nature Reviews Bioengineering🔥] Application of Large Language Models in Medicine. A curated list of practical guide resources of Medical LLMs (Medical LLMs Tree, Tables, and Papers)

1,707 144 Updated Sep 27, 2025

Train transformer language models with reinforcement learning.

Python 16,450 2,321 Updated Nov 27, 2025

[COLM 2025] LIMO: Less is More for Reasoning

Python 1,052 52 Updated Jul 30, 2025

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

68,257 7,741 Updated Jun 4, 2025
Python 6 1 Updated Dec 12, 2024

Paper reproduction of Google's SCoRE (Training Language Models to Self-Correct via Reinforcement Learning)

Jupyter Notebook 142 23 Updated Sep 21, 2024

Clinical text summarization by adapting large language models

Python 150 32 Updated Jul 31, 2024

The official Meta Llama 3 GitHub site

Python 29,107 3,489 Updated Jan 26, 2025

A curated list of reinforcement learning with human feedback resources (continually updated)

4,216 249 Updated Sep 19, 2025

Federated Optimization in Heterogeneous Networks (MLSys '20)

Python 703 165 Updated Mar 24, 2023

Evaluation of Korean language models using a self-built Korean evaluation dataset

Python 31 2 Updated May 31, 2024

A framework for few-shot evaluation of language models.

Python 10,772 2,879 Updated Nov 27, 2025

Flower: A Friendly Federated AI Framework

Python 6,450 1,102 Updated Nov 27, 2025

This repository contains two datasets with multi-turn adversarial conversations generated by human agents interacting with a dialog model and rated for safety by two corresponding diverse rater pools.

29 5 Updated Jul 16, 2024

Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI

Python 1,406 69 Updated Apr 11, 2024
Python 235 23 Updated Jun 11, 2024

[NAACL 2024 Findings] Evaluation suite for the systematic evaluation of instruction selection methods.

Jupyter Notebook 23 2 Updated Jul 26, 2023

🐙 OctoPack: Instruction Tuning Code Large Language Models

Jupyter Notebook 474 27 Updated Feb 5, 2025

✏️ Technical interview study Cheat Sheet

226 18 Updated Oct 25, 2025

☁️ 구름 (KULLM): an LLM specialized for Korean, developed at Korea University

594 72 Updated May 1, 2024

Transformer related optimization, including BERT, GPT

C++ 6,354 923 Updated Mar 27, 2024

Korean BART

Python 466 95 Updated Jun 14, 2025

Code for T-Few from "Few-Shot Parameter-Efficient Fine-Tuning is Better and Cheaper than In-Context Learning"

Python 456 62 Updated Sep 6, 2023

Assignments for CS294-112.

Python 1,638 1,042 Updated Mar 24, 2023

For experiments involving instruct gpt. Currently used for documenting open research questions.

71 4 Updated Nov 8, 2022