Stars
A powerful tool for creating fine-tuning datasets for LLMs
Tools for understanding how transformer predictions are built layer-by-layer
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
The unofficial Python package that returns responses from Google Bard through a cookie value.
LLM training code for Databricks foundation models
Instruction Tuning with GPT-4
Code for our EMNLP 2023 Paper: "LLM-Adapters: An Adapter Family for Parameter-Efficient Fine-Tuning of Large Language Models"
KoAlpaca: An open-source language model that understands Korean instructions
A reading list on instruction tuning, a trend starting from Natural Instructions (ACL 2022), FLAN (ICLR 2022), and T0 (ICLR 2022).
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
[ICML 2023] Exploring the Benefits of Training Expert Language Models over Instruction Tuning
[AAAI 2024] Investigating the Effectiveness of Task-Agnostic Prefix Prompt for Instruction Following
Crosslingual Generalization through Multitask Finetuning
AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data…
Space-efficient graph data converter
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Polyglot: Large Language Models of Well-balanced Competence in Multi-languages
A framework for few-shot evaluation of autoregressive language models.
[EMNLP 2023 Findings] Efficiently Enhancing Zero-Shot Performance of Instruction Following Model via Retrieval of Soft Prompt
Must-read papers on prompt-based tuning for pre-trained language models.
Convert Machine Learning Code Between Frameworks
🤗 The largest hub of ready-to-use datasets for AI models with fast, easy-to-use and efficient data manipulation tools
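As a brief illustration of the data-manipulation tools mentioned in the entry above, here is a minimal sketch using the library's widely documented `load_dataset` and `map` functions; the choice of the `imdb` dataset and the lowercasing transform are illustrative assumptions, not part of the original description.

```python
# Minimal sketch: load a public dataset from the hub and apply an
# element-wise transform with the library's map utility.
from datasets import load_dataset

dataset = load_dataset("imdb", split="train")  # downloads and caches the IMDB reviews
lowercased = dataset.map(lambda ex: {"text": ex["text"].lower()})  # per-example transform
print(lowercased[0]["text"][:80])
```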
TUTA and ForTaP for Structure-Aware and Numerical-Reasoning-Aware Table Pre-Training
ICLR 2022 Paper, SOTA Table Pre-training Model, TAPEX: Table Pre-training via Learning a Neural SQL Executor
Package to compute Mauve, a similarity score between neural text and human text. Install with `pip install mauve-text`.
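Since the entry above gives the install command, a hedged usage sketch may help. It assumes the `compute_mauve` entry point and the `featurize_model_name` argument described in the package documentation; the toy text lists are purely illustrative, and real comparisons should use much larger collections of human and model text.

```python
# A minimal sketch of the documented entry point, mauve.compute_mauve.
# The toy lists stand in for real collections of human-written and
# model-generated text; in practice each side should have hundreds of samples.
import mauve

human_text = [
    "The cat sat quietly on the windowsill, watching the rain.",
    "She finished the report an hour before the deadline.",
    "The museum was unexpectedly crowded on Tuesday morning.",
    "He planted tomatoes along the back fence in early spring.",
]
model_text = [
    "A cat rested on the windowsill and observed the rain outside.",
    "The report was completed roughly an hour ahead of schedule.",
    "On Tuesday morning the museum drew a surprisingly large crowd.",
    "In early spring he set tomato plants along the rear fence.",
]

# featurize_model_name selects the embedding model; "gpt2" keeps the demo light.
out = mauve.compute_mauve(p_text=human_text, q_text=model_text,
                          featurize_model_name="gpt2")
print(out.mauve)  # scalar MAUVE score; higher means the two distributions are closer
```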
Code associated with the Don't Stop Pretraining ACL 2020 paper
The corresponding code from our paper "DeCLUTR: Deep Contrastive Learning for Unsupervised Textual Representations". Do not hesitate to open an issue if you run into any trouble!