- University of Minnesota, Twin Cities
- USA
- https://dykang.github.io/
- @dongyeopkang
- https://minnesotanlp.github.io/
- https://github.com/minnesotanlp
Stars
Our library for RL environments + evals
GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.
Minimal reproduction of DeepSeek R1-Zero
Karin de Langis's EMNLP 2024 paper on "Dynamic Multi-Reward Weighting for Multi-Style Controllable Generation"
The full dataset behind paperswithcode.com
Parkar and Kim et al.'s paper on "SelectLLM: Can LLMs Select Important Instructions to Annotate?"
Code repository for Kim et al.'s ACL 2024 paper: "Threads of Subtlety: Detecting Machine-Generated Texts Through Discourse Motifs"
Code for Das et al.'s paper "Which Modality should I use -- Text, Motif, or Image? : Understanding Graphs with Large Language Models"
Survey of LLM generated data
Code and dataset for de Langis et al.'s paper "A Comparative Study on Textual Saliency of Styles from Eye Tracking, Annotations, and Language Models"
Code for Hayati et al.'s paper "StyLEx: Explaining Styles with Lexicon-Based Human Perception"
The fastest pure-Python PEG parser I can muster
Code and data for Hayati et al.'s paper on "How Far Can We Extract Diverse Perspectives from Large Language Models? Criteria-Based Diversity Prompting!"
Code and data for Koo et al.'s ACL 2024 paper "Benchmarking Cognitive Biases in Large Language Models as Evaluators"
Simple and easy to use widget with your GitHub profile — No dependencies
Jaehyung Kim et al.'s ICML 2023 paper "Prefer to Classify: Improving Text Classifiers via Auxiliary Preference Learning"
Jaehyung Kim et al.'s ACL 2023 paper on "infoVerse: A Universal Framework for Dataset Characterization with Multidimensional Meta-information"
Official implementation of the paper "CoEdIT: Text Editing by Task-Specific Instruction Tuning" (EMNLP 2023)
Code for Koo and Martin et al.'s in2writing paper on "Decoding the End-to-end Writing Trajectory in Scholarly Manuscripts"
Codebase, data and models for the SummaC paper in TACL
ACL 2021 paper "Style is NOT a single variable: Case Studies for Cross-Style Language Understanding" by Dongyeop Kang and Eduard Hovy
Official implementation of Wan et al.'s paper "Everyone's Voice Matters: Quantifying Annotation Disagreement Using Demographic Information" (AAAI 2023)
Official implementation of the paper "IteraTeR: Understanding Iterative Revision from Human-Written Text" (ACL 2022)
Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.