baeseongsu

🔥

Getting Things Done

PhD Student @ KAIST AI | Ex-intern @microsoft (MSRA) | getting things done but NEVER losing the infimum in the rigor of research 🌕

157 followers · 527 following

KAIST
https://seongsubae.info

Starred repositories

dirkhovy / MACE

Multi-Annotator Competence Estimation tool

Java 85 9 Updated Jan 19, 2026

PRAISELab-PicusLab / MMMED

🩺 MMMED is a benchmark dataset for evaluating Vision-Language Models (VLMs) on medical multiple-choice question answering (MCQA) tasks. 🏥💡 It features 194 real-world medical questions from Spanish …

Jupyter Notebook 4 Updated Jul 10, 2025

zjysteven / VLM-Visualizer

Visualizing the attention of vision-language models

Jupyter Notebook 277 22 Updated Feb 28, 2025

MoonshotAI / Kimi-VL

Kimi-VL: Mixture-of-Experts Vision-Language Model for Multimodal Reasoning, Long-Context Understanding, and Strong Agent Capabilities

1,138 66 Updated Jul 15, 2025

NVIDIA-Medtech / NV-Segment-CTMR

Python 29 4 Updated Jan 18, 2026

EvolvingLMMs-Lab / lmms-eval

One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks

Python 3,568 491 Updated Jan 19, 2026

code-yeongyu / oh-my-opencode

The Best Agent Harness. Meet Sisyphus: The Batteries-Included Agent that codes like you.

TypeScript 20,170 1,415 Updated Jan 20, 2026

deepseek-ai / Engram

Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models

Python 2,995 193 Updated Jan 14, 2026

ghuntley / how-to-ralph-wiggum

Forked from ClaytonFarr/ralph-playbook

The Ralph Wiggum Technique—the AI development methodology that reduces software costs to less than a fast food worker's wage.

HTML 765 80 Updated Jan 11, 2026

benchflow-ai / skillsbench

SkillsBench evaluates how well skills work and how effective agents are at using them

Python 240 168 Updated Jan 20, 2026

ibrahimethemhamamci / BTB3D

[NeurIPS 2025] Better Tokens for Better 3D: Advancing Vision-Language Modeling in 3D Medical Imaging

Python 25 2 Updated Nov 4, 2025

sezginerr / example_download_script

This is an example download script to download CT-RATE

Python 17 1 Updated Apr 5, 2024

yuhui-zh15 / RadDiff

Python 2 Updated Jan 8, 2026

MedARC-AI / med-lm-envs

Automated LLM evaluation suite for medical tasks

Python 22 30 Updated Jan 19, 2026

StanfordMIMI / Merlin

Merlin is a 3D VLM for computed tomography that leverages both structured electronic health records (EHR) and unstructured radiology reports for pretraining.

Python 183 19 Updated Oct 22, 2025