-
KAIST
- https://seongsubae.info
Highlights
- Pro
Lists (3)
Sort Name ascending (A-Z)
Starred repositories
🩺 MMMED is a benchmark dataset for evaluating Vision-Language Models (VLMs) on medical multiple-choice question answering (MCQA) tasks. 🏥💡 It features 194 real-world medical questions from Spanish …
Visualizing the attention of vision-language models
Kimi-VL: Mixture-of-Experts Vision-Language Model for Multimodal Reasoning, Long-Context Understanding, and Strong Agent Capabilities
One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks
The Best Agent Harness. Meet Sisyphus: The Batteries-Included Agent that codes like you.
Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models
The Ralph Wiggum Technique—the AI development methodology that reduces software costs to less than a fast food worker's wage.
SkillsBench evaluates how well skills work and how effective agents are at using them
[NeurIPS 2025] Better Tokens for Better 3D: Advancing Vision-Language Modeling in 3D Medical Imaging
This is an example download script to download CT-RATE
Automated LLM evaluation suite for medical tasks
Merlin is a 3D VLM for computed tomography that leverages both structured electronic health records (EHR) and unstructured radiology reports for pretraining.
Tool for robust segmentation of >100 important anatomical structures in CT and MR images
VinDr-CXR: An open dataset of chest X-rays with radiologist’s annotations
A character-level language diffusion model trained on Tiny Shakespeare
MCP-Zero: Active Tool Discovery for Autonomous LLM Agents
Free-Text Promptable Universal 3D Medical Image Segmentation
Data2Evidence is an end-to-end solution for management and analysis of OMOP data - https://data2evidence.org
SimSUM -- Simulated Benchmark with Structured and Unstructured Medical Records
Repo for "Adaptation of Agentic AI"
[Medical_NLP ➟ Awesome-AI4Med] medical-related LLMs, Multimodal systems, Datasets, Benchmarks, and more.