Skip to content
View baeseongsu's full-sized avatar
🔥
Getting Things Done
🔥
Getting Things Done

Highlights

  • Pro

Block or report baeseongsu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Multi-Annotator Competence Estimation tool

Java 85 9 Updated Jan 19, 2026

🩺 MMMED is a benchmark dataset for evaluating Vision-Language Models (VLMs) on medical multiple-choice question answering (MCQA) tasks. 🏥💡 It features 194 real-world medical questions from Spanish …

Jupyter Notebook 4 Updated Jul 10, 2025

Visualizing the attention of vision-language models

Jupyter Notebook 277 22 Updated Feb 28, 2025

Kimi-VL: Mixture-of-Experts Vision-Language Model for Multimodal Reasoning, Long-Context Understanding, and Strong Agent Capabilities

1,138 66 Updated Jul 15, 2025

One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks

Python 3,568 491 Updated Jan 19, 2026

The Best Agent Harness. Meet Sisyphus: The Batteries-Included Agent that codes like you.

TypeScript 20,170 1,415 Updated Jan 20, 2026

Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models

Python 2,995 193 Updated Jan 14, 2026

The Ralph Wiggum Technique—the AI development methodology that reduces software costs to less than a fast food worker's wage.

HTML 765 80 Updated Jan 11, 2026

SkillsBench evaluates how well skills work and how effective agents are at using them

Python 240 168 Updated Jan 20, 2026

[NeurIPS 2025] Better Tokens for Better 3D: Advancing Vision-Language Modeling in 3D Medical Imaging

Python 25 2 Updated Nov 4, 2025

This is an example download script to download CT-RATE

Python 17 1 Updated Apr 5, 2024
Python 2 Updated Jan 8, 2026

Automated LLM evaluation suite for medical tasks

Python 22 30 Updated Jan 19, 2026

Merlin is a 3D VLM for computed tomography that leverages both structured electronic health records (EHR) and unstructured radiology reports for pretraining.

Python 183 19 Updated Oct 22, 2025

Tool for robust segmentation of >100 important anatomical structures in CT and MR images

Python 2,422 395 Updated Jan 8, 2026

VinDr-CXR: An open dataset of chest X-rays with radiologist’s annotations

Python 58 9 Updated Jan 4, 2021

A character-level language diffusion model trained on Tiny Shakespeare

Python 833 79 Updated Jan 16, 2026

MedRAX-2

Python 15 2 Updated Jan 2, 2026

MCP-Zero: Active Tool Discovery for Autonomous LLM Agents

Python 437 49 Updated Jul 2, 2025

Free-Text Promptable Universal 3D Medical Image Segmentation

Python 91 7 Updated Dec 27, 2025

Data2Evidence is an end-to-end solution for management and analysis of OMOP data - https://data2evidence.org

TypeScript 31 8 Updated Jan 20, 2026

SimSUM -- Simulated Benchmark with Structured and Unstructured Medical Records

Jupyter Notebook 4 Updated Jan 14, 2026

Repo for "Adaptation of Agentic AI"

569 44 Updated Dec 22, 2025
Python 6 1 Updated Jan 15, 2026
Python 7 Updated Dec 25, 2025
Python 158 17 Updated Dec 18, 2025

[Medical_NLP ➟ Awesome-AI4Med] medical-related LLMs, Multimodal systems, Datasets, Benchmarks, and more.

2,491 434 Updated Jan 14, 2026

First, Do NOHARM AI Benchmark Leaderboard

TypeScript 9 1 Updated Jan 10, 2026
Next