View Kimokcheon's full-sized avatar


Official repository of InternSVG.

Python 78 1 Updated Oct 16, 2025

A list of VLMs tailored for medical RG and VQA; and a list of medical vision-language datasets

199 13 Updated Mar 19, 2025

A PDF comparison utility in Python.

Python 499 79 Updated Dec 6, 2024
Python 5 Updated Oct 7, 2025

This repo contains evaluation code for the paper "MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI"

Python 522 47 Updated May 19, 2025

Accelerating the development of large multimodal models (LMMs) with one-click evaluation module - lmms-eval.

Python 66 4 Updated Aug 8, 2025

A comprehensive collection of process reward models.

122 3 Updated Oct 4, 2025

R1-VL: Learning to Reason with Multimodal Large Language Models via Step-wise Group Relative Policy Optimization

Python 437 Updated Oct 21, 2025

Academic Survey Paper Generation.

TeX 936 88 Updated Jun 22, 2025
Python 30 1 Updated Jul 31, 2025

😼 Elegantly use a clash/mihomo-based proxy environment

Shell 6,276 803 Updated Nov 27, 2025

Automated translation solution for visual novels (galgames) supporting GPT-4/Claude/Deepseek/Sakura and other large language models

Cython 1,863 123 Updated Oct 11, 2025

Developing VLMs for expert-level performance in specific medical specialties

Python 19 4 Updated Apr 25, 2025

[NeurIPS 2025] Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models

Python 51 3 Updated Sep 29, 2025

Med-R1: Reinforcement Learning for Generalizable Medical Reasoning in Vision-Language Models

Python 93 11 Updated Jul 7, 2025

🌟 100+ original LLM/RL algorithm diagrams 📚, presented by the author of the book 《大模型算法》 (Large Model Algorithms)! 💥

Python 1,909 207 Updated Nov 13, 2025

Devcontainer for using LaTeX in VS Code with auto-formatting and one-click arXiv export and link check.

Shell 29 3 Updated Oct 5, 2025

EH-Benchmark: Ophthalmic Hallucination Benchmark and Agent-Driven Top-Down Traceable Reasoning Workflow

Python 2 Updated Sep 29, 2025

Towards a Unified View of Large Language Model Post-Training

Python 187 11 Updated Sep 8, 2025

UrFound: Towards Universal Retinal Foundation Models via Knowledge-Guided Masked Modeling

Python 19 Updated Dec 6, 2024

PPTAgent: Generating and Evaluating Presentations Beyond Text-to-Slides [EMNLP 2025]

Python 2,304 273 Updated Nov 26, 2025

MMSearch-R1 is an end-to-end RL framework that enables LMMs to perform on-demand, multi-turn search with real-world multimodal search tools.

Python 357 17 Updated Aug 26, 2025

Is the medical segmentation problem solved? A survey

17 2 Updated Aug 29, 2025

Repo for "VRAG-RL: Empower Vision-Perception-Based RAG for Visually Rich Information Understanding via Iterative Reasoning with Reinforcement Learning"

Python 403 33 Updated Oct 14, 2025

A tool for backing up Feishu documents; it can convert Feishu documents to Markdown for download.

TypeScript 484 55 Updated Nov 25, 2025

Latest Advances on (RL based) Multimodal Reasoning and Generation in Multimodal Large Language Models

43 Updated Oct 30, 2025

🔥🔥 First-ever hour-scale video understanding models

Python 578 37 Updated Jul 14, 2025

Awesome List for Agentic RL

HTML 559 18 Updated Nov 27, 2025