- Yonsei University
- Seoul, Republic of Korea
- https://jerife.org
Stars
A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents.
[ICLR 2025] See What You Are Told: Visual Attention Sink in Large Multimodal Models
DeepSeek-VL: Towards Real-World Vision-Language Understanding
Official code for "TallyQA: Answering Complex Counting Questions", published at AAAI 2019
TallyQA: Answering Complex Counting Questions dataset
verl: Volcano Engine Reinforcement Learning for LLMs
Open-source evaluation toolkit for large multi-modality models (LMMs), supporting 220+ LMMs and 80+ benchmarks
[EMNLP 2025] The official implementation of "Zero-shot Multimodal Document Retrieval via Cross-Modal Question Generation"
A simple Jekyll + GitHub Pages powered resume template.
[AAAI 2025 oral] Evaluating Mathematical Reasoning Beyond Accuracy
MathDial: A Dialog Tutoring Dataset with Rich Pedagogical Properties Grounded in Math Reasoning Problems, EMNLP Findings 2023
A technical report / research paper repository for tool-integrated reasoning.
Code for "Visual Sketchpad: Sketching as a Visual Chain of Thought for Multimodal Language Models"
maze datasets for investigating OOD behavior of ML systems
DeepPrivacy2 - A Toolbox for Realistic Image Anonymization
DeepPrivacy: A Generative Adversarial Network for Face Anonymization
Resources and paper list for "Thinking with Images for LVLMs". This repository accompanies our survey on how LVLMs can leverage visual information for complex reasoning, planning, and generation.
Imagine While Reasoning in Space: Multimodal Visualization-of-Thought (ICML 2025)
[NeurIPS 2024] Repository for the "Visualization-of-Thought" dataset, construction code, and evaluation.
[ACL 2025] VisuoThink: Empowering LVLM Reasoning with Multimodal Tree Search