Xin (Eric) Wang eric-xw

💭

I may be slow to respond.

Researcher in natural language processing, computer vision, and machine learning.

229 followers · 23 following

University of California, Santa Barbara

Stars

eric-ai-lab / DMLR

Official codebase for the paper "Reasoning Within the Mind: Dynamic Multimodal Interleaving in Latent Space"

Python 40 Updated Dec 17, 2025

alvr-workshop / alvr-workshop.github.io

CSS 1 Updated Aug 15, 2024

eric-ai-lab / EvoPresent

Official codebase for the paper "Presenting a Paper is an Art: Self-Improvement Aesthetic Agents for Academic Presentations"

Python 330 22 Updated Oct 14, 2025

QwenLM / Qwen3-Omni

Qwen3-Omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, images, and video, as well as generating speech in real time.

Jupyter Notebook 3,228 202 Updated Jan 8, 2026

zai-org / GLM-4.5

GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models

Python 3,738 387 Updated Dec 23, 2025

microsoft / GUI-Actor

[NeurIPS'25] GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents

Python 370 46 Updated Oct 29, 2025

MLRM-Halu / MLRM-Halu

[NeurIPS 2025] More Thinking, Less Seeing? Assessing Amplified Hallucination in Multimodal Reasoning Models

Python 73 3 Updated May 31, 2025

eric-ai-lab / SafeKey

[EMNLP 2025] Official code for the paper "SafeKey: Amplifying Aha-Moment Insights for Safety Reasoning"

Python 14 1 Updated Jun 30, 2025

eric-ai-lab / GRIT

Official code for NeurIPS 2025 paper "GRIT: Teaching MLLMs to Think with Images"

Python 172 10 Updated Jan 8, 2026

eric-ai-lab / Soft-Thinking

Official implementation of the NeurIPS 2025 paper "Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous Concept Space"

Python 296 37 Updated Dec 12, 2025

simular-ai / Agent-S

Agent S: an open agentic framework that uses computers like a human

Python 9,392 1,076 Updated Dec 16, 2025

mem0ai / mem0

Universal memory layer for AI Agents

Python 45,313 4,945 Updated Jan 10, 2026

eric-ai-lab / EditRoom

[ICLR 2025] EditRoom: LLM-parameterized Graph Diffusion for Composable 3D Room Layout Editing

Python 20 4 Updated Apr 1, 2025

karpathy / LLM101n

LLM101n: Let's build a Storyteller

36,128 1,966 Updated Aug 1, 2024

eric-ai-lab / MMIR

[ACL 2025 Findings] "Multimodal Inconsistency Reasoning (MMIR): A New Benchmark for Multimodal Reasoning Models"

Python 13 Updated Feb 25, 2025

eric-ai-lab / Mojito

Official repo for the paper "Mojito: Motion Trajectory and Intensity Control for Video Generation"

Python 5 1 Updated Jun 11, 2025

QwenLM / Qwen3-VL

Qwen3-VL is the multimodal large language model series developed by the Qwen team at Alibaba Cloud.

Jupyter Notebook 17,676 1,502 Updated Jan 4, 2026

facebookresearch / large_concept_model

Large Concept Models: Language modeling in a sentence representation space

Python 2,323 206 Updated Jan 29, 2025

NVIDIA / Cosmos

New repo collection for NVIDIA Cosmos: https://github.com/nvidia-cosmos

8,074 524 Updated Jan 6, 2026

microsoft / OmniParser

A simple screen parsing tool towards pure vision based GUI agent

Jupyter Notebook 24,180 2,076 Updated Sep 12, 2025

openai / swarm

Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.

Python 20,772 2,218 Updated Mar 11, 2025

eric-ai-lab / MSSBench

[ICLR 2025] Official codebase for the paper "Multimodal Situational Safety"

Python 30 2 Updated Jun 23, 2025

GengzeZhou / NavGPT-2

[ECCV 2024] Official implementation of NavGPT-2: Unleashing Navigational Reasoning Capability for Large Vision-Language Models

Python 232 13 Updated Sep 20, 2024

eric-ai-lab / ViCor

Implementation of the ACL 2024 Findings paper "ViCor: Bridging Visual Understanding and Commonsense Reasoning with Large Language Models"

4 Updated Jun 11, 2024

InternLM / InternLM-XComposer

InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions

Python 2,906 177 Updated May 26, 2025

eric-ai-lab / Screen-Point-and-Read

Code repo for "Read Anywhere Pointed: Layout-aware GUI Screen Reading with Tree-of-Lens Grounding"

Python 28 3 Updated Jul 31, 2024

eric-ai-lab / via-video

26 Updated Jun 20, 2024

eric-ai-lab / MMWorld

Official repo of the ICLR 2025 paper "MMWorld: Towards Multi-discipline Multi-faceted World Model Evaluation in Videos"

Python 29 1 Updated Jul 15, 2025

eric-ai-lab / ProbMed

[ACL 2025 Findings] "Worse than Random? An Embarrassingly Simple Probing Evaluation of Large Multimodal Models in Medical VQA"

Python 25 1 Updated Feb 21, 2025

letta-ai / letta

Letta is the platform for building stateful agents: open AI with advanced memory that can learn and self-improve over time.

Python 20,588 2,145 Updated Jan 3, 2026
