-
Tsinghua University
- Beijing
- https://fi.ee.tsinghua.edu.cn/~gaochen
Starred repositories
Awesome paper list and repos of the paper "A comprehensive survey of embodied world models".
[Neurips’25] Code for the paper "Balanced Token Pruning: Accelerating Vision Language Models Beyond Local Optimization"
[ACM MM'25] Code for the paper "Open3D-VQA: A Benchmark for Embodied Spatial Reasoning with Multimodal Large Language Model in Open Space"
[ACM MM'25] Code for the paper 'AirScape: An Aerial Generative World Model with Motion Controllability'
PyTorch code and models for VJEPA2 self-supervised learning from video.
A list of works on video generation towards world model
Cosmos-Transfer1 is a world-to-world transfer model designed to bridge the perceptual divide between simulated and real-world environments.
A curated list of awesome curated lists of many topics.
A a curated list of curated lists of awesome lists.
A curated list of awesome lists of awesome lists.
[ACL'25 Oral] Code for the paper "UrbanVideo-Bench: Benchmarking Vision-Language Models on Embodied Intelligence with Video Data in Urban Spaces"
tsinghua-fib-lab / LLM4SBR
Forked from QEpiphany/LLM4SBRA Lightweight and Effective LLM-enhanced Framework for Session-based Recommendation
🔥[CVPR2025] EventGPT: Event Stream Understanding with Multimodal Large Language Models
An index for papers on large language model agents for recommendation and search.
AgentSociety: Large-scale Social Simulation to Understand Human Behaviors and Society through LLM-driven Agents
The official repository of "SmartAgent: Chain-of-User-Thought for Embodied Personalized Agent in Cyber World".
🔥 The free & Open Source DocuSign alternative
The official code repository of our RecSys 2023 paper.
Official implementation for "UniST: A Prompt-Empowered Universal Model for Urban Spatio-Temporal Prediction" (KDD 2024)
The official implementation of the paper "AgentSquare: Automatic LLM Agent Search in Modular Design Space""
[NeurIPS'24] "Membership Inference Attacks against Fine-tuned Large Language Models via Self-prompt Calibration"