
[CVPR 2023 Highlight & TPAMI] Video-Text as Game Players: Hierarchical Banzhaf Interaction for Cross-Modal Representation Learning

Python 122 5 Updated Dec 28, 2024

[ICLR 2025] TRACE: Temporal Grounding Video LLM via Causal Event Modeling

Python 135 3 Updated Aug 22, 2025

Video-R1: Reinforcing Video Reasoning in MLLMs [🔥the first paper to explore R1 for video]

Python 734 39 Updated Sep 19, 2025

The official GitHub page for the survey paper "Discrete Tokenization for Multimodal LLMs: A Comprehensive Survey". The paper is under review.

69 1 Updated Aug 9, 2025

🌐 Permanent Hosting Site: http://ai-paper-finder.info/ 🌐 Hugging Face Hosting: https://huggingface.co/spaces/wenhanacademia/ai-paper-finder

Jupyter Notebook 143 8 Updated Nov 6, 2025
Python 21 3 Updated Apr 15, 2025

Summary of Spatio-Temporal Representation Learning Models.

77 6 Updated Jan 26, 2023

[PyTorch] The repo contains the code for "FORGE: Forming Semantic Identifiers for Generative Retrieval in Industrial Datasets"

Python 110 11 Updated Nov 6, 2025

[CVPR 2025] Uni4D: Unifying Visual Foundation Models for 4D Modeling from a Single Video

Python 200 11 Updated May 25, 2025

Rate-Adaptive Quantization: A Multi-Rate Codebook Adaptation for Vector Quantization-based Generative Models

Python 15 1 Updated Sep 10, 2025

Code for "FlashWorld: High-quality 3D Scene Generation within Seconds"

Python 510 35 Updated Oct 22, 2025

Minimal reproduction of OneRec

Python 316 33 Updated Nov 7, 2025

FB (Facebook) + GEMM (General Matrix-Matrix Multiplication) - https://code.fb.com/ml-applications/fbgemm/

C++ 1,471 676 Updated Nov 8, 2025
Python 47 3 Updated Oct 15, 2025

[ICCV 2023] - Composed Image Retrieval on Common Objects in context (CIRCO) dataset

Python 78 2 Updated Aug 6, 2025
Python 160 40 Updated Mar 7, 2022

[CVPR 2025] Official implementation for "Empowering LLMs to Understand and Generate Complex Vector Graphics" https://arxiv.org/abs/2412.11102

Python 584 13 Updated May 22, 2025

[EMNLP 2025] Awesome RAG Reasoning Resources

343 26 Updated Jul 24, 2025

WildSeg3D: Segment Any 3D Objects in the Wild from 2D Images

6 Updated Jul 17, 2025

Examples for Recommenders - easy to train and deploy on accelerated infrastructure.

Python 165 36 Updated Nov 5, 2025

[ICLR 2024] Official repository for "Vision-by-Language for Training-Free Compositional Image Retrieval"

Python 80 7 Updated Jul 4, 2024

Public repository for Skills

Python 15,956 1,386 Updated Oct 18, 2025

Contexts Optical Compression

Python 19,924 1,443 Updated Oct 25, 2025

Single File, Single GPU, From Scratch, Efficient, Full Parameter Tuning library for "RL for LLMs"

Jupyter Notebook 556 51 Updated Oct 7, 2025

KDD2025, Generative Next POI Recommendation with Semantic ID

Jupyter Notebook 22 2 Updated Oct 11, 2025

[ICDE'24] Code of "Adapting Large Language Models by Integrating Collaborative Semantics for Recommendation."

Python 174 17 Updated Sep 9, 2024