yuchenlichuck
😋
I am now a research intern at Amazon Science

Highlights

  • Pro

Organizations

@fossasia


Starred repositories

[NeurIPS 2025] SAMA: Towards Multi-Turn Referential Grounded Video Chat with Large Language Models.

10 Updated Sep 25, 2025

MotionStream: Real-Time Video Generation with Interactive Motion Controls

245 9 Updated Nov 4, 2025

Examples of my Claude Code infrastructure with skill auto-activation, hooks, and agents

Shell 4,985 649 Updated Oct 31, 2025

[ECCV 2024] Official GitHub repository for the paper "LingoQA: Visual Question Answering for Autonomous Driving"

Python 190 7 Updated Sep 26, 2024

FIBO is a SOTA, first open-source, JSON-native text-to-image model built for controllable, predictable, and legally safe image generation.

Python 217 7 Updated Nov 9, 2025

Your AI mate who chats on Tinder and schedules dates for you.

Python 78 17 Updated Jan 8, 2025

A collection of MCP servers.

74,535 6,241 Updated Nov 4, 2025

Auto Swipe for Tinder/Bumble

JavaScript 12 2 Updated Nov 12, 2019

Official code for NeurIPS 2025 paper "GRIT: Teaching MLLMs to Think with Images"

Python 161 8 Updated Oct 20, 2025

A collection of papers on World Models for Autonomous Driving (and Robotics).

1,514 62 Updated Nov 4, 2025

The official code of "VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning" [NeurIPS25]

Python 164 5 Updated Jun 5, 2025

[ICCV 2025 Oral] SceneSplat - Gaussian Splatting-based Scene Understanding with Vision-Language Pretraining

Python 263 11 Updated Oct 18, 2025

NeurIPS 2024 Paper: A Unified Pixel-level Vision LLM for Understanding, Generating, Segmenting, Editing

Python 576 35 Updated Oct 20, 2024

🐟 Code and models for the NeurIPS 2023 paper "Generating Images with Multimodal Language Models".

Jupyter Notebook 468 38 Updated Jan 19, 2024

Foundations of Large Models: a one-article guide to the fundamentals of large models

6,144 517 Updated Feb 24, 2025

Dive into LLMs: a series of hands-on programming tutorials

Jupyter Notebook 9,619 967 Updated Oct 10, 2025

ACL 2025: Synthetic data generation pipelines for text-rich images.

Python 145 23 Updated Mar 1, 2025

AlignCLIP: Improving Cross-Modal Alignment in CLIP (ICLR 2025)

Python 51 1 Updated Mar 1, 2025

ChronoDepth: Learning Temporally Consistent Video Depth from Video Diffusion Priors

Python 274 9 Updated Feb 27, 2025

Official implementation of DepthLM

Python 247 9 Updated Oct 7, 2025

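A free, open-source alternative to Wispr Flow | A next-generation Chinese desktop voice workflow that integrates local FunASR models and configurable large language models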

JavaScript 1,672 154 Updated Oct 8, 2025

Generate high-definition short videos with one click using large AI models.

Python 47,620 6,654 Updated Jun 11, 2025

A minimal codebase for finetuning large multimodal models, supporting llava-1.5/1.6, llava-interleave, llava-next-video, llava-onevision, llama-3.2-vision, qwen-vl, qwen2-vl, phi3-v etc.

Python 352 40 Updated Oct 28, 2025

Everyone Can Use English

TypeScript 31,858 4,533 Updated Apr 13, 2025

[ICLR 2025] Official Pytorch Implementation of MMR: A Large-scale Benchmark Dataset for Multi-target and Multi-granularity Reasoning Segmentation

Python 21 1 Updated Apr 3, 2025

A curated list of publications on image and video segmentation leveraging Multimodal Large Language Models (MLLMs), highlighting state-of-the-art methods, innovative applications, and key advanceme…

159 4 Updated Oct 29, 2025

[CVPR 2024 Highlight] Putting the Object Back Into Video Object Segmentation

Python 974 88 Updated Nov 8, 2024

Enjoy the magic of Diffusion models!

Python 10,609 990 Updated Nov 7, 2025

Code for 3D-LLM: Injecting the 3D World into Large Language Models

Python 1,154 70 Updated Jun 6, 2024