yuchenlichuck

Follow

😋

I am now research intern in Amazon Science

Yuchen Li yuchenlichuck

😋

I am now research intern in Amazon Science

Follow

ALIBABA SUMMER OF CODE @ APACHE ROCKETMQ

90 followers · 96 following

mbzuai
abu dhabi
https://liyc.pw
https://www.xiaohongshu.com/user/profile/5b788ddd2cc31d0001487a94

Achievements

Achievements

Highlights

Pro

Organizations

Lists (1)

Sort

🔮 Future ideas

Starred repositories

sunye23 / SAMA

[NeurIPS 2025] SAMA: Towards Multi-Turn Referential Grounded Video Chat with Large Language Models.

10 Updated Sep 25, 2025

alex4727 / MotionStream

MotionStream: Real-Time Video Generation with Interactive Motion Controls

245 9 Updated Nov 4, 2025

diet103 / claude-code-infrastructure-showcase

Examples of my Claude Code infrastructure with skill auto-activation, hooks, and agents

Shell 4,985 649 Updated Oct 31, 2025

wayveai / LingoQA

[ECCV 2024] Official GitHub repository for the paper "LingoQA: Visual Question Answering for Autonomous Driving"

Python 190 7 Updated Sep 26, 2024

Bria-AI / FIBO

FIBO is a SOTA, first open-source, JSON-native text-to-image model built for controllable, predictable, and legally safe image generation.

Python 217 7 Updated Nov 9, 2025

Grigorij-Dudnik / TinderGPT

Your AI mate who chats on tinder and schedules dates for you.

Python 78 17 Updated Jan 8, 2025

punkpeye / awesome-mcp-servers

A collection of MCP servers.

74,535 6,241 Updated Nov 4, 2025

mmuyskens / autoswipe

Auto Swipe for Tinder/Bumble

JavaScript 12 2 Updated Nov 12, 2019

eric-ai-lab / GRIT

Official code for NeurIPS 2025 paper "GRIT: Teaching MLLMs to Think with Images"

Python 161 8 Updated Oct 20, 2025

LMD0311 / Awesome-World-Model

Collect some World Models for Autonomous Driving (and Robotic) papers.

1,514 62 Updated Nov 4, 2025

TIGER-AI-Lab / VL-Rethinker

The official code of "VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning" [NeurIPS25]

Python 164 5 Updated Jun 5, 2025

unique1i / SceneSplat

[ICCV 2025 Oral] SceneSplat - Gaussian Splatting-based Scene Understanding with Vision-Language Pretraining

Python 263 11 Updated Oct 18, 2025

SkyworkAI / Vitron

NeurIPS 2024 Paper: A Unified Pixel-level Vision LLM for Understanding, Generating, Segmenting, Editing

Python 576 35 Updated Oct 20, 2024

kohjingyu / gill

🐟 Code and models for the NeurIPS 2023 paper "Generating Images with Multimodal Language Models".

Jupyter Notebook 468 38 Updated Jan 19, 2024

datawhalechina / so-large-lm

大模型基础: 一文了解大模型基础知识

6,144 517 Updated Feb 24, 2025

Lordog / dive-into-llms

《动手学大模型Dive into LLMs》系列编程实践教程

Jupyter Notebook 9,619 967 Updated Oct 10, 2025

allenai / pixmo-docs

ACL 2025: Synthetic data generation pipelines for text-rich images.

Python 145 23 Updated Mar 1, 2025

sarahESL / AlignCLIP

AlignCLIP: Improving Cross-Modal Alignment in CLIP (ICLR 2025)

Python 51 1 Updated Mar 1, 2025

jiahao-shao1 / ChronoDepth

ChronoDepth: Learning Temporally Consistent Video Depth from Video Diffusion Priors

Python 274 9 Updated Feb 27, 2025

facebookresearch / DepthLM_Official

Official implementation of DepthLM

Python 247 9 Updated Oct 7, 2025

yan5xu / ququ

开源免费的 Wispr Flow 替代方案 | 集成FunASR本地模型和可配置大语言模型的下一代中文桌面语音工作流

JavaScript 1,672 154 Updated Oct 8, 2025

harry0703 / MoneyPrinterTurbo

利用AI大模型，一键生成高清短视频 Generate short videos with one click using AI LLM.

Python 47,620 6,654 Updated Jun 11, 2025

zjysteven / lmms-finetune

A minimal codebase for finetuning large multimodal models, supporting llava-1.5/1.6, llava-interleave, llava-next-video, llava-onevision, llama-3.2-vision, qwen-vl, qwen2-vl, phi3-v etc.

Python 352 40 Updated Oct 28, 2025

ZuodaoTech / everyone-can-use-english

人人都能用英语

TypeScript 31,858 4,533 Updated Apr 13, 2025

jdg900 / MMR

[ICLR 2025] Official Pytorch Implementation of MMR: A Large-scale Benchmark Dataset for Multi-target and Multi-granularity Reasoning Segmentation

Python 21 1 Updated Apr 3, 2025

mc-lan / Awesome-MLLM-Segmentation

A curated list of publications on image and video segmentation leveraging Multimodal Large Language Models (MLLMs), highlighting state-of-the-art methods, innovative applications, and key advanceme…

159 4 Updated Oct 29, 2025

hkchengrex / Cutie

[CVPR 2024 Highlight] Putting the Object Back Into Video Object Segmentation

Python 974 88 Updated Nov 8, 2024

modelscope / DiffSynth-Studio

Enjoy the magic of Diffusion models!

Python 10,609 990 Updated Nov 7, 2025

DecartAI / Lucy-Edit-ComfyUI

Python 645 70 Updated Nov 7, 2025

UMass-Embodied-AGI / 3D-LLM

Code for 3D-LLM: Injecting the 3D World into Large Language Models

Python 1,154 70 Updated Jun 6, 2024

Starred topics

3D

differentiable-rendering

neural-network-visualizations

Electron

Markdown

Django

scikit-learn

Python

React

JavaScript

See all starred topics