vinthony

I may be slow to respond.

Xiaodong Cun vinthony

I may be slow to respond.

Building @GVCLab

684 followers · 208 following

GVC Lab, Great Bay University
Dongguan, China
12:46 (UTC +08:00)
http://vinthony.github.io
@shadocun

Achievements

x4 x2

Achievements

x4 x2

Organizations

Lists (4)

Sort

Stars

GVCLab / EasyOmnimatte

EasyOmnimatte: Taming Pretrained Inpainting Diffusion Models for End-to-End Video Layered Decomposition

3 Updated Dec 25, 2025

Ranger-Liang / CNN-demo

JavaScript 3 Updated Dec 26, 2025

MineDojo / NitroGen

A Foundation Model for Generalist Gaming Agents

Python 968 111 Updated Dec 23, 2025

QwenLM / Qwen-Image-Layered

Qwen-Image-Layered: Layered Decomposition for Inherent Editablity

Python 987 67 Updated Dec 25, 2025

okdalto / ComfyUI-PersonaLive

This is a ComfyUI custom node implementation of 'PersonaLive: Expressive Portrait Image Animation for Live Streaming'.

Python 58 3 Updated Dec 18, 2025

thu-ml / TurboDiffusion

TurboDiffusion: 100–200× Acceleration for Video Diffusion Models

Python 2,068 129 Updated Dec 26, 2025

GVCLab / PersonaLive

PersonaLive! : Expressive Portrait Image Animation for Live Streaming

Python 766 85 Updated Dec 24, 2025

Future-Scholars / paperlib

An open-source academic paper management tool.

TypeScript 2,047 97 Updated Dec 22, 2025

Red-Fairy / ShadowDraw

Python 13 Updated Dec 17, 2025

Tom-roujiang / Awesome-LLM-Quantitative-Trading-Papers

🚀 A curated collection of papers focusing on LLM-based quantitative trading.

3 Updated Dec 22, 2025

Tongyi-MAI / Z-Image

Python 7,901 466 Updated Dec 25, 2025

LTH14 / JiT

PyTorch implementation of JiT https://arxiv.org/abs/2511.13720

Python 1,869 111 Updated Dec 8, 2025

Intellindust-AI-Lab / SKEL-CF

Pytorch implementation of "SKEL-CF: Coarse-to-Fine Biomechanical Skeleton and Surface Mesh Recovery"

Jupyter Notebook 39 3 Updated Dec 17, 2025

ModelTC / LightX2V

Light Video Generation Inference Framework

Python 1,474 97 Updated Dec 26, 2025

kandinskylab / kandinsky-5

Kandinsky 5.0: A family of diffusion models for Video & Image generation

Python 640 42 Updated Dec 22, 2025

facebookresearch / sam-3d-body

The repository provides code for running inference with the SAM 3D Body Model (3DB), links for downloading the trained model checkpoints and datasets, and example notebooks that show how to use the…

Python 2,334 222 Updated Dec 19, 2025

facebookresearch / sam-3d-objects

SAM 3D Objects

Python 5,109 481 Updated Dec 16, 2025

lillian039 / VARC

Python 166 8 Updated Nov 26, 2025

Video-Reason / VMEvalKit

This is a framework for evaluating reasoning in foundational Video Models.

Python 45 5 Updated Dec 21, 2025

meituan-longcat / LongCat-Video

Python 1,704 229 Updated Dec 20, 2025

deepseek-ai / DeepSeek-OCR

Contexts Optical Compression

Python 21,577 1,929 Updated Oct 25, 2025

EzioBy / Ditto

[Preprint 2025] Ditto: Scaling Instruction-Based Video Editing with a High-Quality Synthetic Dataset

Python 540 43 Updated Oct 29, 2025

nv-tlabs / lyra

Lyra: Generative 3D Scene Reconstruction via Video Diffusion Model Self-Distillation

Python 648 36 Updated Oct 2, 2025

bytetriper / RAE

Official PyTorch Implementation of "Diffusion Transformers with Representation Autoencoders"

Python 1,654 55 Updated Nov 15, 2025

karpathy / nanochat

The best ChatGPT that $100 can buy.

Python 39,269 4,980 Updated Dec 23, 2025

Mentra-Community / MentraOS

MentraOS is the leading smart glasses platform + SDK. Stream your view, transcribe audio, talk to AI and capture photos hands-free on compatible glasses.

TypeScript 1,603 201 Updated Dec 26, 2025

thuml / MiniVeo3-Reasoner

Thinking with Videos from Open-Source Priors. We reproduce chain-of-frames visual reasoning by fine-tuning open-source video models. Give it a star 🌟 if you find it useful.

Python 196 7 Updated Oct 12, 2025

TencentARC / RollingForcing

Official Repo for Rolling Forcing: Autoregressive Long Video Diffusion in Real Time

Python 283 12 Updated Oct 31, 2025

zyhbili / MV-Performer

[SIGGRAPH Asia 2025] The official repo for the conference paper "MV-Performer: Taming Video Diffusion Model for Faithful and Synchronized Multi-view Performer Synthesis".

Python 29 1 Updated Dec 13, 2025

Intellindust-AI-Lab / DEIMv2

[DEIMv2] Real Time Object Detection Meets DINOv3

Jupyter Notebook 1,303 132 Updated Dec 13, 2025

Xiaodong Cun vinthony

Organizations

Lists (4)

interesting

Mywork

SVG

Video Models

Stars