sheqian36

sheqian36

3 followers · 7 following

Lists (5)

Sort

Stars

bytedance / ATI

Official implementation of ATI: Any Trajectory Instruction for Controllable Video Generation. https://arxiv.org/pdf/2505.22944

Python 317 16 Updated Aug 7, 2025

wz0919 / EPiC

Official implementation of EPiC: Efficient Video Camera Control Learning with Precise Anchor-Video Guidance

Python 45 1 Updated Jun 2, 2025

KwaiVGI / SynCamMaster

[ICLR'25] SynCamMaster: Synchronizing Multi-Camera Video Generation from Diverse Viewpoints

Python 640 18 Updated May 23, 2025

KwaiVGI / ReCamMaster

[ICCV'25 Best Paper Finalist] ReCamMaster: Camera-Controlled Generative Rendering from A Single Video

Python 1,591 76 Updated Oct 23, 2025

alibaba-damo-academy / Uni3C

Uni3C: Unifying Precisely 3D-Enhanced Camera and Human Motion Controls for Video Generation [Siggraph Asian 2025]

Python 428 23 Updated Sep 21, 2025

deepseek-ai / DeepSeek-OCR

Contexts Optical Compression

Python 19,728 1,391 Updated Oct 25, 2025

GoatWu / Self-Forcing-Plus

Forked from guandeh17/Self-Forcing

Unofficial extension implementation of Self-Forcing to support I2V && 14B training.

Python 235 15 Updated Sep 29, 2025

hiyouga / LLaMA-Factory

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 61,982 7,494 Updated Nov 6, 2025

ModelTC / Wan2.2-Lightning

Forked from Wan-Video/Wan2.2

Wan2.2-Lightning: Speed up wan2.2 model with distillation

Python 214 13 Updated Sep 28, 2025

QwenLM / Qwen-Image

Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.

Python 5,932 324 Updated Nov 7, 2025

Tencent-Hunyuan / SRPO

Directly Aligning the Full Diffusion Trajectory with Fine-Grained Human Preference

Python 1,167 37 Updated Oct 26, 2025

guandeh17 / Self-Forcing

Official codebase for "Self Forcing: Bridging Training and Inference in Autoregressive Video Diffusion" (NeurIPS 2025 Spotlight)

Python 2,798 198 Updated Sep 12, 2025

yifan123 / flow_grpo

[NeurIPS 2025] An official implementation of Flow-GRPO: Training Flow Matching Models via Online RL

Python 1,550 83 Updated Nov 4, 2025

WeThinkIn / AIGC-Interview-Book

【三年面试五年模拟】AIGC算法工程师面试秘籍。涵盖AIGC、传统深度学习、自动驾驶、AI Agent、机器学习、计算机视觉、自然语言处理、强化学习、大数据挖掘、具身智能、元宇宙、AGI等AI行业面试笔试干货经验与核心知识。

2,480 284 Updated Nov 5, 2025

EmbraceAGI / AIGC_Interview

📚 AIGC 求职面经、必备基础知识、提示词工程、ChatGPT、Stable Diffusion、Prompt、Embedding、Fintune 等 AIGC 求职你所需要知道的一切~

737 59 Updated Jun 26, 2024

OpenGVLab / InternVL

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

Python 9,431 734 Updated Sep 22, 2025

Fantasy-AMAP / fantasy-talking2

FantasyTalking2: Timestep-Layer Adaptive Preference Optimization for Audio-Driven Portrait Animation

58 2 Updated Aug 20, 2025

zai-org / GLM-V

GLM-4.5V and GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

Python 1,732 103 Updated Oct 28, 2025

ModelTC / LightX2V

Light Video Generation Inference Framework

Python 764 47 Updated Nov 6, 2025

tulerfeng / Video-R1

Video-R1: Reinforcing Video Reasoning in MLLMs [🔥the first paper to explore R1 for video]

Python 730 39 Updated Sep 19, 2025

tdrussell / diffusion-pipe

A pipeline parallel training script for diffusion models.

Python 1,693 227 Updated Nov 5, 2025

Phantom-video / Phantom

Phantom: Subject-Consistent Video Generation via Cross-Modal Alignment

Python 1,447 90 Updated Sep 11, 2025

Wan-Video / Wan2.2

Wan: Open and Advanced Large-Scale Video Generative Models

Python 11,459 1,273 Updated Oct 12, 2025

ABTols / ColorSurge

[Siggraph2025] The official code of the paper "ColorSurge: Bringing Vibrancy and Efficiency to Automatic Video Colorization via Dual-Branch Fusion"

Python 11 1 Updated Jul 26, 2025

zhaoyuzhi / SVCNet

SVCNet: Scribble-based Video Colorization Network with Temporal Aggregation. IEEE TIP, 2023

Python 17 1 Updated Jul 21, 2025

CIntellifusion / VideoDPO

Official Implementation of VideoDPO

Python 146 2 Updated Jun 1, 2025

TencentARC / VideoPainter

[SIGGRAPH2025] Official repo for paper "Any-length Video Inpainting and Editing with Plug-and-Play Context Control"

Python 513 31 Updated Apr 8, 2025

TIGER-AI-Lab / AnyV2V

Code and data for "AnyV2V: A Tuning-Free Framework For Any Video-to-Video Editing Tasks" [TMLR 2024]

Jupyter Notebook 629 46 Updated Oct 29, 2024

bilibili / Index-anisora

Python 2,211 115 Updated Nov 2, 2025

liangyuwang / Tiny-DeepSpeed

Tiny-DeepSpeed, a minimalistic re-implementation of the DeepSpeed library

Python 48 8 Updated Aug 20, 2025

sheqian36

Lists (5)

LLM

ML

multimodal

steamtools

上色

Stars