View cfeng16's full-sized avatar
🐢

Highlights

  • Pro


Let's finetune video generation models!

Python 532 29 Updated Sep 15, 2025

PyTorch implementation of "VFM-VAE" (arXiv:2510.18457).

Python 24 1 Updated Jan 8, 2026

Official PyTorch Implementation of "Diffusion Transformers with Representation Autoencoders"

Python 1,689 58 Updated Dec 26, 2025

A comprehensive JAX/NNX library for diffusion and flow matching generative algorithms, featuring DiT (Diffusion Transformer) and its variants as the primary backbone with support for ImageNet train…

Python 127 9 Updated Oct 16, 2025

Open-source unified multimodal model

Python 5,559 487 Updated Oct 27, 2025

Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.

Python 6,970 404 Updated Dec 31, 2025

The Sound of Simulation (CoRL 2025 Best Paper Finalist)

Python 10 1 Updated Sep 22, 2025

Tools for checking ACL paper submissions

Python 887 59 Updated Dec 6, 2025

[ACM MM Award] AV-Deepfake1M: A Large-Scale LLM-Driven Audio-Visual Deepfake Dataset

Python 170 11 Updated Aug 3, 2025

NeurIPS 2025 Spotlight; ICLR2024 Spotlight; CVPR 2024; EMNLP 2024

Python 1,800 75 Updated Nov 27, 2025
Python 24 1 Updated Jun 18, 2025

Official codebase for "Self Forcing: Bridging Training and Inference in Autoregressive Video Diffusion" (NeurIPS 2025 Spotlight)

Python 3,052 226 Updated Sep 12, 2025
Python 24 Updated May 23, 2025

A framework for few-shot evaluation of language models.

Python 11,163 2,959 Updated Jan 13, 2026
Python 3,905 660 Updated Aug 24, 2025

[NeurIPS 2023] A faithful benchmark for vision-language compositionality

Python 89 10 Updated Feb 13, 2024

Lightweight coding agent that runs in your terminal

Rust 56,051 7,208 Updated Jan 13, 2026

A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.

Python 3,980 291 Updated Jan 5, 2026

[ICCV'25 Best Paper Finalist] ReCamMaster: Camera-Controlled Generative Rendering from A Single Video

Python 1,704 81 Updated Nov 28, 2025
Python 105 6 Updated Jun 10, 2025

Tracking Any Point (TAP)

Jupyter Notebook 1,775 174 Updated Oct 16, 2025

xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism

Python 2,500 302 Updated Jan 12, 2026

This repository contains the source code for the paper First Order Motion Model for Image Animation

Jupyter Notebook 14,985 3,289 Updated Nov 14, 2024

Code for Scaling Language-Free Visual Representation Learning (WebSSL)

245 2 Updated Apr 24, 2025

Train vision models using JAX and 🤗 transformers

Python 100 11 Updated Dec 14, 2025

unofficial implementation of DiffMAE

Python 17 4 Updated May 31, 2024

Official repo for CFG-Zero*

Python 698 24 Updated May 2, 2025

[ICML 2024] CLLMs: Consistency Large Language Models

Python 412 19 Updated Nov 16, 2024

Code for the paper "Vamba: Understanding Hour-Long Videos with Hybrid Mamba-Transformers" [ICCV 2025]

Python 98 10 Updated Jul 28, 2025

DataComp: In search of the next generation of multimodal datasets

Python 765 64 Updated Apr 28, 2025