kevinmel2000/DC-Gen

DC-Gen: Post-Training Diffusion Acceleration with Deeply Compressed Latent Space

teaser_page1

🔥🔥 News

  • (🔥 New) [2025/9/30] We released the DC-Gen technical report on arXiv.
  • (🔥 New) [2025/9/30] We released DC-AE-Lite.
  • [2025/6] DC-AE 1.5 is accepted by ICCV 2025!

💡 Introduction

DC-Gen is a new acceleration framework for diffusion models. It works with any pre-trained diffusion model, boosting efficiency by transferring the model into a deeply compressed latent space with lightweight post-training. For example, applying DC-Gen to FLUX.1-Krea-12B takes just 40 H100 GPU days. The resulting DC-Gen-FLUX delivers the same quality as the base model while running about 53× faster at 4K resolution on H100. When paired with NVFP4, DC-Gen-FLUX (20 sampling steps) generates a 4K image in only 3.5 seconds on a single NVIDIA 5090 GPU, a total latency reduction of 138× compared to the base FLUX.1-Krea model.

Highlight 1: DC-Gen Enables Native High-Resolution Image Generation with Exceptional Efficiency

  • FLUX does not support 4K image generation due to prohibitive training and inference costs. DC-Gen-FLUX addresses this limitation by reducing token redundancy with DC-AE-f64c128.

teaser_page2

  • DC-Gen-FLUX achieves approximately 53× latency reduction and a 56× throughput improvement over the base model at 4K resolution on H100.

teaser_page3
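To make the token-redundancy argument in Highlight 1 concrete, here is a back-of-the-envelope sketch of how many latent tokens the diffusion transformer has to process. It rests on assumptions that are not stated in this README: that the base FLUX autoencoder downsamples by 8× per side and packs latents into 2×2 patches, and that DC-AE-f64c128 (64× spatial downsampling, 128 latent channels) is used with a patch size of 1. The helper `num_tokens` is a hypothetical name used only for illustration.

```python
def num_tokens(image_size: int, downsample: int, patch_size: int = 1) -> int:
    """Number of latent tokens for a square image.

    image_size: image side length in pixels.
    downsample: spatial downsampling factor of the autoencoder (the "f" in f64c128).
    patch_size: side length of the patches the transformer groups latents into.
    """
    stride = downsample * patch_size
    if image_size % stride != 0:
        raise ValueError("image_size must be divisible by downsample * patch_size")
    side = image_size // stride
    return side * side


# Assumed base setup: f8 autoencoder with 2x2 patches (not stated in this README).
base_tokens = num_tokens(4096, downsample=8, patch_size=2)
# Assumed DC-Gen setup: DC-AE-f64c128 with patch size 1.
dc_tokens = num_tokens(4096, downsample=64, patch_size=1)

print(base_tokens, dc_tokens, base_tokens // dc_tokens)
```

Under these assumptions, a 4K image drops from 65,536 tokens to 4,096 tokens, a 16× reduction. Token count alone does not predict the measured 53× latency reduction, since attention cost grows faster than linearly in the number of tokens while other parts of the pipeline do not shrink with it.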

Highlight 2: DC-Gen Maintains Base Model Quality

FLUX.1-Krea is recognized for its superior realism and text-rendering capabilities but suffers from lower throughput. DC-Gen-FLUX successfully preserves these qualities while delivering a significant speedup over FLUX.1-Krea, achieving the highest throughput among the models compared.

  • Comparison with previous models at 1024×1024 resolution.

teaser_page4

Highlight 3: DC-Gen Facilitates Rapid Autoencoder Adaptation

Previously, changing the autoencoder required retraining diffusion models from scratch, which was highly inefficient. DC-Gen introduces Embedding Alignment to transfer the base model’s knowledge to the new latent space. After this alignment, the model can generate images with correct semantics in the new latent space without finetuning the diffusion model’s weights.

teaser_page5

Following embedding alignment, we can fully recover the quality through LoRA finetuning.

teaser_page5
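For readers unfamiliar with LoRA, the sketch below shows the standard low-rank update in plain Python: the frozen weight `W` is augmented with a trainable product `B @ A` scaled by `alpha / rank`. This is the generic LoRA formulation, not DC-Gen's training code. The rank, scaling factor, layer sizes, and which layers receive adapters are not given in this README, so every value and function name here is a hypothetical placeholder.

```python
import random


def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def init_lora(d_out, d_in, rank, seed=0):
    """Create LoRA factors: A (rank x d_in) is small random, B (d_out x rank) is zero.

    Because B starts at zero, the adapted layer initially matches the frozen layer.
    """
    rng = random.Random(seed)
    a = [[rng.gauss(0.0, 0.01) for _ in range(d_in)] for _ in range(rank)]
    b = [[0.0] * rank for _ in range(d_out)]
    return a, b


def merged_weight(w, a, b, alpha):
    """Return W + (alpha / rank) * B @ A, the weight used after merging the adapter."""
    rank = len(a)
    scale = alpha / rank
    delta = matmul(b, a)  # shape: d_out x d_in
    return [
        [w_ij + scale * d_ij for w_ij, d_ij in zip(w_row, d_row)]
        for w_row, d_row in zip(w, delta)
    ]


def trainable_params(d_out, d_in, rank):
    """Number of trainable values in one LoRA adapter."""
    return rank * (d_in + d_out)


# Hypothetical layer size and rank, chosen only to show the parameter savings.
print(trainable_params(1024, 1024, 16), "trainable vs", 1024 * 1024, "frozen")
```

Only `A` and `B` are updated during finetuning, so the number of trainable values per layer is `rank * (d_in + d_out)` rather than `d_in * d_out`.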

Getting Started

conda create -n dc_gen python=3.10
conda activate dc_gen
pip install -U -r requirements.txt

Content

DC-Gen

  • The code and pretrained models will be released after the legal review is completed.

Deep Compression Autoencoder

Contact

Han Cai

Related Projects

Reference

@article{he2025dcgen,
  title={DC-Gen: Post-Training Diffusion Acceleration with Deeply Compressed Latent Space},
  author={He, Wenkun and Gu, Yuchao and Chen, Junyu and Zou, Dongyun and Lin, Yujun and Zhang, Zhekai and Xi, Haocheng and Li, Muyang and Zhu, Ligeng and Yu, Jincheng and Chen, Junsong and Xie, Enze and Han, Song and Cai, Han},
  journal={arXiv preprint arXiv:2509.25180},
  year={2025}
}

@article{chen2024deep,
  title={Deep Compression Autoencoder for Efficient High-Resolution Diffusion Models},
  author={Chen, Junyu and Cai, Han and Chen, Junsong and Xie, Enze and Yang, Shang and Tang, Haotian and Li, Muyang and Lu, Yao and Han, Song},
  journal={arXiv preprint arXiv:2410.10733},
  year={2024}
}

@article{chen2025dc,
  title={DC-AE 1.5: Accelerating Diffusion Model Convergence with Structured Latent Space},
  author={Chen, Junyu and Zou, Dongyun and He, Wenkun and Chen, Junsong and Xie, Enze and Han, Song and Cai, Han},
  journal={arXiv preprint arXiv:2508.00413},
  year={2025}
}

@misc{zou2025dcaelite,
  title  = {DC-AE-Lite},
  author = {Zou, Dongyun and Chen, Junyu and He, Wenkun and Chen, Junsong and Xie, Enze and Han, Song and Cai, Han},
  url    = {https://github.com/dc-ai-projects/DC-Gen/blob/main/projects/DC-AE-Lite.md},
  month  = Sep,
  year   = {2025}
}
