Zijian Zhou1,2, Shikun Liu1, Haozhe Liu1, Haonan Qiu1, Zhaochong An1, Weiming Ren1, Zhiheng Liu1, Xiaoke Huang1, Kam Woh Ng1, Tian Xie1, Xiao Han1, Yuren Cong1, Hang Li1, Chuyan Zhu1, Aditya Patel1, Tao Xiang1, Sen He1
1 Meta AI 2 King's College London
The training and inference code will be released once it has been cleaned up. Please stay tuned.
Saber is a scalable zero-shot framework for reference-to-video (R2V) generation. By introducing a masked training strategy, Saber sidesteps the data bottleneck of explicit reference-image–video–text triplet datasets: it trains exclusively on video-text pairs, yet achieves zero-shot R2V generation.
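The exact masking scheme is described in the paper; as a rough illustration of the general idea, pseudo-reference inputs can be derived from the video itself, so that only video-text pairs are needed for training. A minimal sketch (the function name, patch-style masking, and all parameters below are illustrative assumptions, not Saber's actual implementation):

```python
import numpy as np

def make_pseudo_references(video, num_refs=2, mask_ratio=0.5, seed=None):
    """Sample frames from a video clip and mask out random pixels,
    producing pseudo reference images for training (hypothetical sketch).

    video: float array of shape (T, H, W, C).
    Returns an array of shape (num_refs, H, W, C).
    """
    rng = np.random.default_rng(seed)
    T, H, W, C = video.shape
    frame_ids = rng.choice(T, size=num_refs, replace=False)
    refs = []
    for t in frame_ids:
        ref = video[t].copy()
        mask = rng.random((H, W)) < mask_ratio  # per-pixel Bernoulli mask
        ref[mask] = 0.0  # zero out masked regions
        refs.append(ref)
    return np.stack(refs)

# Toy example: an 8-frame, 16x16 RGB clip of all ones.
video = np.ones((8, 16, 16, 3), dtype=np.float32)
refs = make_pseudo_references(video, num_refs=2, mask_ratio=0.5, seed=0)
print(refs.shape)  # (2, 16, 16, 3)
```

Because the masked references come from the video itself, every ordinary video-text pair yields a (reference, video, text) training example for free, which is what removes the need for curated R2V triplet data.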
If you find Saber useful for your research, please cite our paper:
@article{zhou2025scaling,
  title={Scaling Zero-Shot Reference-to-Video Generation},
  author={Zhou, Zijian and Liu, Shikun and Liu, Haozhe and Qiu, Haonan and An, Zhaochong and Ren, Weiming and Liu, Zhiheng and Huang, Xiaoke and Ng, Kam Woh and Xie, Tian and Han, Xiao and Cong, Yuren and Li, Hang and Zhu, Chuyan and Patel, Aditya and Xiang, Tao and He, Sen},
  journal={arXiv preprint arXiv:2512.06905},
  year={2025}
}