Pinned
- FoundationVision/VAR (Public): [NeurIPS 2024 Best Paper Award] [GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". A…
- FoundationVision/Waver (Public): A video foundation model for unified Text-to-Video (T2V) and Image-to-Video (I2V) generation.
- PeizeSun/SparseR-CNN (Public): [CVPR 2021, PAMI 2023] End-to-End Object Detection with Learnable Proposal
- FoundationVision/Infinity (Public): [CVPR 2025 Oral] Infinity ∞: Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis
- FoundationVision/Liquid (Public): (Accepted by IJCV) Liquid: Language Models are Scalable and Unified Multi-modal Generators
- FoundationVision/UniTok (Public): [NeurIPS 2025 Spotlight] A Unified Tokenizer for Visual Generation and Understanding