Stars
Official repository for NeurIPS 2025 paper "Understanding and Improving Adversarial Robustness of Neural Probabilistic Circuits"
Official code for FaCT: Faithful Concept Traces for Explaining Neural Network Decisions. NeurIPS 2025
[NeurIPS 2025] Robustness in Both Domains: CLIP Needs a Robust Text Encoder
MMMG: A Massive, Multidisciplinary, Multi-Tier Generation Benchmark for Text-to-Image Reasoning [NeurIPS 2025 Poster]
[NeurIPS 2025] An official source code for paper "Continual Multimodal Contrastive Learning"
[NeurIPS 2025] Leveraging Depth and Language for Open-Vocabulary Domain-Generalized Semantic Segmentation
[Neurips 2025 NextVid Workshop Oral✨] Official Implementation of VideoGen-of-Thought: Step-by-step generating multi-shot video with minimal manual intervention
[NeurIPS 2025][OralGPT & MMOral] Towards Better Dental AI: A Multimodal Benchmark and Instruction Dataset for Digital Dentistry
Compressed Radiation Treatment Planning [NeurIPS'24, MP'2025, PMB'23]
Official implementation of Bifrost-1: Bridging Multimodal LLMs and Diffusion Models with Patch-level CLIP Latents (NeurIPS 2025)
[NeurIPS 2025 Official Codes] Nabla-R2D3: Effective and Efficient 3D Diffusion Alignment with 2D Rewards
[NeurIPS 2025] 4KAgent: Agentic Any Image to 4K Super-Resolution. An intelligent computer vision agent that can magically restore any image to perfect-4K!
[NeurIPS 2025] Official implementation of "XVerse: Consistent Multi-Subject Control of Identity and Semantic Attributes via DiT Modulation".
[NeurIPS 2025 Spotlight] TPA: Tensor ProducT ATTenTion Transformer (T6) (https://arxiv.org/abs/2501.06425)
TransMLA: Multi-Head Latent Attention Is All You Need (NeurIPS 2025 Spotlight)
[NeurIPS 2025] Reinforcement Learning for Reasoning in Large Language Models with One Training Example
[NeurIPS 2025] Efficient Reasoning Vision Language Models
[NeurIPS 2025] Direct3D‑S2: Gigascale 3D Generation Made Easy with Spatial Sparse Attention
[NeurIPS 2025] Codes for paper Foundation Cures Personalization: Improving Personalized Models' Prompt Consistency via Hidden Foundation Knowledge
[NeurIPS 2024 Spotlight ⭐️ & TPAMI 2025] Parameter-Inverted Image Pyramid Networks (PIIP)
[NeurIPS 2025 Spotlight] Q-Insight: Understanding Image Quality via Visual Reinforcement Learning
[NeurIPS 2025] Official PyTorch implementation of paper "CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up".
Official code for NeurIPS 2025 paper "GRIT: Teaching MLLMs to Think with Images"
[NeurIPS 2025] HermesFlow: Seamlessly Closing the Gap in Multimodal Understanding and Generation
[NeurIPS 2025] NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation
Bayesian Low-Rank Adaptation of LLMs: BLoB [NeurIPS 2024] and TFB [NeurIPS 2025]
[NeurIPS 2025] Latent Zoning Networks
[NeurIPS 2025] PanTS: The Pancreatic Tumor Segmentation Dataset