Stars
Edit-R1: Reinforce Image Editing with Diffusion Negative-Aware Finetuning and MLLM Implicit Feedback
[TPAMI 2025] ViewCrafter: Taming Video Diffusion Models for High-fidelity Novel View Synthesis
Official repository for the UAE paper, unified-GRPO, and unified-Bench
ReMoMask: Retrieval-Augmented Masked Motion Generation
Code for the paper "AsFT: Anchoring Safety During LLM Fine-Tuning Within Narrow Safety Basin".
LLM Reasoning Benchmark & Chain-of-Thoughts Dataset for Chemistry
Code for paper "Rethinking Text-based Protein Understanding: Retrieval or LLM?"
[NeurIPS 2025 D&B🔥] OpenS2V-Nexus: A Detailed Benchmark and Million-Scale Dataset for Subject-to-Video Generation
[NeurIPS 2025 D&B🔥] ImgEdit: A Unified Image Editing Dataset and Benchmark
MaSIF: Molecular surface interaction fingerprints. Geometric deep learning to decipher patterns in molecular surfaces.
WISE: A World Knowledge-Informed Semantic Evaluation for Text-to-Image Generation
Tensors and Dynamic neural networks in Python with strong GPU acceleration
GPT as a Monte Carlo Language Tree: A Probabilistic Perspective
[AAAI26] Next Patch Prediction
[CVPR 2025🔥] Enhancing Video VAE by Wavelet-Driven Energy Flow for Latent Video Diffusion Model
Official Repository for the Uni-Mol Series Methods
[ICCV 2025] LLaVA-CoT, a visual language model capable of spontaneous, systematic reasoning
[NeurIPS 2024 D&B Spotlight🔥] ChronoMagic-Bench: A Benchmark for Metamorphic Evaluation of Text-to-Time-lapse Video Generation
Machine Learning models for in vitro enzyme kinetic parameter prediction
Saprot: Protein Language Model with Structural Alphabet (AA+3Di)
⏰ Collaboratively track worldwide conference deadlines (Website, Python CLI, WeChat Applet) / If you find it useful, please star this project, thanks~
Code for the ProteinMPNN paper
A Comprehensive Benchmark and Toolkit for Evaluating Video-based Large Language Models!
[ICLR 2024🔥] Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment