
DiffusionNFT: Online Diffusion Reinforcement with Forward Process

Python 447 11 Updated Sep 22, 2025

Native Multimodal Models are World Learners

Python 1,298 46 Updated Nov 28, 2025

Matrix-Game 2.0: An Open-Source, Real-Time, and Streaming Interactive World Model

Python 1,752 179 Updated Oct 4, 2025

Optimal Transport Aggregation for Visual Place Recognition

Python 286 29 Updated Oct 27, 2025

[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer

Python 11,803 1,238 Updated Oct 11, 2025

DUSt3R: Geometric 3D Vision Made Easy

Python 6,761 713 Updated Sep 24, 2025

Official Implementation of paper "MonST3R: A Simple Approach for Estimating Geometry in the Presence of Motion"

Python 1,309 80 Updated Jun 16, 2025
Python 15 1 Updated Oct 16, 2025

Code release for Ming-UniVision: Joint Image Understanding and Generation with a Continuous Unified Tokenizer

Python 123 4 Updated Oct 14, 2025

Official implementation of HPSv3: Towards Wide-Spectrum Human Preference Score (ICCV2025)

Python 224 12 Updated Sep 8, 2025

The MATH Dataset (NeurIPS 2021)

Python 1,260 110 Updated Sep 6, 2025

A SOTA open-source image editing model that aims to provide performance comparable to closed-source models like GPT-4o and Gemini 2 Flash.

Python 1,747 82 Updated Nov 27, 2025

Evaluation code for Ref-L4, a new REC benchmark in the LMM era

Python 51 1 Updated Dec 28, 2024

[ICLR & NeurIPS 2025] Repository for Show-o series, One Single Transformer to Unify Multimodal Understanding and Generation.

Python 1,795 77 Updated Oct 22, 2025

Unified layout planning and image generation, ICCV2025

Python 35 1 Updated Apr 14, 2025

Selftok: Discrete Visual Tokens of Autoregression, by Diffusion, and for Reasoning

Python 229 7 Updated May 30, 2025

[ICLR'25 Oral] Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think

Python 1,432 68 Updated Mar 16, 2025

Enjoy the magic of Diffusion models!

Python 10,831 1,015 Updated Nov 30, 2025

Official PyTorch implementation of One-Minute Video Generation with Test-Time Training

Python 2,307 191 Updated Jun 5, 2025

Scaling Vision Pre-Training to 4K Resolution

Python 216 10 Updated Aug 28, 2025

[ICCV 2025] Official implementations for paper: VACE: All-in-One Video Creation and Editing

Python 3,447 241 Updated Oct 17, 2025

A demo for the Direct Ascent Synthesis: Hidden Generative Capabilities in Discriminative Models paper (https://arxiv.org/abs/2502.07753)

Jupyter Notebook 41 1 Updated Mar 5, 2025

This is a repo to track the latest autoregressive visual generation papers.

412 5 Updated Jun 25, 2025

Next-Token Prediction is All You Need

Python 2,255 89 Updated Nov 19, 2025

OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340

Jupyter Notebook 4,286 365 Updated Jun 15, 2025

[ICLR2025] A versatile image-to-image visual assistant, designed for image generation, manipulation, and translation based on free-form user instructions.

Python 210 3 Updated May 5, 2025

This is a list of papers on the topic of how machine learning methods (including AI/LLM) are leveraged for specific tasks in quantum physics scenarios. (ML/AI/LLM for quantum science)

6 Updated Sep 3, 2024

Updating BibTeX entries with information from dblp

Python 4 2 Updated Feb 9, 2024