Skip to content
View ionvision's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report ionvision

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

[ICLR 2026] Thinking with Camera: A Unified Multimodal Model for Camera-Centric Understanding and Generation

Python 381 12 Updated Feb 18, 2026

[AAAI 2026] VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation

Python 377 12 Updated Mar 26, 2025

StreamingVLM: Real-Time Understanding for Infinite Video Streams

Python 876 58 Updated Oct 15, 2025

[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation

Python 1,636 83 Updated Oct 29, 2025

SigLIP-based Aesthetic Score Predictor

Python 382 9 Updated Dec 18, 2024

Carbon Language's main repository: documents, design, implementation, and related tools. (NOTE: Carbon Language is experimental; see README)

C++ 33,637 1,525 Updated Feb 18, 2026

Turn any computer or edge device into a command center for your computer vision projects.

Python 2,194 245 Updated Feb 18, 2026

[ICLR 2026] GenCompositor: Generative Video Compositing with Diffusion Transformer

Python 148 6 Updated Feb 18, 2026

[ICLR 2026] Streamlining Cartoon Production with Generative Post-Keyframing

Python 542 52 Updated Aug 20, 2025

Lumina-DiMOO - An Open-Sourced Multi-Modal Large Diffusion Language Model

Python 937 58 Updated Dec 27, 2025

[ICLR 2026] Official Repo for Rolling Forcing: Autoregressive Long Video Diffusion in Real Time

Python 323 13 Updated Oct 31, 2025

[ICML2025, NeurIPS2025 Spotlight] Sparse VideoGen 1 & 2: Accelerating Video Diffusion Transformers with Sparse Attention

Python 627 38 Updated Feb 3, 2026

Online resources for Python Crash Course, 3rd edition, from No Starch Press.

Python 2,130 882 Updated Dec 1, 2025
Python 81 Updated Oct 18, 2025

DC-VideoGen: Efficient Video Generation with Deep Compression Video Autoencoder

178 7 Updated Oct 5, 2025

FlashInfer: Kernel Library for LLM Serving

Python 4,988 721 Updated Feb 18, 2026

DC-Gen: Post-Training Diffusion Acceleration with Deeply Compressed Latent Space

Python 346 9 Updated Oct 5, 2025

⚡️SwanLab - an open-source, modern-design AI training tracking and visualization tool. Supports Cloud / Self-hosted use. Integrated with PyTorch / Transformers / verl / LLaMA Factory / ms-swift / U…

Python 3,600 186 Updated Feb 12, 2026

LiteRT, successor to TensorFlow Lite. is Google's On-device framework for high-performance ML & GenAI deployment on edge platforms, via efficient conversion, runtime, and optimization

C++ 1,482 196 Updated Feb 18, 2026

Nano vLLM

Python 11,726 1,585 Updated Nov 3, 2025

Generate a timeline of your day, automatically

Swift 5,742 292 Updated Feb 6, 2026

[ICCV 2025] Official pytorch implementation of "FramePainter: Endowing Interactive Image Editing with Video Diffusion Priors"

Python 407 11 Updated Mar 10, 2025

Official codebase for "Self Forcing: Bridging Training and Inference in Autoregressive Video Diffusion" (NeurIPS 2025 Spotlight)

Python 3,138 241 Updated Sep 12, 2025

Reference PyTorch implementation and models for DINOv3

Jupyter Notebook 9,600 740 Updated Feb 17, 2026

🚀 「大模型」1小时从0训练26M参数的视觉多模态VLM!🌏 Train a 26M-parameter VLM from scratch in just 1 hours!

Python 6,421 701 Updated Feb 4, 2026

Diffusion model(SD,Flux,Wan,Qwen Image,Z-Image,...) inference in pure C/C++

C++ 5,415 533 Updated Feb 10, 2026

A unified inference and post-training framework for accelerated video generation.

Python 3,088 266 Updated Feb 18, 2026

Generative Omnimatte (CVPR 2025)

Python 166 14 Updated Jun 3, 2025
Next