Stars
Directly Aligning the Full Diffusion Trajectory with Fine-Grained Human Preference
Implementation of "S^2-Guidance: Stochastic Self Guidance for Training-Free Enhancement of Diffusion Models"
Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.
[NeurIPS 2025] 4KAgent: Agentic Any Image to 4K Super-Resolution. An intelligent computer vision agent that can magically restore any image to perfect-4K!
📚 AIGC 求职面经、必备基础知识、提示词工程、ChatGPT、Stable Diffusion、Prompt、Embedding、Fintune 等 AIGC 求职你所需要知道的一切~
Qwen2.5-Omni is an end-to-end multimodal model by Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, video, and performing real-time speech generation.
In-context subject-driven image generation while preserving foreground fidelity
Piece it Together: Part-Based Concepting with IP-Priors
An open-source auto clicker on images for Android
[CVPR2025] We present StableAnimator, the first end-to-end ID-preserving video diffusion framework, which synthesizes high-quality videos without any post-processing, conditioned on a reference ima…
A Swift library that uses the Accelerate framework to provide high-performance functions for matrix math, digital signal processing, and image manipulation.
JNeRF is a NeRF benchmark based on Jittor. JNeRF re-implemented instant-ngp and achieved same performance with original paper.
A real-time motion capture system for 3D virtual character animating.
An Blender addon uses ROMP to extract human's 3D poses from image, video or webcam and drive your own 3D character.
[CVPR 2021 Oral]Bidirectional Projection Network for Cross Dimension Scene Understanding
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
A curated list of awesome ARKit projects and resources. Feel free to contribute!
Augmented reality (AR) development resources(增强现实开发资源汇总)---AIRX整理
[AAAI 2021] EMLight: Lighting Estimation via Spherical Distribution Approximation, [TIP] GMLight: Lighting Estimation via Geometric Distribution Approximation
Hypersim: A Photorealistic Synthetic Dataset for Holistic Indoor Scene Understanding
SE-SSD: Self-Ensembling Single-Stage Object Detector From Point Cloud, CVPR 2021.
List of awesome papers on Intrinsic Decomposition & Inverse Rendering
A template for modern C++ projects using CMake, Clang-Format, CI, unit testing and more, with support for downstream inclusion.
Neural Geometric Level of Detail: Real-time Rendering with Implicit 3D Shapes (CVPR 2021 Oral)