Skip to content
View PkuRainBow's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report PkuRainBow

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results
Python 24 Updated Jul 16, 2025

[NeurIPS' 2025] JarvisArt: Liberating Human Artistic Creativity via an Intelligent Photo Retouching Agent

JavaScript 689 24 Updated Nov 15, 2025
851 50 Updated Aug 30, 2025

aider is AI pair programming in your terminal

Python 38,661 3,699 Updated Nov 22, 2025

Official implementation of Inductive Moment Matching

Python 564 13 Updated Jul 11, 2025

Muon is an optimizer for hidden layers in neural networks

Python 2,053 96 Updated Nov 23, 2025

A general framework for inference-time scaling and steering of diffusion models with arbitrary rewards.

Jupyter Notebook 197 13 Updated Jun 26, 2025

Single-pass Adaptive Image Tokenization for Minimum Program Search | What's the Kolmogorov Complexity of an Image?

Jupyter Notebook 42 2 Updated Jul 26, 2025

Kimi K2 is the large language model series developed by Moonshot AI team

9,609 682 Updated Nov 7, 2025

Skywork-R1V is an advanced multimodal AI model series developed by Skywork AI (Kunlun Inc.), specializing in vision-language reasoning.

Python 3,101 272 Updated Nov 19, 2025

SkyReels-V2: Infinite-length Film Generative model

Python 5,041 740 Updated Aug 11, 2025

PyTorch Code for Energy-Based Transformers paper -- generalizable reasoning and scalable learning

Python 561 74 Updated Nov 12, 2025

The world's first open-source multimodal creative assistant This is a substitute for Canva and Manus that prioritizes privacy and is usable locally.

TypeScript 5,189 460 Updated Nov 10, 2025

Official implementation for "SVGFusion: Scalable Text-to-SVG Generation via Vector Space Diffusion" https://arxiv.org/abs/2412.10437

67 1 Updated Dec 13, 2024

Official DINO-X Model Context Protocol (MCP) server that empowers LLMs with real-world visual perception through image object detection, localization, and captioning APIs.

TypeScript 105 10 Updated Oct 28, 2025

Rex-Thinker: Grounded Object Refering via Chain-of-Thought Reasoning

Python 127 6 Updated Jun 30, 2025

OmniGen2: Exploration to Advanced Multimodal Generation.

Jupyter Notebook 3,949 9 Updated Sep 30, 2025

Rethinking High-Quality Aesthetic Poster Generation in a Unified Framework

Python 501 30 Updated Sep 23, 2025

FlexTok: Resampling Images into 1D Token Sequences of Flexible Length

Jupyter Notebook 274 14 Updated Jun 2, 2025
Python 24 1 Updated Jun 18, 2025

Roblox Foundation Model for 3D Intelligence

Jupyter Notebook 863 79 Updated Jul 22, 2025

[NeurIPS 2025 Oral] Exploring Diffusion Transformer Designs via Grafting

Jupyter Notebook 64 2 Updated Jun 18, 2025

This is official Pytorch implementation of "Rethinking the necessity of image fusion in high-level vision tasks: A practical infrared and visible image fusion network based on progressive semantic …

Python 203 10 Updated Apr 28, 2025

Cosmos-Predict2 is a collection of general-purpose world foundation models for Physical AI that can be fine-tuned into customized world models for downstream applications.

Python 675 90 Updated Oct 29, 2025

MiniMax-M1, the world's first open-weight, large-scale hybrid-attention reasoning model.

Python 2,997 264 Updated Jul 7, 2025

MMMG: A Massive, Multidisciplinary, Multi-Tier Generation Benchmark for Text-to-Image Reasoning [NeurIPS 2025 Poster]

Python 21 Updated Oct 9, 2025

PrismLayers: Open Data for High-Quality Multi-Layer Transparent Image Generative Models

Jupyter Notebook 22 1 Updated Aug 11, 2025

Mobile-Agent: The Powerful GUI Agent Family

Python 6,428 651 Updated Nov 26, 2025

[ECCV2024 Oral] Official implementation of the paper "Relation DETR: Exploring Explicit Position Relation Prior for Object Detection"

Python 245 19 Updated Nov 24, 2024
Next