Highlights
- Pro
Lists (32)
Sort Name ascending (A-Z)
2d-gen
2d-tryon
3d-gen
4dgs
agent or ai coding
AI note
ar
calib
comfyui
dataset
depth
detection | segmentation
diffusion model
gpu-acc
graphics
gs & mesh
gs render
✨ Inspiration
lessons
opti flow
pose alignment & pose about
reconstruction
segment
sfm | mvs
slam
tools
tracking
tts
video about
visual & llm
vla | embody ai
volumetric video
Stars
Development repository for the Triton language and compiler
VerseCrafter: Dynamic Realistic Video World Model with 4D Geometric Control
Orient Anything V2, NeurIPS 2025 Spotlight
Context management for Claude Code. Hooks maintain state via ledgers and handoffs. MCP execution without context pollution. Agent orchestration with isolated context windows.
DreamID-V: Bridging the Image-to-Video Gap for High-Fidelity Face Swapping via Diffusion Transformer
Tinker ↔ KernelBench Integration enabling RL for GPU Kernel Generation
[TRO 2022] Observability-Aware Intrinsic and Extrinsic Calibration of LiDAR-IMU Systems
OCR model that handles complex tables, forms, handwriting with full layout.
PointWorld: Scaling 3D World Models for In-The-Wild Robotic Manipulation
MiroThinker is an open-source search agent model, built for tool-augmented reasoning and real-world information seeking, aiming to match the deep research experience of OpenAI Deep Research and Gem…
Code for "InfiniDepth: Arbitrary-Resolution and Fine-Grained Depth Estimation with Neural Implicit Fields"
✨✨[NeurIPS 2025] VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction
Unity MCP acts as a bridge, allowing AI assistants (like Claude, Cursor) to interact directly with your Unity Editor via a local MCP (Model Context Protocol) Client. Give your LLM tools to manage a…
[ASPLOS 2026] CLM: Removing the GPU Memory Barrier for 3D Gaussian Splatting with CPU Offloading
Official Python inference and LoRA trainer package for the LTX-2 audio–video generative model.
[arXiv 2025] Official implementation of BiCo: Composing Concepts from Images and Videos via Concept-prompt Binding
ManifoldPlus: A Robust and Scalable Watertight Manifold Surface Generation Method for Triangle Soups
🚀 AI 全自动短视频引擎 | AI Fully Automated Short Video Engine
NeoVerse: Enhancing 4D World Model with in-the-wild Monocular Videos
TabletopGen: Instance-Level Interactive 3D Tabletop Scene Generation from Text or Single Image
一个基于nano banana pro🍌的原生AI PPT生成应用,迈向真正的"Vibe PPT"; 支持上传任意模板图片;上传任意素材&智能解析;一句话/大纲/页面描述自动生成PPT;口头修改指定区域、一键导出可编辑ppt - An AI-native PPT generator based on nano banana pro🍌
How can we build a true AI agent? Like Claude Code.
A curated list of recent diffusion models for video generation, editing, and various other applications.
Get 10X more out of Claude Code, Codex or any coding agent
HY-Motion model for 3D character animation generation.
An interface library for RL post training with environments.
SpotEdit:Selective Region Editing in Diffusion Transformers