Lists (23)
Sort Name ascending (A-Z)
CG
DriveWithLLM
Dynamic3DReconstruction
e2e-AD
EgoCentric
Embodied Intellignce
GPT系列
OCC
Real-to-Sim-to-Real
robot
simulation
SLAM
StableDiffusion
VLA
worldModel
三维重建
动作生成
娱乐向CV任务
定位与建图
局部矢量地图
工具类
强化学习
深度估计
Starred repositories
Cosmos-Predict2.5, the latest version of the Cosmos World Foundation Models (WFMs) family, specialized for simulating and predicting the future state of the world in the form of video.
OmniNWM: Omniscient Navigation World Models for Autonomous Driving
Official repository for the paper "MVP4D: Multi-View Portrait Video Diffusion for Animatable 4D Avatars"
Fast and Universal 3D reconstruction model for versatile tasks
[ICCV 2025 Highlight] Geo4D: Leveraging Video Generators for Geometric 4D Scene Reconstruction
[CAAI AIR'24] Bilateral Reference for High-Resolution Dichotomous Image Segmentation
[Arxiv 2025] FlexPainter: Flexible and Multi-View Consistent Texture Generation
TRAM: Global Trajectory and Motion of 3D Humans from in-the-wild Videos
LeGO-LOAM, LIO-SAM, LVI-SAM, FAST-LIO2, Faster-LIO, VoxelMap, R3LIVE, Point-LIO, KISS-ICP, DLO, DLIO, Ada-LIO, PV-LIO, SLAMesh, ImMesh, FAST-LIO-MULTI, M-LOAM, LOCUS, SLICT, MA-LIO, CT-ICP, GenZ-IC…
Code for SIGGRAPH 2020 paper "RigNet: Neural Rigging for Articulated Characters"
Official Implementation of MAMM: Motion Control via Metric-Aligning Motion Matching
Synthetic animal image dataset for pose and shape reconstruction.
[ICCV 2025] TeRA: Rethinking Text-guided Realistic 3D Avatar Generation
Code of π^3: Permutation-Equivariant Visual Geometry Learning
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
[ICCV 2025] GaussianSpeech: Audio-Driven Gaussian Avatars
[ICCV 2025] Official Pytorch Implementation of FLOAT: Generative Motion Latent Flow Matching for Audio-driven Talking Portrait.
[ICCV 2025] Nautilus: Locality-aware Autoencoder for Scalable Mesh Generation
[ICCV 2025] FaceLift: Learning Generalizable Single Image 3D Face Reconstruction from Synthetic Heads
[CVPR 2025] Analyzing the Synthetic-to-Real Domain Gap in 3D Hand Pose Estimation
This repository contains the official implementation of "The Language of Motion: Unifying Verbal and Non-verbal Language of 3D Human Motion".