Skip to content
View hxy-123's full-sized avatar

Block or report hxy-123

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A Large-scale Video Action Dataset

Python 236 5 Updated Jan 16, 2026

Official implementation of Video-DPM

Python 84 3 Updated Jan 16, 2026
Python 88 4 Updated Jan 18, 2026

The repository provides code for EgoMAN model and dataset creation scripts.

Python 20 Updated Dec 31, 2025

InternVLA-A1: Unifying Understanding, Generation, and Action for Robotic Manipulation​

Python 272 14 Updated Jan 16, 2026

Code for "InfiniDepth: Arbitrary-Resolution and Fine-Grained Depth Estimation with Neural Implicit Fields"

446 9 Updated Jan 7, 2026

Fast-FoundationStereo: Real-Time Zero-Shot Stereo Matching

414 8 Updated Dec 18, 2025

Decoupled Q-Chunking

Python 49 3 Updated Jan 10, 2026

Any4D: Unified Feed-Forward Metric 4D Reconstruction

Python 239 5 Updated Dec 12, 2025

PaCoRe: Learning to Scale Test-Time Compute with Parallel Coordinated Reasoning

Python 276 6 Updated Jan 13, 2026

[NeurIPS 2025]"DynamicVerse: A Physically-Aware Multimodal Framework for 4D World Modeling"

Python 92 3 Updated Dec 21, 2025

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 17,852 1,530 Updated Jan 4, 2026

SAM 3D Objects

Python 5,667 603 Updated Jan 9, 2026

The repository provides code for running inference with the SAM 3D Body Model (3DB), links for downloading the trained model checkpoints and datasets, and example notebooks that show how to use the…

Python 2,501 252 Updated Dec 19, 2025

The repository provides code for running inference and finetuning with the Meta Segment Anything Model 3 (SAM 3), links for downloading the trained model checkpoints, and example notebooks that sho…

Python 7,126 897 Updated Jan 12, 2026

Official Pytorch Implementation for "Time-to-Move: Training-Free Motion Controlled Video Generation via Dual-Clock Denoising"

Python 319 26 Updated Dec 23, 2025

Depth Anything 3

Python 4,032 362 Updated Dec 12, 2025

Scaling Novel View Synthesis for Static and Dynamic Scenes

Python 577 Updated Oct 26, 2025

Fast and Universal 3D reconstruction model for versatile tasks

Python 955 87 Updated Jan 9, 2026

StreamingVLM: Real-Time Understanding for Infinite Video Streams

Python 838 54 Updated Oct 15, 2025

Automatically claims free games and DLCs on the Epic Games Store, Amazon Prime Gaming and GOG.

JavaScript 3,941 225 Updated Dec 31, 2025

Native Multimodal Models are World Learners

Python 1,402 54 Updated Dec 30, 2025
Python 310 19 Updated Oct 30, 2025

Python wrapper for the NVIDIA cuSFM library

Python 212 16 Updated Dec 9, 2025

Official code for paper "InstantSfM: Fully Sparse and Parallel Structure-from-Motion"

Python 380 25 Updated Jan 17, 2026

The best ChatGPT that $100 can buy.

Python 40,438 5,227 Updated Jan 18, 2026

Official PyTorch Implementation of "Diffusion Transformers with Representation Autoencoders"

Python 1,700 59 Updated Dec 26, 2025

Hunyuan3D-Omni: A Unified Framework for Controllable Generation of 3D Assets

Python 497 44 Updated Oct 17, 2025
Next