- Massachusetts Institute of Technology
- Cambridge
- https://toytiny.github.io
- https://orcid.org/0000-0001-5287-7128
- in/fangqiang-ding-337239189
- @Toytiny3
- https://www.researchgate.net/profile/Fangqiang-Ding
Stars
A toolbox for skeleton-based action recognition.
A curated paper list of awesome skeleton-based action recognition.
[CVPR 2025 Highlight] PyTorch implementation of "Revealing Key Details to See Differences: A Novel Prototypical Perspective for Skeleton-based Action Recognition"
[CVPR 2024] TokenHMR: Advancing Human Mesh Recovery with a Tokenized Pose Representation
Official repository of Human3.6M 3D WholeBody (H3WB) dataset
Official implementation of CVPR2020 paper "VIBE: Video Inference for Human Body Pose and Shape Estimation"
[NeurIPS2024] MVGamba: Unify 3D Content Generation as State Space Sequence Modeling
[TPAMI] Gamba: Marry Gaussian Splatting with Mamba for single view 3D reconstruction
Stable Virtual Camera: Generative View Synthesis with Diffusion Models
Cosmos-Drive-Dreams: Scalable Synthetic Driving Data Generation with World Foundation Models
[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer
High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models.
A curated list of state-of-the-art research in embodied AI, focusing on vision-language-action (VLA) models, vision-language navigation (VLN), and related multimodal learning approaches.
State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!
Let's make video diffusion practical!
A programmer's guide to cooking at home (Simplified Chinese only).
openvla / openvla
Forked from TRI-ML/prismatic-vlms
OpenVLA: An open-source vision-language-action model for robotic manipulation.
Cosmos-Transfer1-7B-Sample-AV Toolkits
Ray tracing and hybrid rasterization of Gaussian particles
High-Resolution Image Synthesis with Latent Diffusion Models
Code for the paper "FinRL-DeepSeek: LLM-Infused Risk-Sensitive Reinforcement Learning for Trading Agents" arXiv:2502.07393
FinRL®: Financial Reinforcement Learning. 🔥
Stanford-ILIAD / openvla-mini
Forked from openvla/openvla
OpenVLA: An open-source vision-language-action model for robotic manipulation.
The unitree_il_lerobot open-source project is a modification of the LeRobot open-source training framework, enabling the training and testing of data collected using the dual-arm dexterous hands of…
Automate Creation of YouTube Shorts using MoviePy.
Automate the process of making money online.
Official implementation of Continuous 3D Perception Model with Persistent State