Skip to content
View dragonlong's full-sized avatar

Block or report dragonlong

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Simulation benchmark from Toyota Research Institute containing 49 tasks that measure the performance of Large Behavior Model policies

Python 41 Updated Nov 6, 2025

Post-training scripts and samples for NVIDIA Cosmos ecosystem

Python 68 21 Updated Nov 26, 2025

Collection of step-by-step playbooks for setting up AI/ML workloads on NVIDIA DGX Spark devices with Blackwell architecture.

TypeScript 212 67 Updated Nov 25, 2025

The best ChatGPT that $100 can buy.

Python 37,645 4,615 Updated Nov 17, 2025

Official code of RDT 2

Python 586 26 Updated Oct 11, 2025

[IROS 2025 Award Finalist] The Large-scale Manipulation Platform for Scalable and Intelligent Embodied Systems

Python 2,624 181 Updated Oct 27, 2025

A novel Multimodal Large Language Model (MLLM) architecture, designed to structurally align visual and textual embeddings.

Python 1,411 83 Updated Sep 22, 2025

GEOBench-VLM: Benchmarking Vision-Language Models for Geospatial Tasks

Python 86 6 Updated Jul 1, 2025
Python 462 36 Updated Aug 28, 2025

Enjoy the magic of Diffusion models!

Python 10,812 1,015 Updated Nov 27, 2025

🌀 R2-Tuning: Efficient Image-to-Video Transfer Learning for Video Temporal Grounding (ECCV 2024)

Python 90 4 Updated Jul 2, 2024

The official Python SDK for Model Context Protocol servers and clients

Python 20,294 2,808 Updated Nov 27, 2025

Efficient Part-level 3D Object Generation via Dual Volume Packing

Python 778 67 Updated Jun 26, 2025

Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL.

Python 494 44 Updated Sep 8, 2025

Online video temporal grounding

Python 11 Updated Oct 20, 2025

ICML 2025 Hypo3D: Exploring Hypothetical Reasoning in 3D

Python 43 6 Updated Jul 17, 2025

Python package to create manipulation scenes.

Python 209 21 Updated Jun 28, 2025

[NeurIPS 2025 Spotlight] SoFar: Language-Grounded Orientation Bridges Spatial Reasoning and Object Manipulation

Python 203 8 Updated Jun 30, 2025

Train your Agent model via our easy and efficient framework

Python 1,632 155 Updated Nov 17, 2025

Reference PyTorch implementation and models for DINOv3

Jupyter Notebook 8,566 613 Updated Nov 20, 2025

Wan: Open and Advanced Large-Scale Video Generative Models

Python 12,097 1,372 Updated Nov 14, 2025

The absolute trainer to light up AI agents.

Python 8,952 715 Updated Nov 27, 2025

[AAAI 26 Demo] Offical repo for CAT-V - Caption Anything in Video: Object-centric Dense Video Captioning with Spatiotemporal Multimodal Prompting

Python 59 4 Updated Oct 30, 2025

[NeurIPS 2023 D&B] VidChapters-7M: Video Chapters at Scale

Jupyter Notebook 200 23 Updated Nov 13, 2023

Code release for the paper "Progress-Aware Video Frame Captioning" (CVPR 2025)

Python 18 1 Updated Jul 16, 2025

3D-R1: Enhancing Reasoning in 3D VLMs for Unified Scene Understanding

Python 365 12 Updated Nov 27, 2025

Train transformer language models with reinforcement learning.

Python 16,446 2,320 Updated Nov 27, 2025

A Scientific Multimodal Foundation Model

607 31 Updated Sep 30, 2025
Next