dragonlong

Xiaolong dragonlong

3D AI Researcher, Nvidia

112 followers · 43 following

Santa Clara
02:08 (UTC -12:00)
https://dragonlong.github.io/
@lxiaol9

Achievements

Lists (1)

Sort

🔮 Future ideas

1 repository

Stars

ToyotaResearchInstitute / lbm_eval

Simulation benchmark from Toyota Research Institute containing 49 tasks that measure the performance of Large Behavior Model policies

Python 41 Updated Nov 6, 2025

nvidia-cosmos / cosmos-cookbook

Post-training scripts and samples for NVIDIA Cosmos ecosystem

Python 68 21 Updated Nov 26, 2025

NVIDIA / dgx-spark-playbooks

Collection of step-by-step playbooks for setting up AI/ML workloads on NVIDIA DGX Spark devices with Blackwell architecture.

TypeScript 212 67 Updated Nov 25, 2025

karpathy / nanochat

The best ChatGPT that $100 can buy.

Python 37,645 4,615 Updated Nov 17, 2025

thu-ml / RDT2

Official code of RDT 2

Python 586 26 Updated Oct 11, 2025

OpenDriveLab / AgiBot-World

[IROS 2025 Award Finalist] The Large-scale Manipulation Platform for Scalable and Intelligent Embodied Systems

Python 2,624 181 Updated Oct 27, 2025

AgibotTech / Genie-Envisioner

Python 320 15 Updated Nov 27, 2025

AIDC-AI / Ovis

A novel Multimodal Large Language Model (MLLM) architecture, designed to structurally align visual and textual embeddings.

Python 1,411 83 Updated Sep 22, 2025

The-AI-Alliance / GEO-Bench-VLM

GEOBench-VLM: Benchmarking Vision-Language Models for Geospatial Tasks

Python 86 6 Updated Jul 1, 2025

DangMinh21 / Multimodal-and-Multi-task-Fusion-for-Spatial-Reasoning

Python 2 Updated Sep 19, 2025

mingyin0312 / RLFromScratch

Python 462 36 Updated Aug 28, 2025

modelscope / DiffSynth-Studio

Enjoy the magic of Diffusion models!

Python 10,812 1,015 Updated Nov 27, 2025

yeliudev / R2-Tuning

🌀 R2-Tuning: Efficient Image-to-Video Transfer Learning for Video Temporal Grounding (ECCV 2024)

Python 90 4 Updated Jul 2, 2024

modelcontextprotocol / python-sdk

The official Python SDK for Model Context Protocol servers and clients

Python 20,294 2,808 Updated Nov 27, 2025

NVlabs / PartPacker

Efficient Part-level 3D Object Generation via Dual Volume Packing

Python 778 67 Updated Jun 26, 2025

OPPO-PersonalAI / Agent_Foundation_Models

Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL.

Python 494 44 Updated Sep 8, 2025

minghangz / OnVTG

Online video temporal grounding

Python 11 Updated Oct 20, 2025

MatchLab-Imperial / Hypo3D

ICML 2025 Hypo3D: Exploring Hypothetical Reasoning in 3D

Python 43 6 Updated Jul 17, 2025

NVlabs / scene_synthesizer

Python package to create manipulation scenes.

Python 209 21 Updated Jun 28, 2025

qizekun / SoFar

[NeurIPS 2025 Spotlight] SoFar: Language-Grounded Orientation Bridges Spatial Reasoning and Object Manipulation

Python 203 8 Updated Jun 30, 2025

Simple-Efficient / RL-Factory

Train your Agent model via our easy and efficient framework

Python 1,632 155 Updated Nov 17, 2025

facebookresearch / dinov3

Reference PyTorch implementation and models for DINOv3

Jupyter Notebook 8,566 613 Updated Nov 20, 2025

Wan-Video / Wan2.2

Wan: Open and Advanced Large-Scale Video Generative Models

Python 12,097 1,372 Updated Nov 14, 2025

microsoft / agent-lightning

The absolute trainer to light up AI agents.

Python 8,952 715 Updated Nov 27, 2025

yunlong10 / CAT-V

[AAAI 26 Demo] Offical repo for CAT-V - Caption Anything in Video: Object-centric Dense Video Captioning with Spatiotemporal Multimodal Prompting

Python 59 4 Updated Oct 30, 2025

antoyang / VidChapters

[NeurIPS 2023 D&B] VidChapters-7M: Video Chapters at Scale

Jupyter Notebook 200 23 Updated Nov 13, 2023

zihuixue / ProgCaptioner

Code release for the paper "Progress-Aware Video Frame Captioning" (CVPR 2025)

Python 18 1 Updated Jul 16, 2025

AIGeeksGroup / 3D-R1

3D-R1: Enhancing Reasoning in 3D VLMs for Unified Scene Understanding

Python 365 12 Updated Nov 27, 2025

huggingface / trl

Train transformer language models with reinforcement learning.

Python 16,446 2,320 Updated Nov 27, 2025

InternLM / Intern-S1

A Scientific Multimodal Foundation Model

607 31 Updated Sep 30, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Xiaolong dragonlong

Achievements

Achievements

Block or report dragonlong

Lists (1)

🔮 Future ideas

Stars

ToyotaResearchInstitute / lbm_eval

nvidia-cosmos / cosmos-cookbook

NVIDIA / dgx-spark-playbooks

karpathy / nanochat

thu-ml / RDT2

OpenDriveLab / AgiBot-World

AgibotTech / Genie-Envisioner

AIDC-AI / Ovis

The-AI-Alliance / GEO-Bench-VLM

DangMinh21 / Multimodal-and-Multi-task-Fusion-for-Spatial-Reasoning

mingyin0312 / RLFromScratch

modelscope / DiffSynth-Studio

yeliudev / R2-Tuning

modelcontextprotocol / python-sdk

NVlabs / PartPacker

OPPO-PersonalAI / Agent_Foundation_Models

minghangz / OnVTG

MatchLab-Imperial / Hypo3D

NVlabs / scene_synthesizer

qizekun / SoFar

Simple-Efficient / RL-Factory

facebookresearch / dinov3

Wan-Video / Wan2.2

microsoft / agent-lightning

yunlong10 / CAT-V

antoyang / VidChapters

zihuixue / ProgCaptioner

AIGeeksGroup / 3D-R1

huggingface / trl

InternLM / Intern-S1