Skip to content
View zhangyp15's full-sized avatar
  • Tsinghua University
  • Beijing, China

Block or report zhangyp15

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[ICLR 2025, Oral] EmbodiedSAM: Online Segment Any 3D Thing in Real Time

Python 577 28 Updated May 7, 2025

Let us control diffusion models!

Python 33,206 2,975 Updated Feb 25, 2024

[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer

Python 11,389 1,159 Updated Oct 11, 2025

[RSS 2025] Learning to Act Anywhere with Task-centric Latent Actions

Python 793 46 Updated Aug 21, 2025

[IROS 2025 Award Finalist] The Large-scale Manipulation Platform for Scalable and Intelligent Embodied Systems

Python 2,543 175 Updated Oct 22, 2025

[NeurIPS 2024] Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation

Python 6,850 681 Updated Jan 22, 2025

Unified framework for robot learning built on NVIDIA Isaac Sim

Python 5,246 2,546 Updated Oct 25, 2025

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Python 25,126 1,749 Updated Oct 13, 2025

Cosmos-Reason1 models understand the physical common sense and generate appropriate embodied decisions in natural language through long chain-of-thought reasoning processes.

Python 761 60 Updated Oct 1, 2025

Cosmos-Predict1 is a collection of general-purpose world foundation models for Physical AI that can be fine-tuned into customized world models for downstream applications.

Jupyter Notebook 367 75 Updated Aug 20, 2025

RDT-1B: a Diffusion Foundation Model for Bimanual Manipulation

Python 1,502 142 Updated Sep 28, 2025

NVIDIA Isaac GR00T N1.5 - A Foundation Model for Generalist Robots.

Jupyter Notebook 5,116 796 Updated Oct 13, 2025

No fortress, purely open ground. OpenManus is Coming.

Python 50,473 8,823 Updated Oct 13, 2025

🦜🔗 Build context-aware reasoning applications

Python 118,024 19,428 Updated Oct 24, 2025

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 15,482 1,205 Updated Oct 22, 2025

FlashMLA: Efficient Multi-head Latent Attention Kernels

C++ 11,828 894 Updated Sep 30, 2025

HE-Drive: Human-Like End-to-End Driving with Vision Language Models

Python 245 17 Updated Aug 17, 2025

OpenEMMA, a permissively licensed open source "reproduction" of Waymo’s EMMA model.

Python 829 101 Updated May 13, 2025
Python 94 1 Updated Dec 30, 2024

GPD-1: Generative Pre-training for Driving

Python 78 1 Updated Dec 12, 2024

[ICCV 2025] Stag-1: Towards Realistic 4D Driving Simulation with Video Generation Model

Python 83 4 Updated Dec 10, 2024

TuShare is a utility for crawling historical data of China stocks

Python 14,000 4,370 Updated Mar 13, 2024

More relighting!

Python 8,262 520 Updated Feb 20, 2025

[ECCV 2024 Oral] LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation.

Python 1,952 133 Updated Aug 20, 2024

CLAY: A Controllable Large-scale Generative Model for Creating High-quality 3D Assets

956 14 Updated Jun 21, 2024

SparseDrive: End-to-End Autonomous Driving via Sparse Scene Representation

Python 746 100 Updated Mar 17, 2025

[NeurIPS 2024] A Generalizable World Model for Autonomous Driving

Python 803 57 Updated Jul 2, 2025

Visualize streams of multimodal data. Free, fast, easy to use, and simple to integrate. Built in Rust.

Rust 9,448 551 Updated Oct 25, 2025

NeuroNCAP benchmark for end-to-end autonomous driving

Python 220 10 Updated Oct 14, 2024
Next