Stars
Time Blindness: Why Video-Language Models Can't See What Humans Can?
OpenStereo: A Comprehensive Benchmark for Stereo Matching
[CVPR 2025] Official Repository for Scenario Dreamer: Vectorized Latent Diffusion for Generating Driving Simulation Environments
VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.
《代码随想录》LeetCode 刷题攻略:200道经典题目刷题顺序,共60w字的详细图解,视频难点剖析,50余张思维导图,支持C++,Java,Python,Go,JavaScript等多语言版本,从此算法学习不再迷茫!🔥🔥 来看看,你会发现相见恨晚!🚀
Outline: Access to the free and open Internet
OpenEMMA, a permissively licensed open source "reproduction" of Waymo’s EMMA model.
3D plotting and mesh analysis through a streamlined interface for the Visualization Toolkit (VTK)
The official implementation of the ECCV 2024 paper: Continuity Preserving Online CenterLine Graph Learning
Model summary in PyTorch similar to `model.summary()` in Keras
This project extends the idea of the innovative architecture of Kolmogorov-Arnold Networks (KAN) to the Convolutional Layers, changing the classic linear transformation of the convolution to learna…
Argoverse 2: Next generation datasets for self-driving perception and forecasting.
SLIDE is C++ code that simulates degradation of lithium ion cells. It extends the single particle model with various degradation models from literature. Users can select which degradation models th…
[CVPR 2024 Highlight] GenAD: Generalized Predictive Model for Autonomous Driving
[NeurIPS 2023 Track Datasets and Benchmarks] OpenLane-V2: The First Perception and Reasoning Benchmark for Road Driving
[AAAI 2024] Official implementation of "SQLdepth: Generalizable Self-Supervised Fine-Structured Monocular Depth Estimation", and more.
HOG feature descriptor, the kind of feature transform before we put our image into SVM. This repository also provides hog visualization both before and after doing block normalization.
Deep learning inference nodes for ROS / ROS2 with support for NVIDIA Jetson and TensorRT
[CVPR 2024] LMDrive: Closed-Loop End-to-End Driving with Large Language Models