Skip to content
View Christarye's full-sized avatar

Block or report Christarye

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[CVPR 2025] UniScene: Unified Occupancy-centric Driving Scene Generation

Python 524 25 Updated Oct 30, 2025

[IROS 2024] ODTFormer: Efficient Obstacle Detection and Tracking with Stereo Cameras Based on Transformer

Python 7 Updated Mar 7, 2025

Slam Toolbox for lifelong mapping and localization in potentially massive maps with ROS

C++ 2,207 618 Updated Nov 25, 2025

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook 52,696 6,160 Updated Sep 18, 2024

[CVPR2024] Code for "SAM-6D: Segment Anything Model Meets Zero-Shot 6D Object Pose Estimation".

Python 616 70 Updated Jul 9, 2024

Visual SLAM/odometry package based on NVIDIA-accelerated cuVSLAM

C++ 1,192 168 Updated Nov 14, 2025

Simple demos to help you explore OpenCV

C++ 7 5 Updated Oct 27, 2020

Ultralytics YOLO 🚀

Python 49,256 9,517 Updated Nov 28, 2025

Official code of paper "DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for Efficient Robot Execution"

Python 119 7 Updated Feb 14, 2025

VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.

Python 3,678 306 Updated Nov 28, 2025

Student version of Assignment 2 for Stanford CS336 - Language Modeling From Scratch

Python 127 269 Updated Jul 25, 2025

Student version of Assignment 1 for Stanford CS336 - Language Modeling From Scratch

Python 953 1,235 Updated Aug 29, 2025

张正友标定法的数学原理以及python源码实现(详细)

Python 22 12 Updated Jun 1, 2022

Training VLM agents with multi-turn reinforcement learning

Python 324 38 Updated Nov 9, 2025

Paper Survey for Transformer-based SLAM

214 14 Updated Nov 24, 2025

PyTorch3D is FAIR's library of reusable components for deep learning with 3D data

Python 9,642 1,420 Updated Nov 27, 2025

《Build a Large Language Model (From Scratch)》是一本深入探讨大语言模型原理与实现的电子书,适合希望深入了解 GPT 等大模型架构、训练过程及应用开发的学习者。为了让更多中文读者能够接触到这本极具价值的教材,我决定将其翻译成中文,并通过 GitHub 进行开源共享。

HTML 2,673 465 Updated Sep 7, 2025

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python 95,429 26,035 Updated Nov 28, 2025

OpenMMLab Pose Estimation Toolbox and Benchmark.

Python 7,108 1,427 Updated Aug 4, 2025

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 24,057 2,670 Updated Aug 12, 2024

stereoCamera,calibration,stereo matching,SGBM,双目摄像头,相机标定,视差图生成,深度图生成,点云数据生成。

Python 7 Updated Oct 26, 2025

A modern, C++-native, test framework for unit-tests, TDD and BDD - using C++14, C++17 and later (C++11 support is in v2.x branch, and C++03 on the Catch1.x branch)

C++ 19,988 3,172 Updated Nov 9, 2025

Pangolin is a lightweight portable rapid development library for managing OpenGL display / interaction and abstracting video input.

C++ 2,646 882 Updated Nov 13, 2025

RTAB-Map's ROS package.

C++ 1,319 622 Updated Nov 12, 2025

This code contains an algorithm to compute stereo visual SLAM by using both point and line segment features.

C++ 776 245 Updated Nov 24, 2019

Real-Time SLAM for Monocular, Stereo and RGB-D Cameras, with Loop Detection and Relocalization Capabilities

C++ 10,013 4,748 Updated May 15, 2024

Official code for BEVStereo

Python 277 15 Updated Sep 22, 2022
Next