Skip to content
View Rangooo123's full-sized avatar
☃️
Winter is here, and it is colllllld!
☃️
Winter is here, and it is colllllld!

Block or report Rangooo123

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 16,774 1,372 Updated Nov 28, 2025

A curated list of resources for articulated objects understanding.

Python 111 4 Updated Mar 17, 2025
Python 15 1 Updated Nov 16, 2024

[CVPR'24] Consistent Novel View Synthesis without 3D Representation

Python 166 5 Updated Aug 27, 2024

Dense Prediction Transformers

Python 2,282 283 Updated Dec 18, 2024

OpenVLA: An open-source vision-language-action model for robotic manipulation.

Python 4,583 551 Updated Mar 23, 2025

HybridVLA: Collaborative Diffusion and Autoregression in a Unified Vision-Language-Action Model

Python 322 9 Updated Oct 3, 2025

Democratization of RT-2 "RT-2: New model translates vision and language into action"

Python 533 66 Updated Jul 26, 2024

RynnEC: Bringing MLLMs into Embodied World

Jupyter Notebook 381 17 Updated Oct 29, 2025

[Neurips'24 Spotlight] Visual CoT: Advancing Multi-Modal Language Models with a Comprehensive Dataset and Benchmark for Chain-of-Thought Reasoning

Python 400 21 Updated Dec 22, 2024

MME-CoT: Benchmarking Chain-of-Thought in LMMs for Reasoning Quality, Robustness, and Efficiency

Python 134 5 Updated Aug 5, 2025

[CVPR' 25] Interleaved-Modal Chain-of-Thought

Python 94 4 Updated Nov 26, 2025
Python 7 Updated Sep 29, 2023

Example models using DeepSpeed

Python 6,736 1,110 Updated Oct 15, 2025

一个手把手教你从零开始编写GPT并训练大语言模型的教程

Jupyter Notebook 90 10 Updated Jan 20, 2025

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 50,274 8,412 Updated Nov 12, 2025

Awesome-LLM-3D: a curated list of Multi-modal Large Language Model in 3D world Resources

2,046 126 Updated Oct 27, 2025

Unified 3D Reconstruction and Semantic Understanding via Generalizable Gaussian Splatting from Unposed Multi-View Images

Python 102 4 Updated Sep 3, 2025

[CVPR 2024 Oral, Best Paper Runner-Up] Code for "pixelSplat: 3D Gaussian Splats from Image Pairs for Scalable Generalizable 3D Reconstruction" by David Charatan, Sizhe Lester Li, Andrea Tagliasacch…

Python 1,161 74 Updated Jan 13, 2025

[ICLR 2025] HiSplat: Hierarchical 3D Gaussian Splatting for Generalizable Sparse-View Reconstruction

Python 108 5 Updated Jan 24, 2025

Index repo for Kimera code

2,000 237 Updated Jan 30, 2021

Segment Anything in 3D with NeRFs (NeurIPS 2023 & IJCV 2025)

Python 993 63 Updated May 19, 2025

Code for LERF: Language Embedded Radiance Fields

Python 708 74 Updated Jul 9, 2024

[IROS 2025] LiDAR-Augmented Gaussian Splatting and Neural SDF for Geometrically Consistent Rendering and Reconstruction

C++ 338 23 Updated Oct 15, 2025

Environment light tools.

Python 69 8 Updated Jan 19, 2024

Official implementation of BARD-GS: Blur-Aware Reconstruction of Dynamic Scenes via Gaussian Splatting

Python 18 2 Updated Jul 7, 2025

[NeurIPS'22] MonoSDF: Exploring Monocular Geometric Cues for Neural Implicit Surface Reconstruction

Python 605 53 Updated May 7, 2023
Next