Stars
VLA-Adapter: An Effective Paradigm for Tiny-Scale Vision-Language-Action Model
A curated list of large VLM-based VLA models for robotic manipulation.
[Lumina Embodied AI] Embodied AI Technical Guide (Embodied-AI-Guide)
[Actively Maintained🔥] A list of Embodied AI papers accepted by top conferences (ICLR, NeurIPS, ICML, RSS, CoRL, ICRA, IROS, CVPR, ICCV, ECCV).
Lumina Robotics Talent Call | Lumina Community Embodied AI Talent Board | A list for Embodied AI / Robotics Jobs (PhD, RA, intern, full-time, etc
A curated list of awesome Multimodal studies.
📖 A repository for organizing papers, code, and other resources related to unified multimodal models.
An open-source implementation for fine-tuning Molmo-7B-D and Molmo-7B-O by allenai.
[ICCV 2025] A Simple yet Effective Pathway to Empowering LLaVA to Understand and Interact with 3D World
Empowering Unified MLLM with Multi-granular Visual Generation
[CVPR 2024 Highlight] Official PyTorch implementation of SpatialTracker: Tracking Any 2D Pixels in 3D Space
Run Segment Anything Model 2 on a live video stream
[ECCV 2024] PowerPaint, a versatile image inpainting model that supports text-guided object inpainting, object removal, image outpainting and shape-guided object inpainting with only a single model…
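The most comprehensive database of Chinese poetry 🧶 Nearly 14,000 poets of the Tang and Song dynasties, close to 55,000 Tang poems plus 260,000 Song poems, and 21,050 ci (lyric poems) by 1,564 ci poets of the Song period.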
Intelligent Computing Systems lab: implementing five operators in BangC on the Cambricon programming platform