Stars
MichalZawalski / embodied-CoT
Forked from openvla/openvlaEmbodied Chain of Thought: A robotic policy that reason to solve the task.
Official code of Motus: A Unified Latent Action World Model
Vlaser: Vision-Language-Action Model with Synergistic Embodied Reasoning
LITEN: Learning from Inference Time Execution for VLAs
HaWoR: World-Space Hand Motion Reconstruction from Egocentric Videos
⏰ Collaboratively track worldwide conference deadlines (Website, Python Cli, Wechat Applet) / If you find it useful, please star this project, thanks~
A comprehensive list of papers about Robot Manipulation, including papers, codes, and related websites.
🎯 告别信息过载,你的 AI 舆情监控助手与热点筛选工具!聚合多平台热点 + RSS 订阅,支持关键词精准筛选。接入 MCP 架构,赋能 AI 自然语言对话分析、情感洞察与趋势预测。支持 Docker 一键部署,数据本地/云端自持。集成微信/飞书/钉钉/Telegram/邮件/ntfy/bark/slack 等渠道智能推送。⭐
为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, m…
NVIDIA Isaac GR00T N1.6 - A Foundation Model for Generalist Robots.
GUI for visualization and interactive editing of SMPL-family body models ie. SMPL, SMPL-X, MANO, FLAME.
VITRA: Scalable Vision-Language-Action Model Pretraining for Robotic Manipulation with Real-Life Human Activity Videos
🔥中文 prompt 精选🔥,ChatGPT 使用指南,提升 ChatGPT 可玩性和可用性!🚀
Being-H0: Vision-Language-Action Pretraining from Large-Scale Human Videos
Drawing Bayesian networks, graphical models, tensors, technical frameworks, and illustrations in LaTeX.
A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.