Official repo of paper "Reconstruction Alignment Improves Unified Multimodal Models". Unlocking the Massive Zero-shot Potential in Unified Multimodal Models through Self-supervised Learning.

Python 284 10 Updated Oct 14, 2025

InternRobotics / F1-VLA

F1: A Vision Language Action Model Bridging Understanding and Generation to Actions

Python 114 8 Updated Oct 8, 2025

fudan-zvg / UniUGG

UniUGG: Unified 3D Understanding and Generation via Geometric-Semantic Encoding

52 Updated Aug 19, 2025

InternRobotics / InternVLA-A1

InternVLA-A1: Unifying Understanding, Generation, and Action for Robotic Manipulation

Python 46 Updated Sep 18, 2025

InternRobotics / InternVLA-M1

InternVLA-M1: A Spatially Guided Vision-Language-Action Framework for Generalist Robot Policy

Python 142 6 Updated Oct 16, 2025

MIV-XJTU / FSDrive

[NeurIPS 2025 spotlight] Official implementation for "FutureSightDrive: Thinking Visually with Spatio-Temporal CoT for Autonomous Driving"

Python 381 15 Updated Sep 28, 2025

DriveVLA / OpenDriveVLA

OpenDriveVLA: Towards End-to-end Autonomous Driving with Large Vision Language Action Model

Python 431 30 Updated Aug 16, 2025

QwenLM / Qwen3-VL

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 14,684 1,123 Updated Oct 15, 2025

qibin0506 / Cortex

个人构建MoE大模型：从预训练到DPO的完整实践

Python 1,600 126 Updated Oct 15, 2025

virattt / ai-hedge-fund

An AI Hedge Fund Team

Python 41,868 7,394 Updated Oct 11, 2025

WooooDyy / AgentGym

Code and implementations for the ACL 2025 paper "AgentGym: Evolving Large Language Model-based Agents across Diverse Environments" by Zhiheng Xi et al.

Python 617 85 Updated Sep 11, 2025

WooooDyy / AgentGym-RL

Code and implementations for the paper "AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning" by Zhiheng Xi et al.

Python 437 41 Updated Sep 11, 2025

crewAIInc / crewAI

Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.

Python 39,229 5,213 Updated Oct 16, 2025

github / spec-kit

💫 Toolkit to help you get started with Spec-Driven Development

Python 37,215 3,162 Updated Oct 15, 2025

emcie-co / parlant

LLM agents built for control. Designed for real-world use. Deployed in minutes.

Python 13,713 1,114 Updated Oct 15, 2025

khoj-ai / khoj

Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous …

Python 31,331 1,841 Updated Sep 16, 2025