Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3, Qwen3-MoE, DeepSeek-R1, GLM4.5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, …

Python 12,109 1,119 Updated Jan 12, 2026

apple / embedding-atlas

Embedding Atlas is a tool that provides interactive visualizations for large embeddings. It allows you to visualize, cross-filter, and search embeddings and metadata.

TypeScript 4,526 251 Updated Jan 11, 2026

microsoft / PIKE-RAG

PIKE-RAG: sPecIalized KnowledgE and Rationale Augmented Generation

Python 2,360 224 Updated Sep 10, 2025

TianxingChen / Embodied-AI-Guide

[Lumina Embodied AI] 具身智能技术指南 Embodied-AI-Guide

10,564 725 Updated Jan 7, 2026

langchain-ai / open_deep_research

Python 10,164 1,480 Updated Aug 27, 2025

dzhng / deep-research

An AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, web scraping, and large language models. The goal of this repo is to provide the si…

TypeScript 18,323 1,894 Updated Sep 8, 2025

roboterax / video-prediction-policy

Video Prediction Policy: A Generalist Robot Policy with Predictive Visual Representations https://video-prediction-policy.github.io

Python 322 20 Updated May 17, 2025

leigest519 / ScreenCoder

ScreenCoder — Turn any UI screenshot into clean, editable HTML/CSS with full control. Fast, accurate, and easy to customize.

Python 2,529 244 Updated Oct 22, 2025

coze-dev / coze-studio

An AI agent development platform with all-in-one visual tools, simplifying agent creation, debugging, and deployment like never before. Coze your way to AI Agent creation.

TypeScript 19,406 2,758 Updated Jan 7, 2026

TEN-framework / ten-turn-detection

Turn detection for full-duplex dialogue communication

Python 506 33 Updated Dec 26, 2025

TEN-framework / ten-vad

Voice Activity Detector (VAD) : low-latency, high-performance and lightweight

C 1,885 148 Updated Dec 23, 2025

kyutai-labs / moshi

Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.

Python 9,257 841 Updated Jan 8, 2026

bytedance / SALMONN

SALMONN family: A suite of advanced multi-modal LLMs

1,378 112 Updated Sep 28, 2025

NVIDIA / Isaac-GR00T

NVIDIA Isaac GR00T N1.6 - A Foundation Model for Generalist Robots.

Jupyter Notebook 5,884 929 Updated Dec 18, 2025

OpenDriveLab / UniVLA

[RSS 2025] Learning to Act Anywhere with Task-centric Latent Actions

Python 937 54 Updated Nov 19, 2025

HKUDS / AI-Researcher

[NeurIPS2025] "AI-Researcher: Autonomous Scientific Innovation" -- A production-ready version: https://novix.science/chat

Python 3,935 467 Updated Oct 16, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

lambooking

Block or report lambooking

Lists (1)

🚀 My stack

Stars

rail-berkeley / serl

haosulab / ManiSkill

algorithmicsuperintelligence / openevolve

GuanxingLu / vlarl

dexmal / dexbotic

78 / xiaozhi-esp32

facebookresearch / dinov3

facebookresearch / dino

X-Square-Robot / wall-x

AccumulateMore / CV

IDEA-Research / Grounding-DINO-1.5-API

IDEA-Research / Grounded-SAM-2

RLinf / RLinf

Alibaba-NLP / DeepResearch

modelscope / ms-swift