Starred repositories
Open-source and strong foundation image recognition models.
Learning to Use Medical Tools with Multi-modal Agent
A collection of tutorials on state-of-the-art computer vision models and techniques. Explore everything from foundational architectures like ResNet to cutting-edge models like RF-DETR, YOLO11, SAM …
Universal skills loader for AI coding agents - npm i -g openskills
Effortless AI-assisted data labeling with AI support from YOLO, Segment Anything (SAM+SAM2), MobileSAM!!
A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive visualization.
Kilo is the all-in-one agentic engineering platform. Build, ship, and iterate faster with the most popular open source coding agent. #1 on OpenRouter. 1.5M+ Kilo Coders. 25T+ tokens processed
[MLsys2026]: RAG on Everything with LEANN. Enjoy 97% storage savings while running a fast, accurate, and 100% private RAG application on your personal device.
[NeurIPS 2025 spotlight] Official implementation for "FutureSightDrive: Thinking Visually with Spatio-Temporal CoT for Autonomous Driving"
All-in-One Sandbox for AI Agents that combines Browser, Shell, File, MCP and VSCode Server in a single Docker container.
Langbase open source Serverless AI agents, pipes, memory, and AI examples.
LLM agents built for control. Designed for real-world use. Deployed in minutes.
An agentic AI for tech career coaching 程序员技术学习AI智能体 (based on JoyAgent-JDGenie with local RAG and code analyzing)
Build and run AI agents using Docker Compose. A collection of ready-to-use examples for orchestrating open-source LLMs, tools, and agent runtimes.
aider is AI pair programming in your terminal
"MiniRAG: Making RAG Simpler with Small and Open-Sourced Language Models"
Scripting tool for downloading Dify plugin package from Dify Marketplace and Github and repackaging [true] offline package.
AI memory OS for LLM and Agent systems(moltbot,clawdbot,openclaw), enabling persistent Skill memory for cross-task skill reuse and evolution.
ChatDev 2.0: Dev All through LLM-powered Multi-Agent Collaboration
A lightweight LMM-based Document Parsing Model
Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.
JetLinks 基于Java,Spring Boot ,WebFlux,Netty,Vert.x,Reactor等开发, 是一个全响应式的企业级物联网平台。支持统一物模型管理,多种设备,多种厂家,统一管理。统一设备连接管理,多协议适配(TCP,MQTT,UDP,CoAP,HTTP等),屏蔽网络编程复杂性,灵活接入不同厂家不同协议等设备。实时数据处理,设备告警,消息通知,数据转发。地理位置,…
AI Agent for producing the Higress monthly report