wusongyuan

wusongyuan

Achievements

Stars

RLinf / RLinf

RLinf is a flexible and scalable open-source infrastructure designed for post-training foundation models (LLMs, VLMs, VLAs) via reinforcement learning.

Python 1,116 105 Updated Nov 7, 2025

xlang-ai / OSWorld

[NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments

Python 2,304 325 Updated Nov 7, 2025

PaddlePaddle / PaddleOCR

Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.

Python 63,168 9,283 Updated Nov 6, 2025

karpathy / nanochat

The best ChatGPT that $100 can buy.

Python 36,207 4,246 Updated Nov 5, 2025

PRBonn / rko_lio

A Robust Approach for LiDAR-Inertial Odometry Without Sensor-Specific Modelling

C++ 335 19 Updated Nov 5, 2025

Ilya-Fradlin / Interactive4D

[ICRA 2025] Interactive4D: Interactive 4D LiDAR Segmentation

Python 96 6 Updated May 7, 2025

ZiyuGuo99 / SAM2Point

The Most Faithful Implementation of Segment Anything (SAM) in 3D

Python 349 16 Updated Sep 11, 2024

dataelement / bisheng

BISHENG is an open LLM devops platform for next generation Enterprise AI applications. Powerful and comprehensive features include: GenAI workflow, RAG, Agent, Unified model management, Evaluation,…

TypeScript 9,906 1,630 Updated Nov 7, 2025

volcengine / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 15,263 2,455 Updated Nov 9, 2025

jlowin / fastmcp

🚀 The fast, Pythonic way to build MCP servers and clients

Python 20,110 1,479 Updated Nov 9, 2025

microsoft / vscode

Visual Studio Code

TypeScript 178,391 36,047 Updated Nov 9, 2025

Liuziyu77 / Visual-RFT

Official repository of 'Visual-RFT: Visual Reinforcement Fine-Tuning' & 'Visual-ARFT: Visual Agentic Reinforcement Fine-Tuning'’

Jupyter Notebook 2,246 100 Updated Oct 29, 2025

deepseek-ai / DeepEP

DeepEP: an efficient expert-parallel communication library

Cuda 8,706 979 Updated Nov 6, 2025

deepseek-ai / open-infra-index

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

7,930 286 Updated May 15, 2025

deepseek-ai / FlashMLA

FlashMLA: Efficient Multi-head Latent Attention Kernels

C++ 11,855 897 Updated Sep 30, 2025

deepseek-ai / DeepGEMM

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Cuda 5,869 739 Updated Oct 15, 2025

sunnyiisc / Fire-Detection-from-FLAME-Dataset

Deep Learning model implementation for Fire detection both classification and segmentation from the FLAME dataset.

Python 27 2 Updated Dec 12, 2022

btahir / open-deep-research

Open source alternative to Gemini Deep Research. Generate reports with AI based on search results.

TypeScript 2,094 198 Updated Mar 15, 2025

kvcache-ai / ktransformers

A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations

Python 15,506 1,119 Updated Nov 8, 2025

deepseek-ai / DeepSeek-R1

91,459 11,782 Updated Jun 27, 2025

open-webui / open-webui

User-friendly AI Interface (Supports Ollama, OpenAI API, ...)

JavaScript 114,700 15,993 Updated Nov 9, 2025

huggingface / open-r1

Fully open reproduction of DeepSeek-R1

Python 25,619 2,401 Updated Sep 8, 2025

microsoft / OmniParser

A simple screen parsing tool towards pure vision based GUI agent

Jupyter Notebook 23,811 2,043 Updated Sep 12, 2025

OpenBMB / MiniCPM-V

MiniCPM-V 4.5: A GPT-4o Level MLLM for Single Image, Multi Image and High-FPS Video Understanding on Your Phone

Python 22,199 1,664 Updated Sep 24, 2025

FellouAI / eko

Eko (Eko Keeps Operating) - Build Production-ready Agentic Workflow with Natural Language - eko.fellou.ai

TypeScript 4,731 420 Updated Oct 30, 2025

NVIDIA / nv-ingest

NeMo Retriever extraction is a scalable, performance-oriented document content and metadata extraction microservice. NeMo Retriever extraction uses specialized NVIDIA NIM microservices to find, con…

Python 2,760 273 Updated Nov 8, 2025

bytedance / Sa2VA

Official Repo For "Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos"

Python 1,392 97 Updated Nov 4, 2025

OpenSPG / KAG

KAG is a logical form-guided reasoning and retrieval framework based on OpenSPG engine and LLMs. It is used to build logical reasoning and factual Q&A solutions for professional domain knowledge ba…

Python 8,167 620 Updated Sep 22, 2025

kvcache-ai / Mooncake

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 4,242 420 Updated Nov 9, 2025

QwenLM / Qwen3-VL

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 16,140 1,287 Updated Oct 27, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

wusongyuan

Achievements

Achievements

Block or report wusongyuan

Stars

RLinf / RLinf

xlang-ai / OSWorld

PaddlePaddle / PaddleOCR

karpathy / nanochat

PRBonn / rko_lio

Ilya-Fradlin / Interactive4D

ZiyuGuo99 / SAM2Point

dataelement / bisheng

volcengine / verl

jlowin / fastmcp

microsoft / vscode

Liuziyu77 / Visual-RFT

deepseek-ai / DeepEP

deepseek-ai / open-infra-index

deepseek-ai / FlashMLA

deepseek-ai / DeepGEMM

sunnyiisc / Fire-Detection-from-FLAME-Dataset

btahir / open-deep-research

kvcache-ai / ktransformers

deepseek-ai / DeepSeek-R1

open-webui / open-webui

huggingface / open-r1

microsoft / OmniParser

OpenBMB / MiniCPM-V

FellouAI / eko

NVIDIA / nv-ingest

bytedance / Sa2VA

OpenSPG / KAG

kvcache-ai / Mooncake

QwenLM / Qwen3-VL