
MCP server for integrating OpenAI's Deep Research APIs and Hugging Face's Open Deep Research with Claude Code and other AI assistants

Python 37 3 Updated Oct 20, 2025
Python 112 7 Updated Oct 3, 2025

FlashInfer: Kernel Library for LLM Serving

Cuda 4,063 565 Updated Nov 13, 2025

RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.

Python 2,395 186 Updated Nov 13, 2025

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 4,049 303 Updated Nov 3, 2025

Training VLM agents with multi-turn reinforcement learning

Python 302 38 Updated Nov 9, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 15,546 2,511 Updated Nov 13, 2025

Resources and paper list for "Thinking with Images for LVLMs". This repository accompanies our survey on how LVLMs can leverage visual information for complex reasoning, planning, and generation.

1,121 37 Updated Oct 4, 2025

GUI Grounding for Professional High-Resolution Computer Use

Python 281 34 Updated Oct 27, 2025

Delphi was the home of a temple to Phoebus Apollo, which famously had the inscription, 'Know Thyself.' This library lets language models know themselves through automated interpretability.

Python 224 51 Updated Nov 10, 2025

A research prototype of a human-centered web agent

Python 7,931 824 Updated Nov 3, 2025

Witness the aha moment of VLM with less than $3.

Python 3,983 290 Updated May 19, 2025

[CVPR 2025] Magma: A Foundation Model for Multimodal AI Agents

Python 1,846 147 Updated Oct 4, 2025

The official repo of the paper "MMLongBench: Benchmarking Long-Context Vision-Language Models Effectively and Thoroughly"

Python 160 12 Updated Oct 3, 2025

A collection of World Models papers for autonomous driving (and robotics).

1,527 61 Updated Nov 4, 2025

A framework for training world models with virtual environments, complete with annotated environment dataset (RetroAct), exploration agent (AutoExplore Agent), and GenieRedux-G - an implementation …

Python 60 10 Updated Oct 25, 2025

Building a comprehensive and handy list of papers for GUI agents

Python 546 29 Updated Oct 27, 2025

This repository introduces a comprehensive paper list, datasets, methods, and tools for memory research.

313 22 Updated Jun 5, 2025

Windows Agent Arena (WAA) 🪟 is a scalable OS platform for testing and benchmarking of multi-modal AI agents.

Python 785 83 Updated Apr 30, 2025

A programmer's guide to cooking at home (Simplified Chinese only).

Dockerfile 95,795 10,687 Updated Nov 12, 2025

open source interpretability platform 🧠

TypeScript 483 66 Updated Nov 13, 2025
Python 22 Updated Aug 7, 2025

A Bionic Reading Extension for Zotero with Verbs and Nouns Highlight

TypeScript 111 Updated Apr 11, 2025

[ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?

Python 82 8 Updated Mar 18, 2025

Using Mechanistic Interpretability to Craft Adversarial Attacks against Large Language Models

Jupyter Notebook 14 Updated Jul 7, 2025

The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling

Python 40 Updated Oct 15, 2025

Paper list for Efficient Reasoning.

719 25 Updated Oct 25, 2025

Repository for the Q-Filters method (https://arxiv.org/pdf/2503.02812)

Python 35 5 Updated Mar 7, 2025

Jacobian SAEs for sparsifying LLM computation, rather than just representations

Jupyter Notebook 6 3 Updated Sep 8, 2025

[ICLR 2025] Monet: Mixture of Monosemantic Experts for Transformers

Python 73 4 Updated Jun 23, 2025