yuzhaouoe

Follow

🌴

On vacation

Yu Zhao yuzhaouoe

🌴

On vacation

Follow

PhD Student @ University of Edinburgh, CDT in NLP

14 followers · 20 following

Achievements

Achievements

Stars

pminervini / deep-research-mcp

MCP server for integrating OpenAI's Deep Research APIs and Hugging Face's Open Deep Research with Claude Code and other AI assistants

Python 37 3 Updated Oct 20, 2025

Yan98 / GTA1

Python 112 7 Updated Oct 3, 2025

flashinfer-ai / flashinfer

FlashInfer: Kernel Library for LLM Serving

Cuda 4,063 565 Updated Nov 13, 2025

mll-lab-nu / RAGEN

RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.

Python 2,395 186 Updated Nov 13, 2025

hiyouga / EasyR1

EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL

Python 4,049 303 Updated Nov 3, 2025

mll-lab-nu / VAGEN

Training VLM agents with multi-turn reinforcement learning

Python 302 38 Updated Nov 9, 2025

volcengine / verl

verl: Volcano Engine Reinforcement Learning for LLMs

Python 15,546 2,511 Updated Nov 13, 2025

zhaochen0110 / Awesome_Think_With_Images

Resources and paper list for "Thinking with Images for LVLMs". This repository accompanies our survey on how LVLMs can leverage visual information for complex reasoning, planning, and generation.

1,121 37 Updated Oct 4, 2025

likaixin2000 / ScreenSpot-Pro-GUI-Grounding

GUI Grounding for Professional High-Resolution Computer Use

Python 281 34 Updated Oct 27, 2025

EleutherAI / delphi

Delphi was the home of a temple to Phoebus Apollo, which famously had the inscription, 'Know Thyself.' This library lets language models know themselves through automated interpretability.

Python 224 51 Updated Nov 10, 2025

microsoft / magentic-ui

A research prototype of a human-centered web agent

Python 7,931 824 Updated Nov 3, 2025

StarsfieldAI / R1-V

Witness the aha moment of VLM with less than $3.

Python 3,983 290 Updated May 19, 2025

microsoft / Magma

[CVPR 2025] Magma: A Foundation Model for Multimodal AI Agents

Python 1,846 147 Updated Oct 4, 2025

EdinburghNLP / MMLongBench

The official repo of the paper "MMLongBench Benchmarking Long-Context Vision-Language Models Effectively and Thoroughly"

Python 160 12 Updated Oct 3, 2025

LMD0311 / Awesome-World-Model

Collect some World Models for Autonomous Driving (and Robotic) papers.

1,527 61 Updated Nov 4, 2025

insait-institute / GenieRedux

A framework for training world models with virtual environments, complete with annotated environment dataset (RetroAct), exploration agent (AutoExplore Agent), and GenieRedux-G - an implementation …

Python 60 10 Updated Oct 25, 2025

OSU-NLP-Group / GUI-Agents-Paper-List

Building a comprehensive and handy list of papers for GUI agents

Python 546 29 Updated Oct 27, 2025

Elvin-Yiming-Du / Survey_Memory_in_AI

This repository introduce a comprehensive paper list, datasets, methods and tools for memory research.

313 22 Updated Jun 5, 2025

microsoft / WindowsAgentArena

Windows Agent Arena (WAA) 🪟 is a scalable OS platform for testing and benchmarking of multi-modal AI agents.

Python 785 83 Updated Apr 30, 2025

Anduin2017 / HowToCook

程序员在家做饭方法指南。Programmer's guide about how to cook at home (Simplified Chinese only).

Dockerfile 95,795 10,687 Updated Nov 12, 2025

hijohnnylin / neuronpedia

open source interpretability platform 🧠

TypeScript 483 66 Updated Nov 13, 2025

ethz-spylab / jailbreak-tax

Python 22 Updated Aug 7, 2025

DrUsagi / Colorful-Bionic

A Bionic Reading Extension for Zotero with Verbs and Nouns Highlight

TypeScript 111 Updated Apr 11, 2025

google-deepmind / latent-multi-hop-reasoning

[ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?

Python 82 8 Updated Mar 18, 2025

Sckathach / subspace-rerouting

Using Mechanistic Interpretability to Craft Adversarial Attacks against Large Language Models

Jupyter Notebook 14 Updated Jul 7, 2025

sail-sg / SkyLadder

Forked from jzhang38/TinyLlama

The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling

Python 40 Updated Oct 15, 2025

hemingkx / Awesome-Efficient-Reasoning

Paper list for Efficient Reasoning.

719 25 Updated Oct 25, 2025

NathanGodey / qfilters

Repository for the Q-Filters method (https://arxiv.org/pdf/2503.02812)

Python 35 5 Updated Mar 7, 2025

lucyfarnik / jacobian-saes

Jacobian SAEs for sparsifying LLM computation, rather than just representations

Jupyter Notebook 6 3 Updated Sep 8, 2025

dmis-lab / Monet

[ICLR 2025] Monet: Mixture of Monosemantic Experts for Transformers

Python 73 4 Updated Jun 23, 2025