Tianbao Xie Timothyxxx

🧑‍💻

struggle with paradox

PhD student at the University of Hong Kong @xlang-ai @HKUNLP. Previously at @HIT-SCIR. Not a typical NLP researcher.

602 followers · 443 following

The University of Hong Kong
Hong Kong, SAR
tianbaoxie.com
@TianbaoX

Achievements

x3 x2

Highlights

Developer Program Member

Organizations

Lists (4)

Stars

MintyCo0kie / MGA4OSWorld

Memory_Driven_GUI_Agent

Python 5 1 Updated Nov 4, 2025

ranpox / claude-code-navel-gazing

Claude Code Reverse Engineering Itself

13 2 Updated Aug 12, 2025

xlang-ai / VideoAgentTrek

The official repo of VideoAgentTrek

Python 29 3 Updated Oct 24, 2025

anthropics / skills

Public repository for Skills

Python 15,983 1,393 Updated Oct 18, 2025

LightChen233 / AutoPR

This is the official implementation for "AUTOPR: LET'S AUTOMATE YOUR ACADEMIC PROMOTION!".

Python 80 4 Updated Oct 16, 2025

google-gemini / computer-use-preview

Python 1,733 221 Updated Oct 29, 2025

agent-infra / sandbox

All-in-One Sandbox for AI Agents that combines Browser, Shell, File, MCP and VSCode Server in a single Docker container.

Python 1,278 112 Updated Nov 9, 2025

thinking-machines-lab / batch_invariant_ops

Python 887 68 Updated Nov 4, 2025

allenai / discoveryworld

A virtual environment for developing and evaluating automated scientific discovery agents.

Python 189 13 Updated Mar 10, 2025

zhaochenyang20 / Awesome-ML-SYS-Tutorial

My learning notes/codes for ML SYS.

Python 4,092 250 Updated Nov 6, 2025

inclusionAI / ASearcher

An Open-Source Large-Scale Reinforcement Learning Project for Search Agents

Python 489 30 Updated Oct 8, 2025

WukLab / osworld-human

OSWorld-Human: Benchmarking the Efficiency of Computer-Use Agents

Python 16 1 Updated Aug 16, 2025

SunzeY / SEAgent

Official implementation of "SEAgent: Self-Evolving Computer Use Agent with Autonomous Learning from Experience"

Python 208 19 Updated Aug 7, 2025

openai / harmony

Renderer for the harmony response format to be used with gpt-oss

Rust 3,988 223 Updated Nov 5, 2025

openai / gpt-oss

gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI

Python 19,136 1,917 Updated Nov 1, 2025

QwenLM / qwen-code

Qwen Code is a coding agent that lives in the digital world.

TypeScript 15,177 1,254 Updated Nov 9, 2025

CharlesQ9 / Self-Evolving-Agents

606 56 Updated Oct 15, 2025

VeriGUI-Team / VeriGUI

VeriGUI: Verifiable Long-Chain GUI Dataset

Python 82 2 Updated Oct 23, 2025

SWE-agent / mini-swe-agent

The 100 line AI agent that solves GitHub issues or helps you in your command line. Radically simple, no huge configs, no giant monorepo—but scores >70% on SWE-bench verified!

Python 2,009 225 Updated Nov 3, 2025

sierra-research / tau2-bench

τ²-Bench: Evaluating Conversational Agents in a Dual-Control Environment

Python 397 74 Updated Nov 8, 2025

MoonshotAI / Kimi-K2

Kimi K2 is the large language model series developed by Moonshot AI team

8,757 585 Updated Nov 7, 2025

agentsea / surfkit

A toolkit for building computer use AI agents

Python 177 18 Updated Jun 26, 2025

Yan98 / GTA1

Python 113 7 Updated Oct 3, 2025

xlang-ai / OSWorld

[NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments

Python 2,304 325 Updated Nov 7, 2025

OpenDCAI / DataFlow

Easy Data Preparation with latest LLMs-based Operators and Pipelines.

Python 1,452 101 Updated Nov 7, 2025

HW-whistleblower / True-Story-of-Pangu

The true story of the hardship and darkness behind the development of the Noah Pangu large model.

11,379 1,365 Updated Jul 9, 2025

xlang-ai / OpenCUA

OpenCUA: Open Foundations for Computer-Use Agents

Python 553 63 Updated Oct 12, 2025

tml-epfl / os-harm

OS-Harm: A Benchmark for Measuring Safety of Computer Use Agents [NeurIPS 2025 Spotlight]

Jupyter Notebook 38 Updated Sep 18, 2025

laude-institute / terminal-bench

A benchmark for LLMs on complicated tasks in the terminal

Python 1,041 378 Updated Nov 7, 2025

OPPO-PersonalAI / TaskCraft

A library for generating difficulty-scalable, multi-tool, and verifiable agentic tasks with execution trajectories.

Python 167 18 Updated Jul 6, 2025

Lists (4)

code_base

🔮 Future ideas

✨ Inspiration

mixi food
