Skip to content
View dreamerlin's full-sized avatar
🎯
Focusing
🎯
Focusing
  • HKU IDS | HKU-MMLab
  • Hong Kong

Block or report dreamerlin

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

😎 Finding duplicate images made easy!

Python 5,542 474 Updated Aug 15, 2025

Text-Guided Synthesis of Scientific Vector Graphics with TikZ

Python 105 8 Updated Mar 19, 2025

A curated list of awesome TikZ documentations, libraries and resources

1,697 148 Updated Oct 5, 2024

This repo hosts the related content of LaTeX Sparkle Project.

HTML 5 Updated Oct 26, 2023

Animation engine for explanatory math videos

Python 82,134 6,955 Updated Oct 20, 2025

Read SVG files and convert them to other formats.

Python 350 86 Updated Nov 24, 2025

📚 A collection of papers about Sketch Synthesis (Generation).

542 43 Updated Nov 18, 2025

Building a comprehensive and handy list of papers for GUI agents

Python 561 30 Updated Oct 27, 2025

Witness the aha moment of VLM with less than $3.

Python 3,994 291 Updated May 19, 2025

The Open-Source Multimodal AI Agent Stack: Connecting Cutting-Edge AI Models and Agent Infra

TypeScript 19,629 1,869 Updated Nov 28, 2025
Python 8,290 579 Updated Nov 12, 2025

Minimal reproduction of DeepSeek R1-Zero

Python 12,436 1,525 Updated Apr 24, 2025

My tools for the Slurm HPC workload manager

Shell 557 110 Updated Sep 22, 2025

[ICLR'25 Oral] UGround: Universal GUI Visual Grounding for GUI Agents

Python 284 12 Updated Jul 18, 2025

AgentLab: An open-source framework for developing, testing, and benchmarking web agents on diverse tasks, designed for scalability and reproducibility.

Python 479 101 Updated Nov 29, 2025

Open-sourced, Fast and Context-aware Action Grounding from GUI Instructions for GUI/Computer-use Agents

Python 389 39 Updated Feb 8, 2025

✨✨Latest Papers and Datasets on Mobile and PC GUI Agent

140 11 Updated Nov 29, 2024

A curated list of resources about AI agents for Computer Use, including research papers, projects, frameworks, and tools.

1,519 107 Updated Sep 26, 2025

[ACL 2025] Code and data for OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis

Jupyter Notebook 168 11 Updated Oct 8, 2025

All-in-one Web Agent framework for post-training. Start building with a few clicks!

Python 274 19 Updated Jul 7, 2025

A curated list of of awesome UI agents resources, encompassing Web, App, OS, and beyond (continually updated)

247 26 Updated Sep 12, 2025

This is the repo for the paper "OS Agents: A Survey on MLLM-based Agents for Computer, Phone and Browser Use" (ACL 2025 Oral).

370 17 Updated Aug 16, 2025

哈尔滨工业大学(深圳)计算机专业课程攻略 | Guidance for courses in Department of Computer Science, Harbin Institute of Technology (Shenzhen)

C 1,813 299 Updated Oct 26, 2025

Out-of-the-box (OOTB) GUI Agent for Windows and macOS

Python 1,832 196 Updated May 21, 2025

🙌 OpenHands: Code Less, Make More

Python 65,301 7,978 Updated Nov 30, 2025

💻 A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents.

987 54 Updated Aug 17, 2025

A repo lists papers related to LLM based agent

Python 2,129 131 Updated Jul 12, 2025

Awesome things about LLM-powered agents. Papers / Repos / Blogs / ...

2,164 177 Updated Apr 30, 2025

[COLM 2024] OpenAgents: An Open Platform for Language Agents in the Wild

Python 4,624 503 Updated Nov 18, 2024
Next