Skip to content
View dreamerlin's full-sized avatar
🎯
Focusing
🎯
Focusing
  • HKU IDS | HKU-MMLab
  • Hong Kong

Block or report dreamerlin

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

A curated list of skills, tools, tutorials, and capabilities for AI coding agents (Claude, Codex, Copilot, VS Code)

1,468 105 Updated Dec 29, 2025

😎 Finding duplicate images made easy!

Python 5,571 476 Updated Aug 15, 2025

Text-Guided Synthesis of Scientific Vector Graphics with TikZ

Python 108 8 Updated Mar 19, 2025

A curated list of awesome TikZ documentations, libraries and resources

1,720 149 Updated Oct 5, 2024

This repo hosts the related content of LaTeX Sparkle Project.

HTML 5 Updated Oct 26, 2023

Animation engine for explanatory math videos

Python 83,796 7,072 Updated Oct 20, 2025

Read SVG files and convert them to other formats.

Python 356 85 Updated Jan 19, 2026

📚 A collection of papers about Sketch Synthesis (Generation).

553 43 Updated Dec 16, 2025

Building a comprehensive and handy list of papers for GUI agents

Python 610 32 Updated Oct 27, 2025

Witness the aha moment of VLM with less than $3.

Python 4,023 288 Updated May 19, 2025

The Open-Source Multimodal AI Agent Stack: Connecting Cutting-Edge AI Models and Agent Infra

TypeScript 24,365 2,359 Updated Jan 14, 2026

Pioneering Automated GUI Interaction with Native Agents

Python 9,004 639 Updated Jan 19, 2026

Minimal reproduction of DeepSeek R1-Zero

Python 12,608 1,543 Updated Apr 24, 2025

My tools for the Slurm HPC workload manager

Shell 563 112 Updated Jan 19, 2026

[ICLR'25 Oral] UGround: Universal GUI Visual Grounding for GUI Agents

Python 292 14 Updated Jul 18, 2025

AgentLab: An open-source framework for developing, testing, and benchmarking web agents on diverse tasks, designed for scalability and reproducibility.

Python 497 105 Updated Jan 16, 2026

Open-sourced, Fast and Context-aware Action Grounding from GUI Instructions for GUI/Computer-use Agents

Python 394 39 Updated Feb 8, 2025

✨✨Latest Papers and Datasets on Mobile and PC GUI Agent

144 11 Updated Nov 29, 2024

A curated list of resources about AI agents for Computer Use, including research papers, projects, frameworks, and tools.

1,573 109 Updated Sep 26, 2025

[ACL 2025] Code and data for OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis

Jupyter Notebook 175 12 Updated Oct 8, 2025

All-in-one Web Agent framework for post-training. Start building with a few clicks!

Python 275 21 Updated Jul 7, 2025

A curated list of of awesome UI agents resources, encompassing Web, App, OS, and beyond (continually updated)

265 31 Updated Dec 15, 2025

This is the repo for the paper "OS Agents: A Survey on MLLM-based Agents for Computer, Phone and Browser Use" (ACL 2025 Oral).

376 19 Updated Aug 16, 2025

哈尔滨工业大学(深圳)计算机专业课程攻略 | Guidance for courses in Department of Computer Science, Harbin Institute of Technology (Shenzhen)

C 1,844 298 Updated Oct 26, 2025

Out-of-the-box (OOTB) GUI Agent for Windows and macOS

Python 1,866 200 Updated May 21, 2025

🙌 OpenHands: AI-Driven Development

Python 66,784 8,301 Updated Jan 19, 2026

💻 A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents.

1,074 60 Updated Aug 17, 2025

A repo lists papers related to LLM based agent

Python 2,201 134 Updated Jul 12, 2025

Awesome things about LLM-powered agents. Papers / Repos / Blogs / ...

2,192 183 Updated Apr 30, 2025
Next