Stars
Consists of ~500k human annotations on the RICO dataset identifying various icons based on their shapes and semantics, and associations between selected general UI elements and their text labels. A…
Your AI Operator for Web, Android, Automation & Testing.
💻 A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents.
A standalone version of the readability lib
The Open-Source Multimodal AI Agent Stack: Connecting Cutting-Edge AI Models and Agent Infra
[TPAMI 2025] Towards Visual Grounding: A Survey
XMind2TestCase, implemented in Python, provides an efficient solution for test case design!
A Survey on Large Language Model-Based Game Agents
Official Implementation of "KBLaM: Knowledge Base augmented Language Model"
Recognize graphical user interface layout by grouping GUI elements according to their visual attributes
The implementation of the BinGo model (published at AsiaCCS 2024).
A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations
Community guide to using YubiKey for GnuPG and SSH - protect secrets with hardware crypto.
symbolic execution fuzzing with KLEE
A neurosymbolic framework for vulnerability detection in code
A manually vetted dataset for security vulnerability detection in Java projects
MegaVul - The largest, high-quality, extensible, continuously updated, C/C++/Java vulnerability dataset
A collection of AWESOME things about Graph-Related LLMs.
Vul4J: A Dataset of Reproducible Java Vulnerabilities
Self-hosted huggingface mirror service.