Skip to content
View jszh's full-sized avatar

Organizations

@meomoe

Block or report jszh

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

verl: Volcano Engine Reinforcement Learning for LLMs

Python 18,294 3,017 Updated Jan 13, 2026

Building a comprehensive and handy list of papers for GUI agents

Python 606 31 Updated Oct 27, 2025
Python 105 26 Updated Nov 19, 2025

Resources and paper list for "Thinking with Images for LVLMs". This repository accompanies our survey on how LVLMs can leverage visual information for complex reasoning, planning, and generation.

1,262 40 Updated Dec 23, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 65,601 7,977 Updated Jan 13, 2026

VIP cheatsheet for Stanford's CME 295 Transformers and Large Language Models

3,916 539 Updated Jul 27, 2025

Web Content Accessibility Guidelines

HTML 1,350 342 Updated Jan 12, 2026

This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…

Jupyter Notebook 24,143 2,806 Updated Dec 11, 2025

ScreenCoder — Turn any UI screenshot into clean, editable HTML/CSS with full control. Fast, accurate, and easy to customize.

Python 2,527 244 Updated Oct 22, 2025

FULL Augment Code, Claude Code, Cluely, CodeBuddy, Comet, Cursor, Devin AI, Junie, Kiro, Leap.new, Lovable, Manus, NotionAI, Orchids.app, Perplexity, Poke, Qoder, Replit, Same.dev, Trae, Traycer AI…

107,864 28,340 Updated Jan 8, 2026

Vibetest MCP - automated QA testing using Browser-Use agents

Python 755 74 Updated Sep 2, 2025

C++ implementation of a ScienceDirect paper "An accelerating cpu-based correlation-based image alignment for real-time automatic optical inspection"

C++ 1,080 254 Updated Aug 22, 2025

An open-source audio wake word (or phrase) detection framework with a focus on performance and simplicity.

Jupyter Notebook 1,674 199 Updated Dec 30, 2025

Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2

Jupyter Notebook 3,193 366 Updated Nov 11, 2025

Run Segment Anything Model 2 on a live video stream

Jupyter Notebook 560 91 Updated Jun 3, 2025

A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.

Python 9,302 789 Updated Jul 11, 2025
Jupyter Notebook 127 18 Updated Dec 4, 2023

Datasets on Website Aesthetics for Machine Learning

R 13 4 Updated Mar 28, 2023

Android in docker solution with noVNC supported and video recording

Python 14,036 1,622 Updated Jan 7, 2026
Python 96 18 Updated Jul 12, 2022

A command-line utility and Scrapy middleware for scraping time series data from Archive.org's Wayback Machine.

Python 468 80 Updated Feb 23, 2024

A Python Perceptual Image Hashing Module

Python 3,782 340 Updated Apr 17, 2025

Pretty good call graphs for dynamic languages

Python 4,505 329 Updated Jul 27, 2025

MagentaA11y is a tool built to simplify the process of accessibility testing.

TypeScript 77 22 Updated Jan 12, 2026

Google play scraper for Python inspired by <facundoolano/google-play-scraper>

Python 934 240 Updated Aug 5, 2024

Node.js scraper to get data from Google Play

JavaScript 2,705 694 Updated Dec 16, 2025

Consists of ~500k human annotations on the RICO dataset identifying various icons based on their shapes and semantics, and associations between selected general UI elements and their text labels. A…

32 3 Updated Jun 27, 2024

142,416 structured images for icon classification and recognition

20 8 Updated Apr 9, 2022

Code released for our CHI2023 paper "UEyes: Understanding Visual Saliency across User Interface Types"

Jupyter Notebook 31 4 Updated Jul 16, 2024
Next