- 浙江 . 杭州
- http://www.ucasp.net
Stars
Capstone Project: Training and Finetuning for OWL ViT for Referring Expression Task
object detection based on owl-vit
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
This repository contains the code of the CVPR 2022 paper "Image Segmentation Using Text and Image Prompts".
A .NET library for manipulating PowerPoint presentations
Diffusion model(SD,Flux,Wan,Qwen Image,...) inference in pure C/C++
Windows版本微信客户端(非网页版)自动化,可实现简单的发送、接收微信消息,简单微信机器人
The open-source CapCut alternative
DeerFlow is a community-driven Deep Research framework, combining language models with tools like web search, crawling, and Python execution, while contributing back to the open-source community.
Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL
Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.
[CVPR 2023] Official code for "Zero-shot Referring Image Segmentation with Global-Local Context Features"
A browser extension for automating your browser by connecting blocks
Mobile remote display and control in browser page.浏览器投屏控制远程手机
[CVPR 2025] EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation
Mobile UI viewer in browser, view the UI in a tree view, and generate XPath automatically.
🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!
🚀 「大模型」1小时从0训练26M参数的视觉多模态VLM!🌏 Train a 26M-parameter VLM from scratch in just 1 hours!
Inspect tool to inspect UIs from an automation perspective
Chrome Extensions Samples