shijiatongxue

Starred repositories

The open source coding agent.

TypeScript 78,718 6,961 Updated Jan 20, 2026

Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, load balancing, and logging. [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthr…

Python 34,126 5,392 Updated Jan 20, 2026

State-of-the-art Machine Learning for the web. Run 🤗 Transformers directly in your browser, with no need for a server!

JavaScript 15,229 1,068 Updated Jan 14, 2026
TypeScript 10,313 760 Updated Jan 19, 2026

kill trees of processes

JavaScript 353 39 Updated Jun 17, 2020

🚀 The fast, Pythonic way to build MCP servers and clients

Python 22,098 1,662 Updated Jan 20, 2026

The official TypeScript SDK for Model Context Protocol servers and clients

TypeScript 11,375 1,567 Updated Jan 19, 2026

Visual UI analysis tool

Python 427 83 Updated Jul 26, 2023

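🥢 Cook like Laoxiangji 🐔. The main part was completed in 2024; this is not an official Laoxiangji repository. The text comes from the "Laoxiangji Dish Traceability Report" and has been summarized, edited, and organized. CookLikeHOC.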

JavaScript 22,890 2,314 Updated Oct 17, 2025

AGENTS.md — a simple, open format for guiding coding agents

TypeScript 15,650 1,085 Updated Dec 19, 2025

PaSa -- an advanced paper search agent powered by large language models. It can autonomously make a series of decisions, including invoking search tools, reading papers, and selecting relevant refe…

Python 1,489 112 Updated May 27, 2025

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 17,858 1,533 Updated Jan 4, 2026

Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷

Python 5,750 317 Updated Jan 16, 2026

✏️ Web-based image segmentation tool for object detection, localization, and keypoints

Vue 2,264 472 Updated Jan 30, 2025
Python 8,674 519 Updated Oct 9, 2024

[ICLR 2023] Official implementation of the paper "DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection"

Python 2,725 306 Updated Jul 31, 2024

[CVPR 2023] Official implementation of the paper "Mask DINO: Towards A Unified Transformer-based Framework for Object Detection and Segmentation"

Python 1,483 153 Updated Dec 20, 2023

[NeurIPS 2023] Official implementation of the paper "Segment Everything Everywhere All at Once"

Python 4,768 457 Updated Aug 19, 2024

Tesseract Open Source OCR Engine (main repository)

C++ 71,978 10,467 Updated Jan 8, 2026

Pure JavaScript OCR for more than 100 Languages 📖🎉🖥

JavaScript 37,762 2,360 Updated Jan 1, 2026

The headless rich text editor framework for web artisans.

TypeScript 34,635 2,837 Updated Jan 19, 2026

The ProseMirror WYSIWYM editor

JavaScript 8,547 369 Updated Dec 31, 2025

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Jupyter Notebook 53,207 6,201 Updated Sep 18, 2024

a state-of-the-art-level open visual language model | multimodal pre-trained model

Python 6,720 451 Updated May 29, 2024

Simple Python version management

Roff 44,087 3,234 Updated Jan 15, 2026

Turn any PDF or image document into structured data for your AI. A powerful, lightweight OCR toolkit that bridges the gap between images/PDFs and LLMs. Supports 100+ languages.

Python 68,336 9,666 Updated Jan 19, 2026

GLM-4.6V/4.5V/4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

Python 2,123 146 Updated Dec 18, 2025

GLM-4.5: Agentic, Reasoning, and Coding (ARC) Foundation Models

Python 3,876 398 Updated Jan 19, 2026

Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.

Python 52,419 4,352 Updated Jan 19, 2026