Skip to content
View lihuibng's full-sized avatar

Block or report lihuibng

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

YTsaurus is a scalable and fault-tolerant open-source big data platform.

C++ 2,116 187 Updated Dec 30, 2025

OpenOCR: An Open-Source Toolkit for General OCR Research and Applications, integrates a unified training and evaluation benchmark, commercial-grade OCR and Document Parsing systems, and faithful re…

Python 910 79 Updated Dec 27, 2025

The low-level, core functionality of boto3 and the AWS CLI.

Python 1,594 1,136 Updated Dec 30, 2025

The Institutional Data Initiative's pipeline for analyzing, refining, and publishing the Institutional Books 1.0 collection.

Python 48 6 Updated Nov 21, 2025

WebMainBench is a specialized benchmark tool for end-to-end evaluation of web main content extraction quality.

Python 9 8 Updated Nov 26, 2025

[CVPR 2024] Official RT-DETR (RTDETR paddle pytorch), Real-Time DEtection TRansformer, DETRs Beat YOLOs on Real-time Object Detection. 🔥 🔥 🔥

Python 4,677 546 Updated Dec 3, 2025

KAI Scheduler is an open source Kubernetes Native scheduler for AI workloads at large scale

Go 1,045 130 Updated Dec 30, 2025

A Rust-based regex crate wrapper for Python3 to get faster performance. 👾

Python 140 10 Updated Jul 9, 2024

High-performance regular expression matching library

C++ 5,235 775 Updated Apr 2, 2025

The Rust package manager

Rust 14,398 2,761 Updated Dec 30, 2025

An Open Phone Agent Model & Framework. Unlocking the AI Phone for Everyone

Python 20,178 3,238 Updated Dec 22, 2025

YOLO models trained by DocLayNet - power your Document Intelligent by Layout Analysis

Python 146 20 Updated Aug 3, 2025

12 weeks, 26 lessons, 52 quizzes, classic Machine Learning for all

Jupyter Notebook 82,707 19,486 Updated Dec 25, 2025

🐲 ZHLID: Open-source language identification tool specialized for fine-grained Chinese varieties.

Python 2 1 Updated Sep 29, 2025

Context7 MCP Server -- Up-to-date code documentation for LLMs and AI code editors

TypeScript 40,521 2,003 Updated Dec 30, 2025

Code highlight tokenizer written in C++

C++ 83 39 Updated Jun 6, 2024

Telegram-iOS

Swift 7,845 2,289 Updated Dec 9, 2025

cuTile is a programming model for writing parallel kernels for NVIDIA GPUs

Python 1,712 88 Updated Dec 20, 2025

Open-Source Frontier Voice AI

Python 19,255 2,135 Updated Dec 17, 2025

cloud-native distributed storage

Go 5,407 689 Updated Dec 30, 2025

Extract and convert data from any document, images, pdfs, word doc, ppt or URL into multiple formats (Markdown, JSON, CSV, HTML) with intelligent structured data extraction and advanced OCR.

Python 1,105 105 Updated Oct 31, 2025

MinerU-HTML: An SLM-powered HTML main content extractor that outputs clean HTML bodies. Perfect for Deep Research Agents, RAG applications, and training data generation.

HTML 154 18 Updated Dec 25, 2025

High performance and CommonMark compliant HTML to Markdown converter. Maintained by the Kreuzberg.dev team. Kreuzberg.dev is a fast, polyglot document intelligence engine with a Rust core. It extra…

HTML 454 41 Updated Dec 30, 2025

D-FINE: Redefine Regression Task of DETRs as Fine-grained Distribution Refinement [ICLR 2025 Spotlight]

Python 2,941 274 Updated Oct 6, 2025

Quick illustration of how one can easily read books together with LLMs. It's great and I highly recommend it.

Python 2,728 356 Updated Nov 18, 2025

LLM Council works together to answer your hardest questions

Python 12,375 2,368 Updated Nov 22, 2025

The best ChatGPT that $100 can buy.

Python 39,522 5,033 Updated Dec 30, 2025
Next