Stars
Voice Activity Detector (VAD): low-latency, high-performance, and lightweight
JAXB-based Java library for Word docx, Powerpoint pptx, and Excel xlsx files
Omnilingual ASR: Open-Source Multilingual Speech Recognition for 1600+ Languages
Windows GUI Automation with Python (based on text properties)
Low-code programming for event-driven applications
A next-generation cloud native kernel designed to unlock best-in-class performance, security primitives and efficiency savings.
NocoBase is the most extensible AI-powered no-code/low-code platform for building business applications and enterprise solutions.
[CVPR 2025] DeCLIP: Decoupled Learning for Open-Vocabulary Dense Perception
Generic automation framework for acceptance testing and RPA
A high-performance hardware-acceleration algorithm library for the OpenSSL engine, based on Kunpeng processors
A library for efficient similarity search and clustering of dense vectors.
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
An inference library for sequence-based table recognition algorithms, integrating table recognition algorithms such as PP-Structure and modelscope.
Detect and extract table regions from images in various scenarios, and correct perspective and rotation issues.
A Unified Toolkit for Deep Learning-Based Table Extraction
UniTable: Towards a Unified Table Foundation Model
A Docker-powered service for PDF document layout analysis. This service provides a powerful and flexible PDF analysis service. The service allows for the segmentation and classification of differen…
Reference PyTorch implementation and models for DINOv3
Get your documents ready for gen AI
This repository contains the official implementation of "FastVLM: Efficient Vision Encoding for Vision Language Models" - CVPR 2025
A Comprehensive Toolkit for High-Quality PDF Content Extraction
Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.
Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚