- Tokyo/Japan
- @santa128bit
Stars
Chrome DevTools for coding agents
A reference implementation for the specification that can create and configure a dev container from a devcontainer.json.
Run Claude Code, Gemini, Codex — or any coding agent — in a clean, isolated sandbox with sensitive data redaction and observability baked in.
Kiro-compatible Spec-Driven Development for Claude Code, Cursor, Gemini CLI and Qwen Code. High-quality commands that enforce a structured requirements→design→tasks workflow and steering, transformin…
A Curated List of Awesome Table Structure Recognition (TSR) Research, including models, papers, datasets, and code. Continuously updated.
SPRINT: Script-agnostic Structure Recognition in Tables
A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Tongyi Lab, Alibaba Group.
The source code repository for the paper.
Qwen DianJin (通义点金): LLMs for the Financial Industry by Alibaba Cloud
A PyTorch implementation of "Real-time Scene Text Detection with Differentiable Binarization".
A Python library to extract tabular data from PDFs
A Unified Toolkit for Deep Learning-Based Table Extraction
A Docker-powered service for PDF document layout analysis. It provides powerful and flexible PDF analysis and allows for the segmentation and classification of differen…
DocLayout-YOLO: Enhancing Document Layout Analysis through Diverse Synthetic Data and Global-to-Local Adaptive Perception
TexTeller converts images to LaTeX formulas (image2latex, LaTeX OCR) with higher accuracy and superior generalization ability, enabling it to cover most usage scenarios.
Multilingual Document Layout Parsing in a Single Vision-Language Model
Yomitoku is an AI-powered document image analysis package designed specifically for the Japanese language.
AlphaGo Moment for Model Architecture Discovery.
The official PyTorch implementation of the EACL 2024 short paper "Flow Matching for Conditional Text Generation in a Few Sampling Steps"
Kanban board to manage your AI coding agents
Context7 MCP Server -- Up-to-date code documentation for LLMs and AI code editors
MemOS (Preview) | Intelligence Begins with Memory
Interactively visualize Japan's national budget, edit it freely through trial and error, and share your own budget proposal.
Create, develop, and deploy Slack apps from the command-line ✨
Copilot Chat extension for VS Code
State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!
An open-source AI agent that brings the power of Gemini directly into your terminal.
🥰 Building AI-based conversational avatars lightning fast ⚡️💬