- Zheng Zhou, China
Starred repositories
Just a simple Christmas tree, based on reddit story
A tool for parsing OFD (.ofd) documents.
Parse OFD files with Python: ofd2img, ofd2pdf, pdf2ofd, img2ofd (pure-Python OFD parsing).
Translators is a library that aims to bring free, multiple, enjoyable translations to individuals and students in Python.
VoxCPM: Tokenizer-Free TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
PDF scientific paper translation with preserved formats - AI-based full-text bilingual translation of PDF documents that fully preserves layout; supports Google/DeepL/Ollama/OpenAI and other services; provides CLI/GUI/Docker
Step-Audio 2 is an end-to-end multi-modal large language model designed for industry-strength audio understanding and speech conversation.
PDF2zh for Zotero | Zotero plugin for translating PDFs into Chinese
The code used to train and run inference with MMDocIR
Multilingual Document Layout Parsing in a Single Vision-Language Model
OpenOCR: An Open-Source Toolkit for General OCR Research and Applications, integrates a unified training and evaluation benchmark, commercial-grade OCR and Document Parsing systems, and faithful re…
A flexible, free, and unlimited Python tool to translate between different languages in a simple way using multiple translators.
Simultaneous speech-to-text model
A Python wrapper for the Doc2X API that also comes with local text processing (to improve PDF recall in RAG).
This is a database of 300.000+ symbols containing Equities, ETFs, Funds, Indices, Currencies, Cryptocurrencies and Money Markets.
A Python library to access ISO country, subdivision, language, currency and script definitions and their translations.
PaddleOCR inference in PyTorch. Converted from [PaddleOCR](https://github.com/PaddlePaddle/PaddleOCR)
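cube studio: an open-source, cloud-native, one-stop machine learning / deep learning / large-model AI platform covering the full algorithm pipeline: compute rental platform, online notebook development, drag-and-drop pipeline orchestration, multi-node multi-GPU distributed training, hyperparameter search, inference serving with vGPU virtualization, edge computing, annotation platform, automated annotation, SFT fine-tuning / reward model / reinforcement learning training for large models such as deepseek, multi-node large-model inference with vllm/ollama/mindie, private knowledge base, AI model marketplace, support for …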
A self-hosted Telegram file downloader for continuous, stable, and unattended downloads.
MagicTryOn is a video virtual try-on framework based on a large-scale video diffusion Transformer.
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3, Qwen3-MoE, DeepSeek-R1, GLM4.5, InternLM3, Llama4, ...) and 300+ MLLMs (Qwen3-VL, Qwen3-Omni, InternVL3.5, Ovis2.5, GLM4.5v, Llava, …
Modeling and Bayesian analysis of fluorescence and absorbance assays.