-
snow-shot Public
Forked from mg-chao/snow-shot
A very handy screenshot tool: snow-shot
TypeScript GNU General Public License v3.0 Updated Nov 5, 2025 -
pyvideotrans Public
Forked from jianchang512/pyvideotrans
Translate a video from one language to another and add dubbing, with support for speech-recognition transcription, speech synthesis, and subtitle translation.
Python GNU General Public License v3.0 Updated Sep 18, 2025 -
MonkeyOCR Public
Forked from Yuliang-Liu/MonkeyOCR
A lightweight LMM-based Document Parsing Model
Python Apache License 2.0 Updated Sep 10, 2025 -
MNN Public
Forked from alibaba/MNN
MNN is a blazing fast, lightweight deep learning framework, battle-tested by business-critical use cases in Alibaba. Full multimodal LLM Android App: [MNN-LLM-Android](./apps/Android/MnnLlmChat/READ…
C++ Apache License 2.0 Updated Aug 23, 2025 -
RAG-Anything Public
Forked from HKUDS/RAG-Anything
"RAG-Anything: All-in-One RAG System"
Python MIT License Updated Aug 15, 2025 -
qdrant Public
Forked from qdrant/qdrant
Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/
Rust Apache License 2.0 Updated Aug 7, 2025 -
MultiTalk Public
Forked from MeiGen-AI/MultiTalk
Let Them Talk: Audio-Driven Multi-Person Conversational Video Generation
Python Apache License 2.0 Updated Jul 12, 2025 -
pptxtojson Public
Forked from pipipi-pikachu/pptxtojson
Office PowerPoint (.pptx) file to JSON | Convert PPTX files into readable JSON data
JavaScript MIT License Updated Jun 22, 2025 -
Kimi-Audio Public
Forked from MoonshotAI/Kimi-Audio
Kimi-Audio, an open-source audio foundation model excelling in audio understanding, generation, and conversation
Python Updated Jun 21, 2025 -
LiveTalking Public
Forked from lipku/LiveTalking
Streaming 2D digital human
Python Apache License 2.0 Updated Jun 12, 2025 -
developer-roadmap Public
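Forked from kamranahmedse/developer-roadmap
Development learning paths across fields: interactive roadmaps, guides, and other educational content to help developers grow in their careers.
TypeScript Other Updated May 24, 2025 -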
livecc Public
Forked from showlab/livecc
LiveCC: Learning Video LLM with Streaming Speech Transcription at Scale (CVPR 2025)
Python Updated Apr 27, 2025 -
GPT-SoVITS Public
Forked from RVC-Boss/GPT-SoVITS
Just 1 minute of voice data can be used to train a good TTS model! (few-shot voice cloning)
Python MIT License Updated Apr 25, 2025 -
ChatTTS-ui Public
Forked from jianchang512/ChatTTS-ui
A simple local web interface that uses ChatTTS to synthesize text into speech, and also exposes an API for external use.
Python Other Updated Apr 21, 2025 -
VideoCaptioner Public
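Forked from WEIFENG2333/VideoCaptioner
🎬 Kaka Subtitle Assistant (卡卡字幕助手) | VideoCaptioner - An LLM-based intelligent subtitle assistant covering the full workflow of video subtitle generation, sentence segmentation, correction, and subtitle translation. A powerful tool for easy and efficient video subtitling.
Python GNU General Public License v3.0 Updated Apr 18, 2025 -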
OpenAvatarChat Public
Forked from HumanAIGC-Engineering/OpenAvatarChat
Real-time conversational 2D digital human with object recognition
Python Apache License 2.0 Updated Apr 10, 2025 -
Spark-TTS Public
Forked from SparkAudio/Spark-TTS
Spark-TTS inference code
Python Apache License 2.0 Updated Apr 9, 2025 -
triton-windows Public
Forked from woct0rdho/triton-windows
Triton-Windows: GPU optimization and model acceleration. Fork of the Triton language and compiler for Windows support and easy installation
MLIR MIT License Updated Mar 29, 2025 -
MoneyPrinterTurbo Public
Forked from harry0703/MoneyPrinterTurbo
Generate high-definition short videos with one click using AI LLMs.
Python MIT License Updated Mar 23, 2025 -
easy-dataset Public
Forked from ConardLi/easy-dataset
250319 - A powerful tool for creating fine-tuning datasets for LLMs
JavaScript Updated Mar 21, 2025 -
ChatTTS Public
Forked from 2noise/ChatTTS
Text-to-speech: a generative speech model for daily dialogue.
Python GNU Affero General Public License v3.0 Updated Mar 14, 2025 -
wechatVideoDownload Public
Forked from qiye45/wechatVideoDownload
WeChat Channels download tool supporting videos, livestream replays, and live streams
Updated Mar 13, 2025 -
MinerU Public
Forked from opendatalab/MinerU
A one-stop, open-source, high-quality data extraction tool that converts PDF to Markdown and JSON.
Python GNU Affero General Public License v3.0 Updated Mar 13, 2025 -
OmniHuman-1-hack Public
Forked from johndpope/OmniHuman-1-hack
mash up of Wan2.1 + Meta Sapiens + Seaweed Diffusion APT for One-Step Video Generation if you have compute - call me
Python Updated Mar 12, 2025 -
TANGO Public
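Forked from CyberAgentAILab/TANGO
Digital human for video generation: co-speech gesture video reenactment with hierarchical audio-motion embedding and diffusion interpolation
Python Other Updated Mar 11, 2025 -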
wav2lip384 Public
Forked from hyfevian/wav2lip384
wav2lip384 generator network weights, from 不蠢不蠢
Python GNU General Public License v3.0 Updated Mar 7, 2025 -
LLaMA-Factory Public
Forked from hiyouga/LLaMA-Factory
LLaMA-Factory fine-tuning: Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Python Apache License 2.0 Updated Mar 3, 2025 -
LightRAG Public
Forked from HKUDS/LightRAG
"LightRAG: Simple and Fast Retrieval-Augmented Generation"
Python MIT License Updated Feb 28, 2025