-
ai-toolkit Public
Forked from ostris/ai-toolkitThe ultimate training toolkit for finetuning diffusion models
Python MIT License UpdatedDec 22, 2025 -
-
ComfyUI_Qwen2_5-VL-Instruct Public
Forked from IuvenisSapiens/ComfyUI_Qwen3-VL-InstructThe successful integration of Qwen2.5-VL-Instruct series into the ComfyUI platform has enabled a smooth operation, supporting (but not limited to) text-based queries, video queries, single-image qu…
Python Apache License 2.0 UpdatedNov 29, 2025 -
ComfyUI-QwenVL Public
Forked from 1038lab/ComfyUI-QwenVLComfyUI-QwenVL custom node integrates the Qwen-VL series, including the latest Qwen3-VL models, including Qwen2.5-VL and the latest Qwen3-VL, to enable advanced multimodal AI for text generation, i…
Python GNU General Public License v3.0 UpdatedNov 21, 2025 -
chatbot-ollama Public
Forked from ivanfioravanti/chatbot-ollamaChatbot Ollama is an open source chat UI for Ollama.
TypeScript Other UpdatedNov 21, 2025 -
comfyui-ollama Public
Forked from stavsap/comfyui-ollamaPython Apache License 2.0 UpdatedNov 20, 2025 -
ComfyUI-HunyuanVideo-Foley Public
Forked from phazei/ComfyUI-HunyuanVideo-FoleyHunyuanVideo-Foley: Multimodal Diffusion with Representation Alignment for High-Fidelity Foley Audio Generation.
Python Other UpdatedNov 17, 2025 -
-
index-tts-vllm Public
Forked from Ksuriuri/index-tts-vllmAdded vLLM support to IndexTTS for faster inference.
-
SoulX-Podcast Public
Forked from Soul-AILab/SoulX-PodcastSoulX-Podcast is an inference codebase by the Soul AI team for generating high-fidelity podcasts from text.
Python Apache License 2.0 UpdatedNov 4, 2025 -
ComfyUI_RH_VideoAsPrompt Public
Forked from HM-RunningHub/ComfyUI_RH_VideoAsPromptThis is a VideoAsPrompt ComfyUI plugin
Python UpdatedOct 31, 2025 -
ComfyUI-AdvancedLivePortrait Public
Forked from PowerHouseMan/ComfyUI-AdvancedLivePortraitPython UpdatedOct 16, 2025 -
ComfyUI_RH_DreamOmni2 Public
Forked from HM-RunningHub/ComfyUI_RH_DreamOmni2A ComfyUI node for dvlab-research/DreamOmni2
Python UpdatedOct 14, 2025 -
ComfyUI_RH_Ovi Public
Forked from HM-RunningHub/ComfyUI_RH_OviComfyUI custom nodes for Ovi joint video+audio generation
Python Apache License 2.0 UpdatedOct 7, 2025 -
ComfyUI-HunyuanVideoWrapper Public
Forked from kijai/ComfyUI-HunyuanVideoWrapperPython UpdatedOct 4, 2025 -
-
ComfyUI-MiniCPM Public
Forked from 1038lab/ComfyUI-MiniCPMA custom ComfyUI node for MiniCPM vision-language models, supporting v4, v4.5, and v4 GGUF formats, enabling high-quality image captioning and visual analysis.
Python GNU General Public License v3.0 UpdatedSep 19, 2025 -
imgutils Public
Forked from deepghs/imgutilsA convenient and user-friendly anime-style image data processing library that integrates various advanced anime-style image processing models
Python MIT License UpdatedSep 11, 2025 -
Step-Audio2 Public
Forked from stepfun-ai/Step-Audio2Step-Audio 2 is an end-to-end multi-modal large language model designed for industry-strength audio understanding and speech conversation.
Python Apache License 2.0 UpdatedSep 10, 2025 -
twitter-media-downloader Public
Forked from mmpx12/twitter-media-downloadertwmd: CLI/GUI Apiless twitter downlaoder. Download medias from single tweet or a whole profile.
Go UpdatedSep 2, 2025 -
bilibili-api Public
Forked from Nemo2011/bilibili-api哔哩哔哩常用API调用。支持视频、番剧、用户、频道、音频等功能。原仓库地址:https://github.com/MoyuScript/bilibili-api
Python GNU General Public License v3.0 UpdatedAug 30, 2025 -
VideoModelStudio Public
Forked from jbilcke-hf/VideoModelStudioGradio webapp to train AI Video models using Finetrainers
-
-
InfiniteTalk Public
Forked from MeiGen-AI/InfiniteTalkUnlimited-length talking video generation that supports image-to-video and video-to-video generation
Python Apache License 2.0 UpdatedAug 24, 2025 -
Stand-In_Preprocessor_ComfyUI Public
Forked from WeChatCV/Stand-In_Preprocessor_ComfyUIThe core component of Stand-In, the preprocessor, is essential—only images processed through it can fully unlock the capabilities of Stand-In.
Python UpdatedAug 19, 2025 -
Qwen2.5-Omni Public
Forked from QwenLM/Qwen2.5-OmniQwen2.5-Omni is an end-to-end multimodal model by Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, video, and performing real-time speech generation.
Jupyter Notebook Apache License 2.0 UpdatedJul 14, 2025 -
-
ComfyUI-LatentSyncWrapper Public
Forked from ShmuelRonen/ComfyUI-LatentSyncWrapperThis node provides lip-sync capabilities in ComfyUI using ByteDance's LatentSync model. It allows you to synchronize video lips with audio input.
Python UpdatedJul 10, 2025 -
ComfyUI-FramePackWrapper_PlusOne Public
Forked from tori29umai0123/ComfyUI-FramePackWrapper_PlusOnePython Apache License 2.0 UpdatedJun 30, 2025 -