svjack

🧑‍🌾

svjack

🧑‍🌾

96 followers · 1.5k following

https://huggingface.co/svjack

Achievements

ai-toolkit Public
Forked from ostris/ai-toolkit

The ultimate training toolkit for finetuning diffusion models

Python MIT License Updated Dec 22, 2025
musubi-tuner Public
Forked from kohya-ss/musubi-tuner

Python 1 Updated Dec 8, 2025
ComfyUI_Qwen2_5-VL-Instruct Public
Forked from IuvenisSapiens/ComfyUI_Qwen3-VL-Instruct

The successful integration of Qwen2.5-VL-Instruct series into the ComfyUI platform has enabled a smooth operation, supporting (but not limited to) text-based queries, video queries, single-image qu…

Python Apache License 2.0 Updated Nov 29, 2025
ComfyUI-QwenVL Public
Forked from 1038lab/ComfyUI-QwenVL

ComfyUI-QwenVL custom node integrates the Qwen-VL series, including the latest Qwen3-VL models, including Qwen2.5-VL and the latest Qwen3-VL, to enable advanced multimodal AI for text generation, i…

Python GNU General Public License v3.0 Updated Nov 21, 2025
chatbot-ollama Public
Forked from ivanfioravanti/chatbot-ollama

Chatbot Ollama is an open source chat UI for Ollama.

TypeScript Other Updated Nov 21, 2025
comfyui-ollama Public
Forked from stavsap/comfyui-ollama

Python Apache License 2.0 Updated Nov 20, 2025
ComfyUI-HunyuanVideo-Foley Public
Forked from phazei/ComfyUI-HunyuanVideo-Foley

HunyuanVideo-Foley: Multimodal Diffusion with Representation Alignment for High-Fidelity Foley Audio Generation.

Python Other Updated Nov 17, 2025
ComfyUI-AV-FunASR Public
Forked from avenstack/ComfyUI-AV-FunASR

Python Updated Nov 12, 2025
index-tts-vllm Public
Forked from Ksuriuri/index-tts-vllm

Added vLLM support to IndexTTS for faster inference.

Python 1 Apache License 2.0 Updated Nov 5, 2025
SoulX-Podcast Public
Forked from Soul-AILab/SoulX-Podcast

SoulX-Podcast is an inference codebase by the Soul AI team for generating high-fidelity podcasts from text.

Python Apache License 2.0 Updated Nov 4, 2025
ComfyUI_RH_VideoAsPrompt Public
Forked from HM-RunningHub/ComfyUI_RH_VideoAsPrompt

This is a VideoAsPrompt ComfyUI plugin

Python Updated Oct 31, 2025
ComfyUI-AdvancedLivePortrait Public
Forked from PowerHouseMan/ComfyUI-AdvancedLivePortrait

Python Updated Oct 16, 2025
ComfyUI_RH_DreamOmni2 Public
Forked from HM-RunningHub/ComfyUI_RH_DreamOmni2

A ComfyUI node for dvlab-research/DreamOmni2

Python Updated Oct 14, 2025
ComfyUI_RH_Ovi Public
Forked from HM-RunningHub/ComfyUI_RH_Ovi

ComfyUI custom nodes for Ovi joint video+audio generation

Python Apache License 2.0 Updated Oct 7, 2025
ComfyUI-HunyuanVideoWrapper Public
Forked from kijai/ComfyUI-HunyuanVideoWrapper

Python Updated Oct 4, 2025
Keye Public
Forked from Kwai-Keye/Keye

Python Updated Oct 3, 2025
ComfyUI-MiniCPM Public
Forked from 1038lab/ComfyUI-MiniCPM

A custom ComfyUI node for MiniCPM vision-language models, supporting v4, v4.5, and v4 GGUF formats, enabling high-quality image captioning and visual analysis.

Python GNU General Public License v3.0 Updated Sep 19, 2025
imgutils Public
Forked from deepghs/imgutils

A convenient and user-friendly anime-style image data processing library that integrates various advanced anime-style image processing models

Python MIT License Updated Sep 11, 2025
Step-Audio2 Public
Forked from stepfun-ai/Step-Audio2

Step-Audio 2 is an end-to-end multi-modal large language model designed for industry-strength audio understanding and speech conversation.

Python Apache License 2.0 Updated Sep 10, 2025
twitter-media-downloader Public
Forked from mmpx12/twitter-media-downloader

twmd: CLI/GUI Apiless twitter downlaoder. Download medias from single tweet or a whole profile.

Go Updated Sep 2, 2025
bilibili-api Public
Forked from Nemo2011/bilibili-api

哔哩哔哩常用API调用。支持视频、番剧、用户、频道、音频等功能。原仓库地址：https://github.com/MoyuScript/bilibili-api

Python GNU General Public License v3.0 Updated Aug 30, 2025
VideoModelStudio Public
Forked from jbilcke-hf/VideoModelStudio

Gradio webapp to train AI Video models using Finetrainers

Python 1 Updated Aug 30, 2025
LLaVA-NeXT Public
Forked from LLaVA-VL/LLaVA-NeXT

Python Apache License 2.0 Updated Aug 28, 2025
InfiniteTalk Public
Forked from MeiGen-AI/InfiniteTalk

Unlimited-length talking video generation that supports image-to-video and video-to-video generation

Python Apache License 2.0 Updated Aug 24, 2025
Stand-In_Preprocessor_ComfyUI Public
Forked from WeChatCV/Stand-In_Preprocessor_ComfyUI

The core component of Stand-In, the preprocessor, is essential—only images processed through it can fully unlock the capabilities of Stand-In.

Python Updated Aug 19, 2025
Qwen2.5-Omni Public
Forked from QwenLM/Qwen2.5-Omni

Qwen2.5-Omni is an end-to-end multimodal model by Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, video, and performing real-time speech generation.

Jupyter Notebook Apache License 2.0 Updated Jul 14, 2025
ComfyUI-MMAudio Public
Forked from kijai/ComfyUI-MMAudio

Python MIT License Updated Jul 12, 2025
ComfyUI-LatentSyncWrapper Public
Forked from ShmuelRonen/ComfyUI-LatentSyncWrapper

This node provides lip-sync capabilities in ComfyUI using ByteDance's LatentSync model. It allows you to synchronize video lips with audio input.

Python Updated Jul 10, 2025
ComfyUI-FramePackWrapper_PlusOne Public
Forked from tori29umai0123/ComfyUI-FramePackWrapper_PlusOne

Python Apache License 2.0 Updated Jun 30, 2025
Step-Audio Public
Forked from stepfun-ai/Step-Audio

Python Apache License 2.0 Updated Jun 29, 2025

svjack

Achievements

Achievements

ai-toolkit Public

Uh oh!

musubi-tuner Public

Uh oh!

ComfyUI_Qwen2_5-VL-Instruct Public

Uh oh!

ComfyUI-QwenVL Public

Uh oh!

chatbot-ollama Public

Uh oh!

comfyui-ollama Public

Uh oh!

ComfyUI-HunyuanVideo-Foley Public

Uh oh!

ComfyUI-AV-FunASR Public

Uh oh!

index-tts-vllm Public

Uh oh!

SoulX-Podcast Public

Uh oh!

ComfyUI_RH_VideoAsPrompt Public

Uh oh!

ComfyUI-AdvancedLivePortrait Public

Uh oh!

ComfyUI_RH_DreamOmni2 Public

Uh oh!

ComfyUI_RH_Ovi Public

Uh oh!

ComfyUI-HunyuanVideoWrapper Public

Uh oh!

Keye Public

Uh oh!

ComfyUI-MiniCPM Public

Uh oh!

imgutils Public

Uh oh!

Step-Audio2 Public

Uh oh!

twitter-media-downloader Public

Uh oh!

bilibili-api Public

Uh oh!

VideoModelStudio Public

Uh oh!

LLaVA-NeXT Public

Uh oh!

InfiniteTalk Public

Uh oh!

Stand-In_Preprocessor_ComfyUI Public

Uh oh!

Qwen2.5-Omni Public

Uh oh!

ComfyUI-MMAudio Public

Uh oh!

ComfyUI-LatentSyncWrapper Public

Uh oh!

ComfyUI-FramePackWrapper_PlusOne Public

Uh oh!

Step-Audio Public

Uh oh!