Stars
A ComfyUI integration for FireRedTTS‑2, a real-time multi-speaker TTS system enabling high-quality, emotionally expressive dialogue and monologue synthesis. Leveraging a streaming architecture and …
High-performance GPGPU inference of OpenAI's Whisper automatic speech recognition (ASR) model
An enhanced Wan2.2 Image-to-Video node specifically designed to fix the slow-motion issue in 4-step LoRAs (like lightx2v).
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
A TTS model capable of generating ultra-realistic dialogue in one pass.
An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System