dukGuo

Follow

Dake Guo dukGuo

Follow

Student in aslp@npu, Interested in Speech Synthesis

33 followers · 63 following

Northwestern Polytechnical University
China
05:55 (UTC -12:00)

Achievements

Achievements

Pinned Loading

Qwen3-Omni Qwen3-Omni Public

Forked from QwenLM/Qwen3-Omni

Qwen3-omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, images, and video, as well as generating speech in real time.

Jupyter Notebook
Qwen2.5-Omni Qwen2.5-Omni Public

Forked from QwenLM/Qwen2.5-Omni

Qwen2.5-Omni is an end-to-end multimodal model by Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, video, and performing real-time speech generation.

Jupyter Notebook
SoulX-Podcast SoulX-Podcast Public

Forked from Soul-AILab/SoulX-Podcast

SoulX-Podcast is an inference codebase by the Soul AI team for generating high-fidelity podcasts from text.

Python
OSUM OSUM Public

Forked from ASLP-lab/OSUM

OSUM: Open Speech Understanding Model, open-sourced by ASLP@NPU.

Python
valle-audiodec valle-audiodec Public

Inference code for Audiodec-Valle-Wenetspeech4TTS

Python 50 2