- Japan
- https://yousan.notion.site/
- @ayousanz
- https://ayousanz.hatenadiary.jp/archive
- https://zenn.dev/ayousanz
-
asset-diff-intake-data Public
Test repository for CI that imports assets from branch A into branch B in Unity
Apache License 2.0 Updated Nov 1, 2025 -
kanjikana-model Public
Forked from digital-go-jp/kanjikana-model
Model for matching the kanji and kana forms of personal names
Jupyter Notebook MIT License Updated Oct 28, 2025 -
uPiper Public
Unity TTS plugin: Piper neural synthesis + OpenJTalk Japanese + Unity AI Inference Engine. Windows/Mac/Linux/Android/iOS ready. High-quality voices for games & apps.
-
StyleTTS2 Public
Forked from yl4579/StyleTTS2
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
Python MIT License Updated Oct 23, 2025 -
smalltts Public
Forked from smallbraineng/smalltts
Superfast text to speech in any voice for Japanese
Python Other Updated Oct 23, 2025 -
ComfyUI Public
Forked from comfyanonymous/ComfyUI
The most powerful and modular diffusion model GUI, API and backend with a graph/nodes interface.
Python GNU General Public License v3.0 Updated Oct 21, 2025 -
StreamVoiceAnon Public
Forked from Plachtaa/StreamVoiceAnon
Real-time streaming voice anonymization & voice conversion
Python Apache License 2.0 Updated Oct 20, 2025 -
neutts-air Public
Forked from neuphonic/neutts-air
On-device TTS model by Neuphonic
Python Apache License 2.0 Updated Oct 20, 2025 -
TaDiCodec Public
Forked from AmphionTeam/TaDiCodec
This repository contains a series of works on diffusion-based speech tokenizers, including the official implementation of the paper: "TaDiCodec: Text-aware Diffusion Speech Tokenizer for Speech Lan…
Python Updated Oct 20, 2025 -
piper-plus Public
Enhanced Piper TTS with Japanese support, WebAssembly, multi-GPU training, and quality improvements. Features OpenJTalk integration, browser-based TTS, auto dictionary download. Install: pip instal…
-
DiariZen Public
Forked from BUTSpeechFIT/DiariZen
A toolkit for speaker diarization.
Jupyter Notebook MIT License Updated Oct 18, 2025 -
Dolphin Public
Forked from bytedance/Dolphin
The official repo for “Dolphin: Document Image Parsing via Heterogeneous Anchor Prompting”, ACL, 2025.
Python MIT License Updated Oct 18, 2025 -
UtterTune Public
Forked from shuheikatoinfo/UtterTune
LoRA-based phoneme/prosody control for LLM-based TTS with no G2P - Lightweight adapter for edit and control the target language's phoneme-level pronunciation and prosody while preserving other lang…
Python MIT License Updated Oct 12, 2025 -
CosyVoice Public
Forked from FunAudioLLM/CosyVoice
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
Python Apache License 2.0 Updated Oct 8, 2025 -
VoiceStar Public
Forked from jasonppy/VoiceStar
VoiceStar: Robust, Duration-controllable TTS that can Extrapolate
Python MIT License Updated Oct 2, 2025