Skip to content
View hm-li0420's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report hm-li0420

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

"AI-Trader: Can AI Beat the Market?" Live Trading: https://hkuds.github.io/AI-Trader/

Python 8,360 1,161 Updated Nov 3, 2025

Official Repository of UltraVoice

JavaScript 42 1 Updated Oct 28, 2025

SoulX-Podcast is an inference codebase by the Soul AI team for generating high-fidelity podcasts from text.

Python 1,345 124 Updated Nov 2, 2025

An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System

Python 45 9 Updated Nov 3, 2025

Vogent Turn: fast, open-source turn-detection for Voice AI applications

Python 32 2 Updated Oct 28, 2025

Whisper-Flamingo [Interspeech 2024] and mWhisper-Flamingo [IEEE SPL 2025] for Audio-Visual Speech Recognition and Translation

Jupyter Notebook 191 14 Updated Jul 29, 2025

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 77,703 11,469 Updated Nov 3, 2025

Code for the blog "Neural audio codecs: how to get audio into LLMs"

Python 111 3 Updated Oct 20, 2025

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 48,790 8,158 Updated Dec 9, 2024

A simple implementation for improving CosyVoice2 by GRPO method

Python 13 1 Updated Oct 17, 2025

OmniVinci is an omni-modal LLM for joint understanding of vision, audio, and language.

Python 384 38 Updated Oct 29, 2025

ICASSP2026 HumDial Challenge

Python 23 1 Updated Oct 30, 2025

MRSAudio: A Large-Scale Multimodal Recorded Spatial Audio Dataset with Refined Annotations

Python 21 1 Updated Oct 15, 2025

清华大学计算机系课程攻略 Guidance for courses in Department of Computer Science and Technology, Tsinghua University

HTML 35,996 7,814 Updated Sep 18, 2025

Open-Source Turn-Taking Detection Model and Dataset for Full-Duplex Spoken Dialogue Systems

Python 45 2 Updated Oct 12, 2025

Official code of SenSE.

Python 56 5 Updated Oct 30, 2025

Official code for"DiaMoE-TTS: A Unified IPA-based Dialect TTS Framework with Mixture-of-Experts and Parameter-Efficient Zero-Shot Adaptation"

Python 162 13 Updated Oct 20, 2025

This is the official repository of ``Scalable Neural Vocoder from Range-Null Space Decomposition'', which is submitted to TPAMI.

Python 31 5 Updated Oct 11, 2025

A CLI text-to-speech tool using the Kokoro model, supporting multiple languages, voices (with blending), and various input formats including EPUB books and PDF documents.

Python 875 104 Updated Sep 13, 2025

Ming-UniAudio: Speech LLM for Joint Understanding, Generation and Editing with Unified Representation

Python 297 25 Updated Oct 28, 2025

整理开源的中文大语言模型,以规模较小、可私有化部署、训练成本较低的模型为主,包括底座模型,垂直领域微调及应用,数据集与教程等。

21,591 2,048 Updated May 19, 2025

Awesome Pretrained Chinese NLP Models,高质量中文预训练模型&大模型&多模态模型&大语言模型集合

Python 5,442 509 Updated Oct 25, 2025
Jupyter Notebook 61 16 Updated Sep 21, 2025

[ASRU 2025] Omni-R1: Do You Really Need Audio to Fine-Tune Your Audio LLM?

8 Updated Aug 6, 2025

Qwen3-omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, images, and video, as well as generating speech in real time.

Jupyter Notebook 2,796 158 Updated Oct 9, 2025

We Speech Toolkit, LLM based Speech Toolkit for Speech Understanding, Generation, and Interaction

Python 126 7 Updated Oct 31, 2025

Official implementation of DNSMOS Pro (accepted at INTERSPEECH 2024).

Python 65 7 Updated Jun 8, 2025

A Python library written for Morse Code

Python 110 36 Updated Jun 27, 2021

三角洲摩斯电码解码器

Python 14 Updated Aug 14, 2025

Official repository for the WenetSpeech-Chuan dataset.

Python 66 1 Updated Oct 22, 2025
Next