Kehan Lu kehanlu

Speech and NLP @ntu-spml-lab

145 followers · 91 following

National Taiwan University
Taiwan
06:41 (UTC +08:00)
https://kehan.lu
@kehan_lu

Achievements

x3 x2

Starred repositories

ddlBoJack / Speech-Resources

Speech-related labs, companies, resources, internships, and more; recommendations and self-nominations are welcome

582 67 Updated Nov 13, 2024

ckyang1124 / SAKE

LALM knowledge editing

Python 5 Updated Nov 1, 2025

d223302 / SHANKS

JavaScript 2 Updated Oct 12, 2025

ga642381 / Game-Time-Benchmark

Game-Time: Evaluating Temporal Dynamics in Spoken Language Models

4 Updated Oct 7, 2025

rezzsl / HighRateMOS

HighRateMOS is the first non-intrusive MOS prediction model that explicitly models sampling rates, achieving first place in five out of eight metrics in AudioMOS Challenge 2025 Track3.

10 Updated Sep 15, 2025

jim-schwoebel / voice_datasets

🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).

2,082 253 Updated Jun 6, 2024

01Zhangbw / Speech-and-audio-papers-Top-Conference

112 4 Updated May 25, 2025

louislam / uptime-kuma

A fancy self-hosted monitoring tool

JavaScript 78,910 7,010 Updated Nov 25, 2025

robmsmt / ASR-Audio-Data-Links

A list of publicly available audio data that anyone can download for ASR or other speech activities

Shell 231 22 Updated Aug 6, 2021

google-gemini / gemini-cli

An open-source AI agent that brings the power of Gemini directly into your terminal.

TypeScript 84,609 9,580 Updated Nov 25, 2025

d223302 / STITCH

JavaScript 1 Updated Jul 22, 2025

NVIDIA / audio-flamingo

PyTorch implementation of Audio Flamingo: Series of Advanced Audio Understanding Language Models

873 71 Updated Nov 19, 2025

kehanlu / DeSTA2.5-Audio

Code for DeSTA2.5-Audio

Python 122 7 Updated Aug 7, 2025

QwenLM / Qwen3

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Python 25,510 1,783 Updated Oct 13, 2025

fishaudio / fish-speech

SOTA Open Source TTS

Python 24,175 1,973 Updated Nov 6, 2025

kehanlu / Speech-IFEval

Leaderboard and code for "Speech-IFEval", Interspeech 2025

Python 22 1 Updated May 27, 2025

ckyang1124 / LALM-Evaluation-Survey

Collection of works for evaluating (and analyzing) large audio-language models (LALMs)

40 Updated Aug 11, 2025

ckyang1124 / SAKURA

Official GitHub repository for paper "SAKURA: On the Multi-hop Reasoning of Large Audio-Language Models Based on Speech and Audio Information" (Interspeech 2025)

Python 19 3 Updated Aug 14, 2025