Skip to content
View Yip-Jia-Qi's full-sized avatar

Block or report Yip-Jia-Qi

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A fully open-source humanoid arm for physical AI research and deployment in contact-rich environments.

MDX 1,523 163 Updated Nov 27, 2025

One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks

Python 3,328 442 Updated Nov 28, 2025

[Up-to-date] Large Language Model Agent: A Survey on Methodology, Applications and Challenges

2,169 63 Updated Nov 7, 2025

An open source bipedal robot control framework, based on non-linear MPC and WBC, tailered for EC-hunter80-v01 bipedal robot.

C++ 544 98 Updated May 20, 2024

[RSS 2025 Best Systems Paper Finalist] 💐Official implementation of "Learning Humanoid Standing-up Control across Diverse Postures"

Python 460 52 Updated Jun 17, 2025

3d printed usb foot pedal

C++ 8 Updated Mar 9, 2021

Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Andr…

C++ 9,056 1,006 Updated Nov 29, 2025

The hardware design for AgiBot X1.

1,001 319 Updated Apr 18, 2025
Python 5 Updated Jun 19, 2025

🤗 R1-AQA Model: mispeech/r1-aqa

Python 306 26 Updated Mar 28, 2025

Real-time Voice Activity Detection (VAD) with some example use case like simple voice bot and live transcription (realtime transcription)

Python 103 10 Updated Aug 18, 2025

Manifold is a platform for enabling workflow automation using AI assistants.

Go 465 28 Updated Nov 22, 2025

No fortress, purely open ground. OpenManus is Coming.

Python 51,044 8,908 Updated Nov 17, 2025

A nearly-live implementation of OpenAI's Whisper.

Python 3,630 497 Updated Sep 25, 2025

Real time transcription with OpenAI Whisper.

Python 2,899 483 Updated Apr 15, 2025

Converts text to speech in realtime

Python 3,643 351 Updated Jul 22, 2025

🙌 OpenHands: Code Less, Make More

Python 65,289 7,971 Updated Nov 28, 2025

Kura is a simple reproduction of the CLIO paper which uses language models to label user behaviour before clustering them based on embeddings recursively. This helps us understand user behaviour on…

Python 368 37 Updated Sep 10, 2025

An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.

Python 3,685 298 Updated Aug 14, 2025

Implementation of F5-TTS in MLX

Python 15 1 Updated Dec 13, 2024

Choose your own adventure with LLMs

TypeScript 21 6 Updated May 27, 2025

Towards Human-Friendly, Fast Learning and Adaptable Agent Communities

Jupyter Notebook 153 13 Updated Jul 14, 2025

This is a single-speaker neural text-to-speech (TTS) system capable of training in a end-to-end fashion. It is inspired by the Tacotron archicture and able to train based on unaligned text-audio pa…

Python 13 4 Updated Dec 28, 2018

This repository contains the SpeechBrain Benchmarks

Python 131 45 Updated Jul 15, 2025

An Open Source text-to-speech system built by inverting Whisper.

Jupyter Notebook 4,536 263 Updated Jun 8, 2025

Target Speaker Extraction Toolkit

Python 223 29 Updated Oct 4, 2025

General Speech Restoration

Python 1,240 151 Updated Feb 17, 2025

Music repair method to convert lossy MP3 compressed music to lossless music.

Python 322 29 Updated Aug 12, 2025

Thesis Latex Template for Nanyang Technological University (NTU)

TeX 166 55 Updated Oct 14, 2021

✨✨[NeurIPS 2025] VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction

Python 2,450 179 Updated Mar 28, 2025
Next