Skip to content
View dAItime001's full-sized avatar

Block or report dAItime001

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Streaming ASR and TTS based on FastAPI+ sherpa-onnx

Python 160 20 Updated Nov 2, 2025

Development repository for the Triton language and compiler

MLIR 17,506 2,365 Updated Nov 8, 2025

Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Andr…

C++ 8,752 970 Updated Nov 5, 2025

Port of Funasr's Paraformer model in C/C++

C 37 5 Updated Jun 19, 2024

stt websockect server using sherpa-onnx

C++ 32 5 Updated Jul 30, 2025

Tensor library for machine learning

C++ 13,522 1,385 Updated Nov 4, 2025

Omni SenseVoice: High-Speed Speech Recognition with words timestamps 🗣️🎯

Python 874 40 Updated Oct 28, 2025

AI Speech Solutions for Tasks such as ASR, Vocal Extraction, Accompaniment Extraction, Audio Denoising, and Enhancement, Support models such as paraformer, sensevoice, fireredasr, zipformer, moonsh…

C# 45 6 Updated Nov 8, 2025

c# library for decoding paraformer, sensevoice Models,used in speech recognition (ASR)

C# 63 9 Updated Oct 15, 2025

LLM inference in C/C++

C++ 89,406 13,610 Updated Nov 8, 2025

Qwen3-omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, images, and video, as well as generating speech in real time.

Jupyter Notebook 2,841 160 Updated Oct 9, 2025

Added vLLM support to IndexTTS for faster inference.

Python 838 107 Updated Oct 24, 2025

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 16,113 1,282 Updated Oct 27, 2025

🚀 The fast, Pythonic way to build MCP servers and clients

Python 20,089 1,478 Updated Nov 8, 2025

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 13,549 1,990 Updated Nov 3, 2025

Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key

Python 9,323 889 Updated Aug 28, 2025
Python 6,017 463 Updated Aug 29, 2025

An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System

Python 14,950 1,691 Updated Nov 7, 2025

MedEvalKit: A Unified Medical Evaluation Framework

Python 175 16 Updated Oct 23, 2025

[CVPR 2024] Real-Time Open-Vocabulary Object Detection

Python 5,982 567 Updated Feb 26, 2025

📊 Simple package for monitoring and control your NVIDIA Jetson [Orin, Xavier, Nano, TX] series

Python 2,416 303 Updated Nov 6, 2025

Fair-code workflow automation platform with native AI capabilities. Combine visual building with custom code, self-host or cloud, 400+ integrations.

TypeScript 154,868 49,541 Updated Nov 8, 2025

August智能体框架的相关功能包,已成功部署在Robonova机器人上

16 1 Updated Aug 31, 2025

Machine Learning Containers for NVIDIA Jetson and JetPack-L4T

Jupyter Notebook 3,902 737 Updated Nov 8, 2025

Official implementation for the paper "Full-Order Sampling-Based MPC for Torque-Level Locomotion Control via Diffusion-Style Annealing". DIAL-MPC is a novel sampling-based MPC framework for legged …

Python 880 92 Updated May 28, 2025
Python 411 28 Updated Jun 12, 2025

Python interface for unitree sdk2

Python 455 162 Updated Oct 13, 2025

Optimal Control for Switched Systems

C++ 1,193 278 Updated Oct 19, 2023
Next