Skip to content
View xmkevin's full-sized avatar

Block or report xmkevin

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

实时交互数字人,可自定义形象与音色,支持音色克隆,对话延迟低至3s。Real-time voice interactive digital human, customizable appearance and voice, supporting voice cloning, with initial package delay as low as 3s.

Python 1,181 151 Updated Dec 18, 2025

Android 混合推送SDK,快速集成6个厂商推送,共享系统推送通道,杀死也能收到推送,推送到达率90%以上

Java 1,203 207 Updated Dec 15, 2025

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Python 19,579 2,105 Updated Oct 21, 2025

Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Andr…

C++ 9,691 1,089 Updated Jan 12, 2026

SOTA Open Source TTS

Python 24,577 2,039 Updated Jan 8, 2026

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python 14,429 1,503 Updated Jan 7, 2026

🚀 Truly open-source AI avatar(digital human) toolkit for offline video generation and digital human cloning.

C 12,109 2,000 Updated Oct 16, 2025

Spark-TTS Inference Code

Python 10,892 1,169 Updated Apr 9, 2025

🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming

Python 62,804 7,872 Updated Oct 4, 2025

No fortress, purely open ground. OpenManus is Coming.

Python 52,732 9,277 Updated Jan 5, 2026

Change data capture for a variety of databases. Please log issues at https://github.com/debezium/dbz/issues.

Java 12,280 2,822 Updated Jan 9, 2026

SymmetricDS is database replication and file synchronization software that is platform independent, web enabled, and database agnostic. It is designed to make bi-directional data replication fast, …

Java 851 234 Updated Jan 11, 2026

Ip2region is an offline IP address manager framework and locator with both IPv4 and IPv6 supported, supporting billions of data segments, ten microsecond searching performance, xdb search client fo…

Java 18,583 2,950 Updated Jan 9, 2026

本项目基于SadTalkers实现视频唇形合成的Wav2lip。通过以视频文件方式进行语音驱动生成唇形,设置面部区域可配置的增强方式进行合成唇形(人脸)区域画面增强,提高生成唇形的清晰度。使用DAIN 插帧的DL算法对生成视频进行补帧,补充帧间合成唇形的动作过渡,使合成的唇形更为流畅、真实以及自然。

Python 2,002 348 Updated Jun 4, 2023

[CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation

Python 13,506 2,594 Updated Jun 26, 2024

一个超轻量级、可以在移动端实时运行的数字人模型

Python 2,382 342 Updated Sep 18, 2025

Official implementation of "Sonic: Shifting Focus to Global Audio Perception in Portrait Animation"

Python 3,168 281 Updated Jan 8, 2026

🚀 The best real-time interactive AI avatar(digital human) with on-premise deployment and <1.5 s latency.

C++ 7,726 1,152 Updated Dec 31, 2025

English-Japanese Dictionary data (Public Domain) EJDict-hand

Python 236 16 Updated Nov 4, 2025

JMdict, JMnedict, KANJIDIC for Yomitan/Yomichan.

Shell 223 8 Updated Jan 12, 2026

The Java server library for the App Store Server API and App Store Server Notifications.

Java 255 61 Updated Jan 8, 2026

Flutter video player plugin for all desktop+mobile platforms. download prebuilt examples from github actions. https://pub.dev/packages/fvp

Dart 318 67 Updated Jan 1, 2026

A Flutter plugin that exposes device specific text to speech recognition capability.

Dart 455 292 Updated Dec 16, 2025

Flutter App That Can Transcribe Audio Offline/On Device with Whisper C++ Bindings via Rust

C++ 164 18 Updated Aug 8, 2025

Share, discover, and collect prompts from the community. Free and open source — self-host for your organization with complete privacy.

TypeScript 142,133 18,861 Updated Jan 12, 2026

Code release for NeRF (Neural Radiance Fields)

Jupyter Notebook 10,757 1,444 Updated Apr 12, 2025

Illustrate your sound waves on the fly 🚀

Swift 624 96 Updated Jun 7, 2022
HTML 607 166 Updated Jan 8, 2024

Collection of AI-related utilities. Welcome to submit issues and pull requests /收藏AI相关的实用工具,欢迎提交issues 或者pull requests

5,551 399 Updated Jan 10, 2026

🚀 BARK INFINITY GUI CMD 🎶 Powered Up Bark Text-prompted Generative Audio Model

Jupyter Notebook 1,012 93 Updated Oct 21, 2023
Next