Stars
Django backend for managing licenses and subscriptions
An app for creating audio-based content such as song covers and speech using Retrieval-based Voice Conversion.
A Python-based Xiaozhi AI for users who want the full Xiaozhi experience without owning specialized hardware.
小智ESP32的Java企业级管理平台,提供设备监控、音色定制、角色切换和对话记录管理的前后端及服务端一体化解决方案
a comprehensive, all-in-one audio dataset processing tool that integrates audio format conversion, loudness normalization, audio slicing, audio dataset analysis, and media information analysis.
A collection of neural vocoders suitable for singing voice synthesis tasks.
A simple, high-quality voice conversion tool focused on ease of use and performance.
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…
📷 EasyPhoto | Your Smart AI Photo Generator.
Easily train a good VC model with voice data <= 10 mins!
AliceNavigator / Music-Source-Separation-Training-GUI
Forked from ZFTurbo/Music-Source-Separation-TrainingMSST-GUI is a Qt5-based inference GUI, designed to provide a convenient and intuitive way to inference (mainly for my own use)
Real-time end-to-end singing voice conversion system based on DDSP (Differentiable Digital Signal Processing)
本项目为xiaozhi-esp32提供后端服务,帮助您快速搭建ESP32设备控制服务器。Backend service for xiaozhi-esp32, helps you quickly build an ESP32 device control server.
使用YOLOv5和LPRNet进行车牌检测+识别(CCPD数据集)
🚀🚀🚀 A collection of some awesome public YOLO object detection series projects and the related object detection datasets.
Lane detection with PaddlePaddle. PPLanedet contains many SOTA methods, e.g. CondLaneNet, SCNN, RESA, RTFormer,UFLD
The official implementation of "CLRerNet: Improving Confidence of Lane Detection with LaneIoU"
Official implementation for "iTransformer: Inverted Transformers Are Effective for Time Series Forecasting" (ICLR 2024 Spotlight)
A plugin for Automatic1111 for automating the aging of faces using Stable Diffusion
Aging Time Lapse using Stable Diffusion
Production-ready platform for agentic workflow development.
Matlab Scripts for Evaluation of a TDOA System based on RTL-SDRs
GNU Radio and ZeroMQ based collection of applications to perform distributed localization experiments with software defined radio. Includes software for the receivers, fusion center and GUI.
a Python 2/3 GUI for automated TDoA recording & processing
Repository for training models for music source separation.