Stars
基于PQAEF (https://github.com/QuwanAI/PQAEF) 框架设计的情感陪伴对话系统测评基准
Peking University & Quwan Ability Evaluation Framework ;
C++ 11 algorithm implementation for voice conversion using harmonic plus stochastic models
Python短视频去水印爬虫:抖音,皮皮虾,火山,微视,最右,快手,全民小视频,皮皮搞笑,西瓜视频,虎牙,梨视频,acfun,好看视频...
AI桌宠2.2(网页端toklen白嫖国产大模型服务器(glm4,kimi,deepseekv2),语音识别,屏幕识别自动发送,live2d 2.0和3.0模型,gpt-sovits语音,coysvoice语音,edge-tts语音(支持多语言音色),本地ollama模型无限制聊天)(主流国产大模型api接口支持)
real time face swap and one-click video deepfake with only a single image
Core Engine of Singing Voice Conversion & Singing Voice Clone
High-quality pro audio resampler / sample rate conversion C++ library. Very fast, for both audio resampling and time-series interpolation.
CVPR2020/TNNLS2023: Central Similarity Quantization/Hashing for Efficient Image and Video Retrieval
🌻 传统直播:HTML5播放器、M3U8直播/点播、RTMP直播、低延迟、推流/播流地址鉴权。🍏 实时直播:WebRTC
✨ Yao is an all-in-one application engine that enables developers to create web apps, REST APIs, business applications, and more, with AI as a development partner.
The most easy-to-understand tutorial for using LoRA (Low-Rank Adaptation) within diffusers framework for AI Generation Researchers🔥
🎒 飞书 ×(GPT-4 + GPT-4V + DALL·E-3 + Whisper)= 飞一般的工作体验 🚀 语音对话、角色扮演、多话题讨论、图片创作、表格分析、文档导出 🚀
🤖 可 DIY 的 多模态 AI 聊天机器人 | 🚀 快速接入 微信、 QQ、Telegram、等聊天平台 | 🦈支持DeepSeek、Grok、Claude、Ollama、Gemini、OpenAI | 工作流系统、网页搜索、AI画图、人设调教、虚拟女仆、语音对话 |
Stable diffusion for real-time music generation (web app)
Implementation of MusicLM, Google's new SOTA model for music generation using attention networks, in Pytorch
This is static lib for Piano Transcription app. Transcribes polyphonic piano pieces from audio (MP3, WAV, etc.) into MIDI-files
3D Tune-In Toolkit is a custom open-source C++ library developed within the EU-funded project 3D Tune-In. The Toolkit provides a high level of realism and immersiveness within binaural 3D audio sim…
An audio digital processing toolbox based on a workflow/pipeline principle