Highlights
- Pro
Starred repositories
本项目基于 Playwright 和AI过滤的闲鱼多任务实时/定时监控与智能分析工具,配备了功能完善的后台管理界面。帮助用户节省闲鱼商品过滤,能及时找到心仪商品。
A command-line tool that can plot graph of any binary implicit function equation or inequality, supporting both Cartesian and polar coordinates.
FunCodec is a research-oriented toolkit for audio quantization and downstream applications, such as text-to-speech synthesis, music generation et.al.
This is the code for paper: XY-Tokenizer: Mitigating the Semantic-Acoustic Conflict in Low-Bitrate Speech Codecs. Demos, technical insights and experimental results are presented on
Fast CUDA implementation of (differentiable) soft dynamic time warping for PyTorch, with lengths specified for samples in batch.
P2P Voice/Video phone App for local networks.
Model Context Protocol Servers
The official MCP server implementation for the Perplexity API Platform
AI Agent + Coding Agent + 300+ assistants: agentic AI desktop with autonomous coding, intelligent automation, and unified access to frontier LLMs.
Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your permission every step of the way.
A community driven list of open source alternatives to proprietary software and applications.
👾 Fast and simple video download library and CLI tool written in Go
A curated list of research papers and resources on code-switching
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
中文羊驼大模型三期项目 (Chinese Llama-3 LLMs) developed from Meta Llama 3
PyTorch codes for "LST: Ladder Side-Tuning for Parameter and Memory Efficient Transfer Learning"
A Unified Library for Parameter-Efficient and Modular Transfer Learning
Dual-Path Attention and Recurrent Network for speech separation
Libri-CSS: dataset and evaluation pipeline
Speech separation with utterance-level PIT experiments
According to funcwj's uPIT, the training code supporting multi-gpu is written, and the Dataloader is reconstructed.
LASR: a lighting ASR model training platform
transform-average-concatenate (TAC) method for end-to-end microphone permutation and number invariant ad-hoc beamforming.
Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation Pytorch's Implement
An Open Source Tools for Speaker Recognition
A Clash GUI based on tauri. Supports Windows, macOS and Linux.
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech