-
University of Surrey, CVSSP
- https://liangxg787.github.io/
Highlights
- Pro
Lists (1)
Sort Name ascending (A-Z)
Stars
[CVPR 2022 Workshop] Biometrics Workshop Pet Biometric Challenge TOP3
Speech recognition module for Python, supporting several engines and APIs, online and offline.
Data manipulation and transformation for audio signal processing, powered by PyTorch
A python wrapper for Speech Signal Processing Toolkit (SPTK).
Robust Speech Recognition via Large-Scale Weak Supervision
Kimi-Audio, an open-source audio foundation model excelling in audio understanding, generation, and conversation
[NeurIPS 2021] You Only Look at One Sequence
An anofficial implementation of "Audio Sep" (Separate Anything You Describe) take by Huggin Face
BioCPPNet: Automatic Bioacoustic Source Separation with Deep Neural Networks
A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch
Python Approximate Nearest Neighbor Search in very high dimensional spaces with optimised indexing.
Deezer source separation library including pretrained models.
A PyTorch implementation of Conv-TasNet described in "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation" with Permutation Invariant Training (PIT).
Code for the paper Hybrid Spectrogram and Waveform Source Separation
OpenMusic: SOTA Text-to-music (TTM) Generation
Official implementation of "Separate Anything You Describe"
animal2vec: A self-supervised transformer for rare-event raw audio input
Python library for audio and music analysis
An extremely fast Python package and project manager, written in Rust.
YOLOv11-RGBT: Towards a Comprehensive Single-Stage Multispectral Object Detection Framework(Supports RGBT detection for all YOLO series from YOLOv3 to YOLOv13, as well as RTDETR. 【Ultralytics YOLOv…
[NeurIPS 2025] YOLOv12: Attention-Centric Real-Time Object Detectors
🔥 Clone and recreate any website as a modern React app in seconds
Implementation of "YOLOv13: Real-Time Object Detection with Hypergraph-Enhanced Adaptive Visual Perception".