-
SDU
- Shandong University Qianfoshan Campus, No. 17923, Jingshi Road, Lixia District, Jinan City, Shandong Province, China
-
22:07
(UTC +08:00) - https://www.sdu.edu.cn/
Lists (9)
Sort Name ascending (A-Z)
Stars
A visualization page for the ontology of the AudioSet dataset, which also provides the function of merging internal labels
Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others)
Code repo for the 2025 ICMLWMLA workshop submission
一款轻量、可定制的开源桌面硬件监控软件 — 实时监测 CPU、GPU、内存、磁盘、网络等系统性能。支持横竖屏显示、多语言、主题切换、透明度显示、三色报警,界面简洁且高度可配置。A lightweight and customizable desktop hardware monitoring tool — real-time monitoring of system performance …
Transformer with Local Modeling by Convolution for Speech Separation and Enhancement
MiMo-V2-Flash: Efficient Reasoning, Coding, and Agentic Foundation Model
MiMo-Audio: Audio Language Models are Few-Shot Learners
"Paper2Slides: From Paper to Presentation in One Click"
Writing AI Conference Papers: A Handbook for Beginners
This is the official implement of Mamba-SEUNet: Mamba UNet for Monaural Speech Enhancement
A personal toolkit for single/multi-channel speech recognition & enhancement & separation.
[CVPR 2025] SCSegamba: Lightweight Structure-Aware Vision Mamba for Crack Segmentation in Structures
AudioTrust: Benchmarking the Multi-faceted Trustworthiness of Audio Large Language Models
Conv-TasNet: Surpassing Ideal Time-Frequency Magnitude Masking for Speech Separation Pytorch's Implement
Your faithful, impartial partner for audio evaluation — know yourself, know your rivals. 真实评测,知己知彼。
Power-Guided Grouped SRU for Real-Time Causal Audio-Visual Speech Separation
VoxCPM: Tokenizer-Free TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
Disentangled Speech Embeddings using Cross-Modal Self-Supervision
算法岗笔试面试大全,励志做算法届的《五年高考,三年模拟》!
大模型算法岗面试题(含答案):常见问题和概念解析 "大模型面试题"、"算法岗面试"、"面试常见问题"、"大模型算法面试"、"大模型应用基础"
📄 适合中文的简历模板收集(LaTeX,HTML/JS and so on)由 @hoochanlon 维护
Music repair method to convert lossy MP3 compressed music to lossless music.
This is the demo of our paper "IIANet: An Intra- and Inter-Modality Attention Network for Audio-Visual Speech Separation".