Stars
This is the official implementation of LiSenNet
Qwen3-VL is the multimodal large language model series developed by the Qwen team, Alibaba Cloud.
✨✨Freeze-Omni: A Smart and Low Latency Speech-to-speech Dialogue Model with Frozen LLM
Reading notes about Multimodal Large Language Models, Large Language Models, and Diffusion Models
Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit
The official repo of Qwen2-Audio, the chat and pretrained large audio language models proposed by Alibaba Cloud.
Paddle Graph Learning (PGL) is an efficient and flexible graph learning framework based on PaddlePaddle