A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 16,067 3,188 Updated Nov 10, 2025

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…

C++ 12,095 1,855 Updated Nov 11, 2025

Effortless data labeling with AI support from Segment Anything and other awesome models.

Python 6,930 773 Updated Nov 10, 2025

Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Andr…

C++ 8,773 972 Updated Nov 10, 2025

Open-source industrial-grade ASR models supporting Mandarin, Chinese dialects and English, achieving a new SOTA on public Mandarin ASR benchmarks, while also offering outstanding singing lyrics rec…

Python 1,581 138 Updated Sep 22, 2025

Kolors Team

Python 4,571 351 Updated Nov 13, 2024

[ICLR 2025] Pyramidal Flow Matching for Efficient Video Generative Modeling

Python 3,116 308 Updated Dec 21, 2024

A PyTorch-based Speech Toolkit

Python 10,758 1,598 Updated Nov 7, 2025

Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020). We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain. In which, we present a ca…

Python 1,835 309 Updated Mar 14, 2023

An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.

Python 3,610 291 Updated Aug 14, 2025

Calibrate a camera with Zhang Zhengyou's method (both with and without distortion)

Python 543 145 Updated Mar 18, 2024

EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

Python 8,369 735 Updated Aug 13, 2024

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python 13,397 1,356 Updated Oct 1, 2025

A programming language designed to be usable across multiple human languages

C++ 102 Updated Sep 7, 2025

Open-source framework for conversational voice AI agents

C 8,535 995 Updated Nov 10, 2025

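A simple technical explainer tutorial project, focused mainly on explaining interesting, cutting-edge technical concepts and principles. Each article aims to be readable within 5 minutes.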

6,188 571 Updated Nov 10, 2025

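stock: fetch stock data, compute stock indicators and chip distribution, recognize stock patterns, screen stocks comprehensively, apply stock-picking strategies, validate with backtesting, and trade automatically. Supports PC and mobile devices.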

Python 10,648 2,120 Updated Aug 28, 2025

Awesome-data shows the most interesting data sources from around the financial world!

563 62 Updated Mar 27, 2025

AKShare is an elegant and simple financial data interface library for Python, built for human beings! An open-source financial data interface library.

Python 14,293 2,575 Updated Nov 6, 2025

A Chinese-language introductory tutorial for LangChain

8,603 672 Updated Apr 19, 2025

A modular graph-based Retrieval-Augmented Generation (RAG) system

Python 29,091 3,049 Updated Nov 11, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 62,196 7,527 Updated Nov 10, 2025

Facebook Oculus Quest Hand Detection & Pose Estimation System

Python 84 19 Updated Dec 17, 2020

Measure the SMPL body model

Python 249 29 Updated Apr 11, 2025

This project aims to train some alternative face landmark detection models based on dlib.

Python 3 2 Updated Sep 10, 2019

This repository is an official PyTorch implementation of the paper "Learnable Triangulation of Human Pose" (ICCV 2019, oral). The proposed method achieves state-of-the-art results in multi-view 3D huma…

Python 1,128 181 Updated Oct 3, 2023

Main OpenCap processing pipeline

Python 244 206 Updated Sep 21, 2025

Bias Mitigation for Machine Translation Quality Estimation

Python 4 1 Updated Mar 13, 2022

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 152,364 31,107 Updated Nov 10, 2025

Load SMPL in blender

Python 348 37 Updated Jul 2, 2023