Stars
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…
Effortless data labeling with AI support from Segment Anything and other awesome models.
Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Andr…
Open-source industrial-grade ASR models supporting Mandarin, Chinese dialects and English, achieving a new SOTA on public Mandarin ASR benchmarks, while also offering outstanding singing lyrics rec…
[ICLR 2025] Pyramidal Flow Matching for Efficient Video Generative Modeling
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain. In which, we present a ca…
An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.
Calibrate the camera with ZhangZhengyou method (in both distortion case and no distortion case)
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Open-source framework for conversational voice AI agents
这是一个简单的技术科普教程项目,主要聚焦于解释一些有趣的,前沿的技术概念和原理。每篇文章都力求在 5 分钟内阅读完成。
stock股票.获取股票数据,计算股票指标,筹码分布,识别股票形态,综合选股,选股策略,股票验证回测,股票自动交易,支持PC及移动设备。
Awesome-data shows most interesting data-source around the financial world!
AKShare is an elegant and simple financial data interface library for Python, built for human beings! 开源财经数据接口库
LangChain 的中文入门教程
A modular graph-based Retrieval-Augmented Generation (RAG) system
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Facebook Oculus Quest Hand Detection&Pose Estimation System
This project aims to train some alternative face landmark detection models based on dlib.
This repository is an official PyTorch implementation of the paper "Learnable Triangulation of Human Pose" (ICCV 2019, oral). Proposed method archives state-of-the-art results in multi-view 3D huma…
agesb / TransQuest
Forked from TharinduDR/TransQuestBias Mitigation for Machine Translation Quality Estimation
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.