- Suzhou, China
- http://placebokkk.github.io/
Stars
🚀 Efficient implementations of state-of-the-art linear attention models
Analyze the inference of Large Language Models (LLMs). Analyze aspects like computation, storage, transmission, and hardware roofline model in a user-friendly interface.
List of papers related to neural network quantization in recent AI conferences and journals.
Production First and Production Ready End-to-End Speech Recognition Toolkit
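MNBVC (Massive Never-ending BT Vast Chinese corpus): a very large-scale Chinese corpus, benchmarked against the 40T of data used to train ChatGPT. The MNBVC dataset covers not only mainstream culture but also niche subcultures and even "Martian script" (火星文) data. It includes plain-text Chinese data in every form: news, essays, novels, books, magazines, papers, script dialogue, forum posts, wiki, classical poetry, lyrics, product descriptions, jokes, embarrassing anecdotes, chat logs, and more.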
An Open Source Conversational AI Platform for Deep-Domain Voice Interfaces and Chatbots.
Production First and Production Ready End-to-End Keyword Spotting Toolkit
A 10000+ hours dataset for Chinese speech recognition
Learn Go with test-driven development
📋 Survey papers summarizing advances in deep learning, NLP, CV, graphs, reinforcement learning, recommendations, etc.
📚 Go: Under The Hood | Go 语言原本 | https://golang.design/under-the-hood
📚 Modern C++ Tutorial: C++11/14/17/20 On the Fly | https://changkun.de/modern-cpp/
Towards hot directions in industrial end-to-end speech recognition
A list of awesome compiler projects and papers for tensor computation and deep learning.
PyTorch CTC implementation for ASR. Uses Eesen's FST decoder framework.
You like pytorch? You like micrograd? You love tinygrad! ❤️
🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API
Command-line program to download videos from YouTube.com and other video sites
Collection of generative models in TensorFlow
A curated list of resources for text detection/recognition (optical character recognition) with deep learning methods.
PyTorch implementation of LF-MMI for End-to-end ASR
This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.
Extension to Kaldi implementing the standard i-vector hyperparameter estimation and i-vector extraction procedure
Speech Recognition using DeepSpeech2.
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.