Pinned Loading
-
google-research/leaf-audio
google-research/leaf-audio PublicLEAF is a learnable alternative to audio features such as mel-filterbanks, that can be initialized as an approximation of mel-filterbanks, and then be trained for the task at hand, while using a ve…
-
google-research/diffstride
google-research/diffstride PublicTF/Keras code for DiffStride, a pooling layer with learnable strides.
-
facebookresearch/tdfbanks
facebookresearch/tdfbanks Public archivePytorch implementation of time-domain filterbanks
-
kyutai-labs/moshi
kyutai-labs/moshi PublicMoshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.
-
kyutai-labs/hibiki
kyutai-labs/hibiki PublicHibiki is a model for streaming speech translation (also known as simultaneous translation). Unlike offline translation—where one waits for the end of the source utterance to start translating--- H…
-
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.