Stars
- All languages
- Adblock Filter List
- Arduino
- Assembly
- C
- C#
- C++
- CSS
- Clojure
- Common Lisp
- Cuda
- Dart
- Dockerfile
- Elixir
- Elm
- Fluent
- GDScript
- Go
- Groovy
- HCL
- HTML
- Idris
- Java
- JavaScript
- Jupyter Notebook
- Kotlin
- LiveScript
- Logos
- Lua
- MDX
- Makefile
- OCaml
- Objective-C
- Objective-C++
- OpenSCAD
- PHP
- PLpgSQL
- Perl
- Pug
- Python
- Ruby
- Rust
- Scala
- Shell
- Smarty
- Svelte
- Swift
- Talon
- Tcl
- TeX
- TypeScript
- Vim Script
- Vue
- Web Ontology Language
- YAML
- Zig
repo for active speaker detection for media videos.
The Modern And Developer Centric Python Web Framework. Be sure to read the documentation and join the Discord channel for questions: https://discord.gg/TwKeFahmPZ
A structural code search engine for Al agents.
Rust command-line tool for querying Datadog logs and APM spans
RNode is an open, free and flexible digital radio interface with many uses
This repository aims at providing efficient CNNs for Audio Tagging. We provide AudioSet pre-trained models ready for downstream training and extraction of audio embeddings.
Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".
MAD: A Scalable Dataset for Language Grounding in Videos from Movie Audio Descriptions
INTERSPEECH2023: Target Active Speaker Detection with Audio-visual Cues
Improving Mamaba performance on Video Understanding task
GlobalBuildingAtlas: an open global and complete dataset of building polygons, heights and LoD1 3D models
GLM-ASR-Nano: A robust, open-source speech recognition model with 1.5B parameters
VibeVoice: Expressive, longform conversational speech synthesis. (Community fork)
A clean, GraphQL-based Model Context Protocol server for Twenty CRM. Enables natural language interactions with your CRM data through Claude and other AI assistants.
Agentic voice AI using ConversationRelay. 5 minute setup.
A machine learning compiler for GPUs, CPUs, and ML accelerators
🍕 Peer-to-peer file transfers in your browser
Tools for managing DNS across multiple providers
Claude Code superpowers: core skills library
Valdi is a cross-platform UI framework that delivers native performance without sacrificing developer velocity.
Multilingual Voice Understanding Model
An interface library for RL post training with environments.
pg_lake: Postgres with Iceberg and data lake access
Early WebMCP proposal / implementation - since evolved and worked on by much more capable folks that develop the web: https://github.com/webmachinelearning/webmcp