Skip to content
View gardner's full-sized avatar

Organizations

@causeroot @cobudget-old

Block or report gardner

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

repo for active speaker detection for media videos.

Python 31 1 Updated Nov 19, 2023

The Modern And Developer Centric Python Web Framework. Be sure to read the documentation and join the Discord channel for questions: https://discord.gg/TwKeFahmPZ

Python 2,357 135 Updated Mar 25, 2025

A structural code search engine for Al agents.

TypeScript 382 14 Updated Jan 9, 2026

Rust command-line tool for querying Datadog logs and APM spans

Rust 1 Updated Dec 17, 2025

RNode is an open, free and flexible digital radio interface with many uses

Python 436 140 Updated Dec 28, 2025

This repository aims at providing efficient CNNs for Audio Tagging. We provide AudioSet pre-trained models ready for downstream training and extraction of audio embeddings.

Python 319 56 Updated Nov 20, 2024

Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".

Jupyter Notebook 1,407 241 Updated May 21, 2023

MAD: A Scalable Dataset for Language Grounding in Videos from Movie Audio Descriptions

Python 173 3 Updated Oct 22, 2023
Python 3 Updated Jun 18, 2025

INTERSPEECH2023: Target Active Speaker Detection with Audio-visual Cues

Python 57 4 Updated May 29, 2023

Official implementation of TalkNCE (ICASSP 2024).

Python 12 Updated Apr 30, 2025

Improving Mamaba performance on Video Understanding task

Python 41 7 Updated Dec 30, 2025

R1-like Video-LLM for Temporal Grounding

Python 130 3 Updated Jun 20, 2025
Python 6 Updated Nov 26, 2025

Kubeflow Deployment Manifests

YAML 981 1,029 Updated Dec 30, 2025

GlobalBuildingAtlas: an open global and complete dataset of building polygons, heights and LoD1 3D models

Python 1,858 176 Updated Jan 7, 2026

GLM-ASR-Nano: A robust, open-source speech recognition model with 1.5B parameters

Python 668 58 Updated Dec 30, 2025

VibeVoice: Expressive, longform conversational speech synthesis. (Community fork)

Python 911 342 Updated Dec 24, 2025

A clean, GraphQL-based Model Context Protocol server for Twenty CRM. Enables natural language interactions with your CRM data through Claude and other AI assistants.

TypeScript 3 1 Updated Dec 8, 2025

Agentic voice AI using ConversationRelay. 5 minute setup.

TypeScript 15 16 Updated Jul 8, 2025

A machine learning compiler for GPUs, CPUs, and ML accelerators

C++ 3,884 721 Updated Jan 9, 2026

🍕 Peer-to-peer file transfers in your browser

TypeScript 9,850 597 Updated Jan 8, 2026

Tools for managing DNS across multiple providers

Python 3,607 430 Updated Dec 19, 2025

Claude Code superpowers: core skills library

Shell 15,032 1,217 Updated Dec 27, 2025

Valdi is a cross-platform UI framework that delivers native performance without sacrificing developer velocity.

C++ 16,046 544 Updated Jan 9, 2026

Multilingual Voice Understanding Model

Python 7,328 680 Updated Dec 30, 2025

An interface library for RL post training with environments.

Python 977 143 Updated Jan 9, 2026

pg_lake: Postgres with Iceberg and data lake access

C 1,360 65 Updated Jan 9, 2026

Early WebMCP proposal / implementation - since evolved and worked on by much more capable folks that develop the web: https://github.com/webmachinelearning/webmcp

JavaScript 333 23 Updated Mar 22, 2025
Next