Organizations

@causeroot @cobudget-old


Repo for active speaker detection in media videos.

Python 31 1 Updated Nov 19, 2023

The Modern And Developer Centric Python Web Framework. Be sure to read the documentation and join the Discord channel for questions: https://discord.gg/TwKeFahmPZ

Python 2,360 134 Updated Mar 25, 2025

A structural code search engine for AI agents.

TypeScript 487 20 Updated Jan 9, 2026

Rust command-line tool for querying Datadog logs and APM spans

Rust 3 Updated Dec 17, 2025

RNode is an open, free and flexible digital radio interface with many uses

Python 443 141 Updated Dec 28, 2025

This repository aims at providing efficient CNNs for Audio Tagging. We provide AudioSet pre-trained models ready for downstream training and extraction of audio embeddings.

Python 322 55 Updated Nov 20, 2024

Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".

Jupyter Notebook 1,406 242 Updated May 21, 2023

MAD: A Scalable Dataset for Language Grounding in Videos from Movie Audio Descriptions

Python 173 3 Updated Oct 22, 2023
Python 3 Updated Jun 18, 2025

INTERSPEECH2023: Target Active Speaker Detection with Audio-visual Cues

Python 57 4 Updated May 29, 2023

Official implementation of TalkNCE (ICASSP 2024).

Python 12 Updated Apr 30, 2025

Improving Mamba performance on video understanding tasks

Python 43 7 Updated Dec 30, 2025

R1-like Video-LLM for Temporal Grounding

Python 132 3 Updated Jun 20, 2025
Python 6 Updated Nov 26, 2025

Kubeflow Deployment Manifests

YAML 982 1,033 Updated Jan 15, 2026

GlobalBuildingAtlas: an open global and complete dataset of building polygons, heights and LoD1 3D models

Python 1,900 177 Updated Jan 7, 2026

GLM-ASR-Nano: A robust, open-source speech recognition model with 1.5B parameters

Python 681 59 Updated Dec 30, 2025

VibeVoice: Expressive, longform conversational speech synthesis. (Community fork)

Python 927 347 Updated Dec 24, 2025

A clean, GraphQL-based Model Context Protocol server for Twenty CRM. Enables natural language interactions with your CRM data through Claude and other AI assistants.

TypeScript 3 1 Updated Dec 8, 2025

Agentic voice AI using ConversationRelay. 5 minute setup.

TypeScript 16 17 Updated Jul 8, 2025

A machine learning compiler for GPUs, CPUs, and ML accelerators

C++ 3,910 725 Updated Jan 19, 2026

🍕 Peer-to-peer file transfers in your browser

TypeScript 9,862 600 Updated Jan 12, 2026

Tools for managing DNS across multiple providers

Python 3,613 430 Updated Jan 17, 2026

An agentic skills framework & software development methodology that works.

Shell 28,750 2,165 Updated Jan 19, 2026

Valdi is a cross-platform UI framework that delivers native performance without sacrificing developer velocity.

C++ 16,215 545 Updated Jan 18, 2026

Multilingual Voice Understanding Model

Python 7,380 684 Updated Dec 30, 2025

An interface library for RL post training with environments.

Python 1,061 159 Updated Jan 16, 2026

pg_lake: Postgres with Iceberg and data lake access

C 1,373 68 Updated Jan 16, 2026

Early WebMCP proposal / implementation - since evolved and worked on by much more capable folks that develop the web: https://github.com/webmachinelearning/webmcp

JavaScript 334 23 Updated Mar 22, 2025