-
Google
- San Francisco
- http://www.jasonmayes.com
- in/WebAI
- @jason_mayes
- https://goo.gle/WebAIVideos
- https://goo.gle/Learn-WebAI
Starred repositories
Lightning-Fast, On-Device, Multilingual TTS — running natively via ONNX.
LiteRT, successor to TensorFlow Lite. is Google's On-device framework for high-performance ML & GenAI deployment on edge platforms, via efficient conversion, runtime, and optimization
Explainer for the Cross-Origin Storage (COS) API
Landing page for Web AI Summit 2025 Bangle.js
Official implementation of OpenTrack.
Base yolov11n.pt trained on 6877 images of Drones and UAVs
Voyager is an interactive RGBD video generation model conditioned on camera input, and supports real-time 3D reconstruction.
Continuous Thought Machines, because thought takes time and reasoning is a process.
An open protocol enabling communication and interoperability between opaque agentic applications.
Kortix – build, manage and train AI Agents.
ICCV 2025 ACTalker: an end-to-end video diffusion framework for talking head synthesis that supports both single and multi-signal control (e.g., audio, expression).
A Chrome extension for asking questions over websites
📊 EEG signal processing and machine learning in JavaScript
Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expressiveness and quality on par with—or even surpassing—top TTS …
Retrieve large (GBs) AI binary model files from cloud, cache locally as sharded blobs to load faster on 2nd page load, returns stored file as data URL
BiomedGPT: A Generalist Vision-Language Foundation Model for Diverse Biomedical Tasks
The official JS client library for the Massive.com REST and WebSocket API.
Official code of DynOMo: Online Point Tracking by Dynamic Online Monocular Gaussian Reconstruction (3DV 2025))
Spotify Web AI DJ - client side agentic smarts using Gemma 2, two billion parameter LLM, to play what a user wants via natural speech input
A utility for fetching large files in chunks with support for parallel downloads and progress tracking.
Simple JavaScript app to make API calls to Spotify
A Web AI Agent running entirely client side in browser, that's capable of controlling a fictional flights webpage, to get the job done by using Google's Gemma 2 (2B) model in JavaScript via WebGPU …
Run LLMs in the Browser with MLC / WebLLM ✨