-
The University of Tokyo
- Japan
Highlights
- Pro
Stars
Audio Dataset for training CLAP and other models
"DeepCode: Open Agentic Coding (Paper2Code & Text2Web & Text2Backend)"
Advanced real-time screen translator for games, hardcoded subtitles in videos, static text and etc.
Adds adversarial noise to a given image, tricking an image classification model into misclassifying it to a specified target class.
deck is a tool for creating deck using Markdown and Google Slides.
A powerful coding agent toolkit providing semantic retrieval and editing capabilities (MCP server & other integrations)
This repository aims at providing efficient CNNs for Audio Tagging. We provide AudioSet pre-trained models ready for downstream training and extraction of audio embeddings.
A script engine for "yu-gi-oh!" and sample gui
AI-driven Yu-Gi-Oh! bot using deep reinforcement learning and LLMs
ImageBind One Embedding Space to Bind Them All
This is a template for deploying a FastAPI endpoint on AWS SageMaker.
Example 📓 Jupyter notebooks that demonstrate how to build, train, and deploy machine learning models using 🧠 Amazon SageMaker.
This repository contains implementations of the paper Audio-Text Retrieval in Context
[CVPR 2025] MMAudio: Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis
Auto-Generate `GO struct (datamodel)` from `Elasticsearch (also, OpenSearch) Mapping JSON`
[IJCV] FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds. AI拟音大师,给你的无声视频添加生动而且同步的音效 😝
In this code, we propose a different implementation fo the original paper https://arxiv.org/abs/1709.07124 under PyTorch. This architecture is constructed by unfolding the iterations of a sequenti…
Implementation of deep recurrent nonnegative matrix factorization (DR-NMF) for speech separation
SNMF: Integrated Learning of Mutational Signatures and Prediction of DNA Repair Deficiencies by Goossens S, Tepeli YI, Seale C, and Gonçalves JP (bioRxiv 2024)
Lightweight and powerful workflow engine for enterprise & small teams. Single binary with Web UI. 100% open source. No vendor lock-in. It natively supports running containers and executing commands…
This is a CLI tool to download shared files and folders from Google Drive.