- Rotten City
- https://www.robinsloan.com
Lists (10)
Sort Name ascending (A-Z)
Stars
Pure TypeScript media toolkit for reading, writing, and converting video and audio files, directly in the browser.
Visible geometry edge projection and flattening based on three-mesh-bvh.
Ruby extension for the libvips image processing library.
A Roguelike Game written in vanilla Ruby
Procedural terrain generation with diffusion models
Code and Models for the paper NeuralRemaster with Phase-Preserving Diffusion
Official codebase for SIGGRAPH Asia 2025 paper: GSWT: Gaussian Splatting Wang Tiles, containing the GSWT renderer.
[ICCV 2025] VisualCloze: A universal image generation framework that can support a wide range of in-domain tasks and generalize to unseen ones. (🔥 🔥 🔥 Merged into offical pipelines of diffusers.)
[Arxiv 2025] In-Video Instructions: Visual Signals as Generative Control
OCR, layout analysis, reading order, table recognition in 90+ languages
Official implementation of "Continuous Autoregressive Language Models"
Single-line / open-paths cursive font (exports in OTF-SVG, OTF and WOFF2 formats)
A light, cross-platform library for building web-based desktop apps with Deno or Python.
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
From baby GPT to diffusion GPT: An annotated implementation of a character-level discrete diffusion model (adapted from Karpathy’s baby GPT).
Ruby FFI bindings for llama.cpp to run open-source LLMs such as GPT-OSS, Qwen 3, Gemma 3, and Llama 3 locally with Ruby.
Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection
A Python Library for Generating PDFs and Images from HTML, powered by PlutoBook
A tiny, dependency-free JavaScript module for making textarea elements grow with their content.
Apriltag detector using the apriltag C library and compiled to WASM using emscripten.