Starred repositories
MotionStream: Real-Time Video Generation with Interactive Motion Controls
🐫 CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org
Janus-Series: Unified Multimodal Understanding and Generation Models
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
Create stunning countdown videos with ease using Video Timer Generator. Set your start and finish time, choose your font and colors, and download your video in seconds. Perfect for social media pos…
Examples and guides for using the Gemini API
Examples of ComfyUI workflows
Take photos via webcam with JavaScript
A boilerplate for building production-ready RESTful APIs using Node.js, Express, and Mongoose
Generate prompts using GA algorithm for a pretrained LLM
Create, manipulate, and optimize GIF images and animations
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…
A fast, local first, reactive Database for JavaScript Applications https://rxdb.info/
Official codebase for I-JEPA, the Image-based Joint-Embedding Predictive Architecture. First outlined in the CVPR paper, "Self-supervised learning from images with a joint-embedding predictive arch…
An Open-Ended Embodied Agent with Large Language Models
Interact with your documents using the power of GPT, 100% privately, no data leaks
BabyAGI: an Autonomous and Self-Improving agent, or BASI
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
C# wrapper for the Sketchfab API (Unity)
Vid2Avatar: 3D Avatar Reconstruction from Videos in the Wild via Self-supervised Scene Decomposition (CVPR2023)
Algorithms and Data Structures implemented in JavaScript for beginners, following best practices.
Curated list of awesome tools, demos, docs for ChatGPT and GPT-3
A Low Profile Component Framework – Stable, minimal, easy to audit, zero-dependencies and build-tool-free.
[SIGGRAPH Asia 2022] IDE-3D: Interactive Disentangled Editing For High-Resolution 3D-aware Portrait Synthesis
📡
An application to listen to broadcast stereo FM and AM radio from your Chrome browser or your ChromeBook computer using a $15 USB digital TV tuner.