- Burbank, CA, USA
- https://mrpowers.com
Stars
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
Hackable and optimized Transformers building blocks, supporting a composable construction.
Access large language models from the command-line
Add long-term memory to any AI in minutes. Self-hosted, open, and framework-free.
Modular Multi-Agent System for Scientific Research Assistance
Lyra: Generative 3D Scene Reconstruction via Video Diffusion Model Self-Distillation
Cosmos-Transfer1-DiffusionRenderer: High-quality video de-lighting and re-lighting based on Cosmos video diffusion framework
[CVPR'25 Oral] Official implementation for "DiffusionRenderer: Neural Inverse and Forward Rendering with Video Diffusion Models"
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
Latent Diffusion Model made of public domain images (CC-0).
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
Iterable datapipelines for pytorch training.
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer
PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation
Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.
Codebase for Aria - an Open Multimodal Native MoE
Typescript/React Library for AI Chat💬🚀
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
A WebRTC server that allows you to interact with an LLM using your speech and responds back with generated audio.
[ACL 2023] One Embedder, Any Task: Instruction-Finetuned Text Embeddings
Local Deep Research achieves ~95% on SimpleQA benchmark (tested with GPT-4.1-mini). Supports local and cloud LLMs (Ollama, Google, Anthropic, ...). Searches 10+ sources - arXiv, PubMed, web, and yo…
verl: Volcano Engine Reinforcement Learning for LLMs
DSPy: The framework for programming—not prompting—language models
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
An open-source alternative to OpenAI and Gemini's deep research.