Stars
🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation
A generative speech model for daily dialogue.
Official Code for DragGAN (SIGGRAPH 2023)
ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型
DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code
High-Resolution Image Synthesis with Latent Diffusion Models
something like visual-chatgpt, 文心一言的开源版
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
A toolkit showing GPU's all-round capability in video processing
CV-CUDA™ is an open-source, GPU accelerated library for cloud-scale image processing and computer vision.
Instant neural graphics primitives: lightning fast NeRF and more
Web-based Cloud Gaming service for Retro Game
Open-source simulator for autonomous driving research.
CUDA Templates and Python DSLs for High-Performance Linear Algebra
Benchmarking Deep Learning operations on different hardware
A latent text-to-image diffusion model
OneFlow is a deep learning framework designed to be user-friendly, scalable and efficient.
Muzic: Music Understanding and Generation with Artificial Intelligence
[ICLR 2023 Spotlight] EVA3D: Compositional 3D Human Generation from 2D Image Collections
StyleFlow: Attribute-conditioned Exploration of StyleGAN-generated Images using Conditional Continuous Normalizing Flows (ACM TOG 2021)
PyTorch package for the discrete VAE used for DALL·E.
A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model (unofficial)
Use caffe to train your own data in just one click