Lists (2)
Sort Name ascending (A-Z)
Stars
Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.
Survey on LLM Agents (Published on CoLing 2025)
[ICML 2025] "SepLLM: Accelerate Large Language Models by Compressing One Segment into One Separator"
[NeurIPS 2024] An official implementation of "ShareGPT4Video: Improving Video Understanding and Generation with Better Captions"
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions
Open-Sora: Democratizing Efficient Video Production for All
[ICCV 2023] ProPainter: Improving Propagation and Transformer for Video Inpainting
Nightly release of ControlNet 1.1
one for all, Optimal generator with No Exception
本文原文由知名 Hacker Eric S. Raymond 所撰寫,教你如何正確的提出技術問題並獲得你滿意的答案。
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
Implementation of GigaGAN, new SOTA GAN out of Adobe. Culmination of nearly a decade of research into GANs
[ICML'23] StyleGAN-T: Unlocking the Power of GANs for Fast Large-Scale Text-to-Image Synthesis
Official repo for consistency models.
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
🐙 Guides, papers, lessons, notebooks and resources for prompt engineering, context engineering, RAG, and AI Agents.
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
基于 OpenAI API 的文本翻译、文本润色、语法纠错 Bob 插件,让我们一起迎接不需要巴别塔的新时代!Licensed under CC BY-NC-SA 4.0
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language models