Skip to content
View dongzelian's full-sized avatar

Block or report dongzelian

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A general fine-tuning kit geared toward image/video/audio diffusion models.

Python 2,706 266 Updated Jan 9, 2026

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.

Python 32,389 6,674 Updated Jan 9, 2026

Official repository for the paper PLLaVA

Python 677 45 Updated Jul 28, 2024

Accepted as [NeurIPS 2024] Spotlight Presentation Paper

Jupyter Notebook 6,378 651 Updated Sep 26, 2024

A comprehensive list of papers using large language/multi-modal models for Robotics/RL, including papers, codes, and related websites

4,192 322 Updated Dec 3, 2025

We introduce a novel approach for parameter generation, named neural network parameter diffusion (p-diff), which employs a standard latent diffusion model to synthesize a new set of parameters

Python 886 52 Updated Jan 3, 2025

[CVPR'24 Best Student Paper] Mip-Splatting: Alias-free 3D Gaussian Splatting

Python 1,383 108 Updated Dec 17, 2024

Navigate dreamscapes with a click – your chosen point guides the drone’s flight in a thrilling visual journey.

Python 48 3 Updated Sep 2, 2025

A curated list for Efficient Large Language Models

Python 1,926 148 Updated Jun 17, 2025

(CVPR 2023) Official implemention of the paper "Weakly Supervised Video Representation Learning with Unaligned Text for Sequential Videos"

Python 31 4 Updated Apr 2, 2024

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Python 8,256 742 Updated May 31, 2024

Official Code for DragGAN (SIGGRAPH 2023)

Python 35,993 3,446 Updated May 18, 2024

Official repo for consistency models.

Python 6,464 435 Updated Mar 22, 2024

General AI methods for Anything: AnyObject, AnyGeneration, AnyModel, AnyTask, AnyX

1,836 101 Updated Nov 15, 2023

A curated list of Composable AI methods: Building AI system by composing modules.

197 5 Updated Nov 24, 2023

[ICCV 2023] TM2D: Bimodality Driven 3D Dance Generation via Music-Text Integration

Python 102 6 Updated Mar 4, 2024

Implementation of GigaGAN, new SOTA GAN out of Adobe. Culmination of nearly a decade of research into GANs

Python 1,933 109 Updated Jan 12, 2025

Transfer the ControlNet with any basemodel in diffusers🔥

Python 845 53 Updated Apr 23, 2023

Lossless Training Speed Up by Unbiased Dynamic Data Pruning

Python 341 20 Updated Sep 24, 2024

🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

Python 9,427 1,260 Updated Jan 7, 2026

Let us control diffusion models!

Python 33,529 3,001 Updated Feb 25, 2024

A collection of parameter-efficient transfer learning papers focusing on computer vision and multimodal domains.

410 26 Updated Sep 26, 2024

OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.

Python 37,478 3,299 Updated Aug 17, 2024

Using Low-rank adaptation to quickly fine-tune diffusion models.

Jupyter Notebook 7,507 499 Updated Mar 22, 2024

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 51,809 8,688 Updated Nov 12, 2025

A Unified Framework for Surface Reconstruction

Python 2,085 197 Updated Jul 11, 2024

[CVPR 2023] iQuery: Instruments as Queries for Audio-Visual Sound Separation

Python 70 1 Updated Jul 25, 2023

[ICCV2023 Best Paper Finalist] PyTorch implementation of DiffusionDet (https://arxiv.org/abs/2211.09788)

Python 2,234 180 Updated Dec 22, 2022

MetaFormer Baselines for Vision (TPAMI 2024)

Python 496 31 Updated Jun 1, 2024
Next