A general fine-tuning kit geared toward diffusion models.

Python 2,598 253 Updated Nov 12, 2025

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.

Python 31,577 6,497 Updated Nov 12, 2025

Official repository for the paper PLLaVA

Python 671 44 Updated Jul 28, 2024

Accepted as [NeurIPS 2024] Spotlight Presentation Paper

Jupyter Notebook 6,356 649 Updated Sep 26, 2024

A comprehensive list of papers using large language/multi-modal models for Robotics/RL, including papers, code, and related websites

4,091 318 Updated Oct 17, 2025

We introduce a novel approach for parameter generation, named neural network parameter diffusion (p-diff), which employs a standard latent diffusion model to synthesize a new set of parameters

Python 884 51 Updated Jan 3, 2025

[CVPR'24 Best Student Paper] Mip-Splatting: Alias-free 3D Gaussian Splatting

Python 1,346 103 Updated Dec 17, 2024

Navigate dreamscapes with a click – your chosen point guides the drone’s flight in a thrilling visual journey.

Python 47 3 Updated Sep 2, 2025

A curated list for Efficient Large Language Models

Python 1,894 145 Updated Jun 17, 2025

(CVPR 2023) Official implementation of the paper "Weakly Supervised Video Representation Learning with Unaligned Text for Sequential Videos"

Python 31 4 Updated Apr 2, 2024

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Python 8,017 719 Updated May 31, 2024

Official Code for DragGAN (SIGGRAPH 2023)

Python 35,989 3,445 Updated May 18, 2024

Official repo for consistency models.

Python 6,436 434 Updated Mar 22, 2024

General AI methods for Anything: AnyObject, AnyGeneration, AnyModel, AnyTask, AnyX

1,824 100 Updated Nov 15, 2023

A curated list of Composable AI methods: Building AI systems by composing modules.

197 5 Updated Nov 24, 2023

[ICCV 2023] TM2D: Bimodality Driven 3D Dance Generation via Music-Text Integration

Python 101 5 Updated Mar 4, 2024

Implementation of GigaGAN, new SOTA GAN out of Adobe. Culmination of nearly a decade of research into GANs

Python 1,926 109 Updated Jan 12, 2025

Transfer the ControlNet with any basemodel in diffusers🔥

Python 843 52 Updated Apr 23, 2023

Lossless Training Speed Up by Unbiased Dynamic Data Pruning

Python 340 20 Updated Sep 24, 2024

🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

Python 9,284 1,222 Updated Nov 10, 2025

Let us control diffusion models!

Python 33,284 2,979 Updated Feb 25, 2024

A collection of parameter-efficient transfer learning papers focusing on computer vision and multimodal domains.

410 26 Updated Sep 26, 2024

OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.

Python 37,495 3,300 Updated Aug 17, 2024

Using Low-rank adaptation to quickly fine-tune diffusion models.

Jupyter Notebook 7,470 495 Updated Mar 22, 2024

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 49,420 8,281 Updated Nov 12, 2025

A Unified Framework for Surface Reconstruction

Python 2,082 196 Updated Jul 11, 2024

[CVPR 2023] iQuery: Instruments as Queries for Audio-Visual Sound Separation

Python 71 1 Updated Jul 25, 2023

[ICCV2023 Best Paper Finalist] PyTorch implementation of DiffusionDet (https://arxiv.org/abs/2211.09788)

Python 2,228 176 Updated Dec 22, 2022

MetaFormer Baselines for Vision (TPAMI 2024)

Python 492 31 Updated Jun 1, 2024