hina hasaki321

6 followers · 9 following

Lanzhou University
Lanzhou
https://hrshome.site

Stars

FoundationVision / InfinityStar

[NeurIPS 2025 Oral] Infinity⭐️: Unified Spacetime AutoRegressive Modeling for Visual Generation

Python 495 17 Updated Nov 12, 2025

Berkeley-Speech-Group / sylber

Sylber: Syllabic Embedding Representation of Speech from Raw Audio

Jupyter Notebook 68 4 Updated Mar 17, 2025

MoonshotAI / Kimi-Audio

Kimi-Audio, an open-source audio foundation model excelling in audio understanding, generation, and conversation

Python 4,356 315 Updated Jun 21, 2025

Tobertz-max / DiFlow-TTS

DiFlow-TTS delivers low-latency zero-shot TTS via discrete flow matching and factorized speech tokens. A compact, open framework for fast voice synthesis.🐙

Python 46 4 Updated Nov 17, 2025

lxa9867 / ImageFolder

High-performance Image Tokenizers for VAR and AR

Python 297 6 Updated Apr 25, 2025

sihyun-yu / REPA

[ICLR'25 Oral] Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think

Python 1,413 64 Updated Mar 16, 2025

amir84ferdos / ComfyUI-GRAG-ArchAi3D

Advanced GRAG implementation for ComfyUI with beginner-friendly and expert modes

Python 12 2 Updated Nov 6, 2025

little-misfit / GRAG-Image-Editing

https://little-misfit.github.io/GRAG-Image-Editing/

Python 102 2 Updated Nov 5, 2025

shaochenze / calm

Official implementation of "Continuous Autoregressive Language Models"

Python 582 71 Updated Nov 10, 2025

amphionspace / FlexiCodec

FlexiCodec: A Dynamic Neural Audio Codec for Low Frame Rates

Python 32 3 Updated Nov 4, 2025

xg-chu / ARTalk

ARTalk generates realistic 3D head motions (lip sync, blinking, expressions, head poses) from audio in ⚡ real-time ⚡.

Python 100 16 Updated Jun 12, 2025

AlonzoLeeeooo / awesome-text-to-image-studies

A collection of awesome text-to-image generation studies.

TeX 702 35 Updated Oct 23, 2025

LTH14 / fractalgen

PyTorch implementation of FractalGen https://arxiv.org/abs/2502.17437

Python 1,193 65 Updated Feb 25, 2025

lxa9867 / Awesome-Autoregressive-Visual-Generation

This is a repo to track the latest autoregressive visual generation papers.

409 5 Updated Jun 25, 2025

LTH14 / mar

PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838

Python 1,787 109 Updated Sep 27, 2024

huang-yh / SpectralAR

[ICCV 25] SpectralAR: Spectral Autoregressive Visual Generation

35 1 Updated Jun 13, 2025

ApexGen-X / MergeVQ

[CVPR] MergeVQ: A Unified Framework for Visual Generation and Representation with Token Merging and Quantization

Python 46 3 Updated Jul 22, 2025

guolinke / SphereAR

Implementation of "Hyperspherical Latents Improve Continuous-Token Autoregressive"

Python 77 6 Updated Nov 15, 2025

FoundationVision / VAR

[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". A…

Jupyter Notebook 8,479 546 Updated Nov 10, 2025