Skip to content
View bollossom's full-sized avatar
🎯
Focusing
🎯
Focusing

Highlights

  • Pro

Block or report bollossom

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A simple, unified multimodal models training engine. Lean, flexible, and built for hacking at scale.

Python 227 7 Updated Oct 27, 2025

DDT: Decoupled Diffusion Transformer

Python 295 15 Updated Aug 22, 2025

Official implementation of "UniLiP: Adapting CLIP for Unified Multimodal Understanding, Generation and Editing"

Python 36 Updated Oct 24, 2025

molly, an LLM designed to understand multi-omics data.

Python 21 Updated Oct 22, 2025

Implementation of "Hyperspherical Latents Improve Continuous-Token Autoregressive"

Python 68 6 Updated Sep 30, 2025

Pytorch implemention of UniFlow

Jupyter Notebook 107 2 Updated Oct 17, 2025

Official PyTorch Implementation of "Diffusion Transformers with Representation Autoencoders"

Python 1,365 32 Updated Oct 15, 2025

Benchmarking Knowledge Transfer in Lifelong Robot Learning

Jupyter Notebook 1,025 210 Updated Mar 15, 2025

ULMEvalKit: One-Stop Eval ToolKit for Image Generation

Python 44 1 Updated Oct 22, 2025

“FlowAR: Scale-wise Autoregressive Image Generation Meets Flow Matching” FlowAR employs a simplest scale design and is compatible with any VAE.

Python 157 4 Updated May 1, 2025

Code release for Ming-UniVision: Joint Image Understanding and Geneation with a Continuous Unified Tokenizer

Python 112 4 Updated Oct 14, 2025

[ICCV 2025] LIRA

Python 17 3 Updated Oct 9, 2025

HunyuanImage-3.0: A Powerful Native Multimodal Model for Image Generation

Python 2,304 96 Updated Oct 14, 2025

WorldVLA: Towards Autoregressive Action World Model

Python 472 21 Updated Oct 10, 2025

Official repo for NeurIPS 2025 poster: Unveiling the Spatial-temporal Effective Receptive Fields of Spiking Neural Networks

Python 3 Updated Oct 24, 2025

[NeurIPS-24] This is the official implementation of the paper "DeepStack: Deeply Stacking Visual Tokens is Surprisingly Simple and Effective for LMMs".

Python 61 3 Updated Jun 17, 2024

[NeurIPS 2025] The official implementation of paper "Safe-Sora: Safe Text-to-Video Generation via Graphical Watermarking"

Python 10 2 Updated Oct 10, 2025

[CVPR2025 Highlight] PAR: Parallelized Autoregressive Visual Generation. https://yuqingwang1029.github.io/PAR-project

Python 178 2 Updated Mar 20, 2025

Sequence Parallelism for Long Training

Python 2 Updated Oct 13, 2025

Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.

Python 4,728 274 Updated Jul 18, 2025

Unified Long Training Codebase

Python 1 Updated Oct 13, 2025

Large World Model -- Modeling Text and Video with Millions Context

Python 7,361 561 Updated Oct 19, 2024

Fully Open Framework for Democratized Multimodal Training

Python 579 40 Updated Oct 21, 2025

Multi-Level Triton Runner supporting Python, IR, PTX, and cubin.

Python 75 1 Updated Oct 22, 2025

[Fully open] [Encoder-free MLLM] Vision as LoRA

Python 341 29 Updated Jun 12, 2025

OneCAT: Decoder-Only Auto-Regressive Model for Unified Understanding and Generation

Python 225 5 Updated Sep 22, 2025
Python 1,140 151 Updated Sep 25, 2025

Code for our paper "Next Visual Granularity Generation".

Python 39 Updated Oct 7, 2025

[ICCV2025]Code Release of Harmonizing Visual Representations for Unified Multimodal Understanding and Generation

Python 177 5 Updated May 21, 2025

Lightning Attention-2: A Free Lunch for Handling Unlimited Sequence Lengths in Large Language Models

Python 330 26 Updated Feb 23, 2025
Next