Stars
SLED: Self Logits Evolution Decoding for Improving Factuality in Large Language Models https://arxiv.org/pdf/2411.02433
[CVPR 2024 Highlight] Style Injection in Diffusion: A Training-free Approach for Adapting Large-scale Diffusion Models for Style Transfer
Official PyTorch Implementation of "Diffusion Transformers with Representation Autoencoders"
[NeurIPS 2025] This is the official repository for VL-SAE: Interpreting and Enhancing Vision-Language Alignment with a Unified Concept Set
[NeurIPS 2025 Spotlight 🔥] Official implementation of 🛸 "UFO: A Unified Approach to Fine-grained Visual Perception via Open-ended Language Interface"
The official implementation of the paper SAEdit: Token-level control for continuous image editing via Sparse AutoEncoder
[ICML 2025] Unlearning in Diffusion Models using Sparse Autoencoders
The First to Know: How Token Distributions Reveal Hidden Knowledge in Large Vision-Language Models?
[ICLR '25] Official Pytorch implementation of "Interpreting and Editing Vision-Language Representations to Mitigate Hallucinations"
[NeurIPS 2025] The official PyTorch implementation of the "Vision Function Layer in MLLM".
[ACL 2025] Beyond Prompt Engineering: Robust Behavior Control in LLMs via Steering Target Atoms
This repository contains the code for the experiments in the paper "An Information-theoretic Metric of Transferability for Task Transfer Learning"
Training Sparse Autoencoders on Language Models
A framework that lets you apply Sparse AutoEncoders to any model
[ICCV 2025] Auto Interpretation Pipeline and many other functionalities for Multimodal SAE Analysis.
Code for a project on assessing the faithfulness of LLMs
Code and data accompanying our arXiv paper "Faithful Chain-of-Thought Reasoning".
One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks
Open-source evaluation toolkit for large multi-modality models (LMMs), supporting 220+ LMMs and 80+ benchmarks
[ICCV25 Oral] Token Activation Map to Visually Explain Multimodal LLMs
This repository is the official implementation of "Look-Back: Implicit Visual Re-focusing in MLLM Reasoning".
The first paper to explore how to effectively use R1-like RL for MLLMs; it introduces Vision-R1, a reasoning MLLM that leverages cold-start initialization and RL training to incentivize reasoning capability.
Kimi-VL: Mixture-of-Experts Vision-Language Model for Multimodal Reasoning, Long-Context Understanding, and Strong Agent Capabilities
R1-onevision, a visual language model capable of deep CoT reasoning.