The official PyTorch implementation of "LLaVA-SP: Enhancing Visual Representation with Visual Spatial Tokens for MLLMs" [Paper]
The LLaVA-SP implementation changes are in llava_arch.py, clip_encoder.py, llava_trainer.py, and train.py.
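As a rough, hypothetical illustration of the general idea (not the authors' code; the actual LLaVA-SP logic lives in the files above), extra learned tokens could be appended to the CLIP patch features before they reach the projector and LLM:

```python
# Hypothetical sketch only: append learned "spatial" tokens to CLIP patch
# features. See clip_encoder.py / llava_arch.py for the real LLaVA-SP method.
import torch
import torch.nn as nn

class SpatialTokenAdapter(nn.Module):
    def __init__(self, hidden_dim: int, num_spatial_tokens: int = 6):
        super().__init__()
        # Learned embeddings standing in for the visual spatial tokens.
        self.spatial_tokens = nn.Parameter(
            torch.randn(num_spatial_tokens, hidden_dim) * 0.02
        )

    def forward(self, patch_features: torch.Tensor) -> torch.Tensor:
        # patch_features: (batch, num_patches, hidden_dim) from the CLIP encoder.
        batch = patch_features.size(0)
        tokens = self.spatial_tokens.unsqueeze(0).expand(batch, -1, -1)
        # Concatenate so the projector/LLM sees the extra tokens alongside
        # the ordinary patch features.
        return torch.cat([patch_features, tokens], dim=1)

features = torch.randn(2, 576, 1024)   # e.g. CLIP ViT-L/14 at 336px: 24x24 patches
adapter = SpatialTokenAdapter(hidden_dim=1024)
print(adapter(features).shape)         # torch.Size([2, 582, 1024])
```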
Please see https://github.com/haotian-liu/LLaVA/ for setup and usage instructions.
Please check out https://huggingface.co/Levideus/models for all public LLaVA-SP checkpoints.
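A minimal loading sketch, assuming this fork keeps upstream LLaVA's model-builder interface (both paths below are placeholders):

```python
# Sketch: load a LLaVA-SP LoRA checkpoint on top of its Vicuna base model.
# Assumes upstream LLaVA's builder interface is unchanged in this fork.
from llava.model.builder import load_pretrained_model
from llava.mm_utils import get_model_name_from_path

model_path = "/path/llava-sp-cropping-lora"   # LLaVA-SP LoRA weights
model_base = "/path/vicuna-1.5-7b"            # base LLM the LoRA was trained on

tokenizer, model, image_processor, context_len = load_pretrained_model(
    model_path=model_path,
    model_base=model_base,
    model_name=get_model_name_from_path(model_path),
)
```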
```bash
python llava/eval/run_llava.py \
    --model_path /path/llava-sp-cropping-lora \
    --model_base /path/vicuna-1.5-7b
```
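Note that upstream run_llava.py also expects an image and a text query. The same evaluation can be driven from Python; a sketch assuming this fork keeps upstream LLaVA's eval_model interface, with the prompt and image path as placeholders:

```python
# Sketch: call the evaluation entry point programmatically, following the
# upstream LLaVA pattern. Prompt and image path below are placeholders.
from llava.eval.run_llava import eval_model
from llava.mm_utils import get_model_name_from_path

model_path = "/path/llava-sp-cropping-lora"

args = type("Args", (), {
    "model_path": model_path,
    "model_base": "/path/vicuna-1.5-7b",
    "model_name": get_model_name_from_path(model_path),
    "query": "Describe the spatial layout of this image.",
    "conv_mode": None,
    "image_file": "/path/example.jpg",
    "sep": ",",
    "temperature": 0,
    "top_p": None,
    "num_beams": 1,
    "max_new_tokens": 512,
})()

eval_model(args)
```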
If you find LLaVA-SP useful for your research and applications, please cite using this BibTeX:
```bibtex
@misc{lou2025llavasp,
      title={LLaVA-SP: Enhancing Visual Representation with Visual Spatial Tokens for MLLMs},
      author={Lou, Haoran and Fan, Chunxiao and Liu, Ziyan and Wu, Yuexin and Wang, Xinliang},
      eprint={2507.00505},
      archivePrefix={arXiv},
      year={2025}
}
```