A time-saving script for any llama.cpp/LoRA workflow: it merges a LoRA adapter into its base model, converts the result to GGUF format, and applies post-training quantization.
- Clone this repository:

  ```shell
  git clone https://github.com/georgepullen/Merge-LoRA-Into-GGUF.git .
  ```

- Clone the llama.cpp repository:

  ```shell
  git clone https://github.com/ggerganov/llama.cpp.git
  ```

- Install the requirements for both repositories:

  ```shell
  pip install -r requirements.txt
  pip install -r merge_lora_requirements.txt
  ```
Arguments:

- `-l`: Path to LoRA (HF repo ID or local path)
- `-b`: Path to base model (HF repo ID or local path)
- `-f`: Format (`F32`, `F16`, `q6_k`, `q4_0`, etc.)
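The flag interface above can be sketched with `argparse`. This is a minimal illustration of how such a CLI might be wired up, not the repository's actual implementation; the long option names (`--lora`, `--base`, `--format`) and the default format are assumptions for the example.

```python
import argparse

def build_parser():
    # Hypothetical reconstruction of the script's CLI, based on the
    # -l / -b / -f flags documented above.
    p = argparse.ArgumentParser(
        description="Merge a LoRA into a base model, convert to GGUF, and quantize"
    )
    p.add_argument("-l", "--lora", required=True,
                   help="Path to LoRA (HF repo ID or local path)")
    p.add_argument("-b", "--base", required=True,
                   help="Path to base model (HF repo ID or local path)")
    p.add_argument("-f", "--format", default="F16",
                   help="Output format: F32, F16, q6_k, q4_0, etc.")
    return p

# Example invocation with placeholder repo IDs.
args = build_parser().parse_args(
    ["-l", "my-user/my-lora", "-b", "my-user/my-base-model", "-f", "q4_0"]
)
print(args.lora, args.base, args.format)
```

A real run would then pass `args` on to the merge, GGUF-conversion, and quantization steps described at the top of this README.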