lora_r is double when converting olora to lora. #2075

Closed
JaheimLee opened this issue Sep 18, 2024 · 4 comments · Fixed by #2077

Comments

@JaheimLee

JaheimLee commented Sep 18, 2024

System Info

  • transformers version: 4.44.2
  • Platform: Linux-5.13.0-30-generic-x86_64-with-glibc2.31
  • Python version: 3.12.4
  • Huggingface_hub version: 0.24.5
  • Safetensors version: 0.4.3
  • Accelerate version: 0.34.0
  • Accelerate config: not found
  • PyTorch version (GPU?): 2.4.0+cu121 (True)
  • Tensorflow version (GPU?): not installed (NA)
  • Flax version (CPU?/GPU?/TPU?): not installed (NA)
  • Jax version: not installed
  • JaxLib version: not installed
  • Using distributed or parallel set-up in script?:
  • Using GPU in script?:
  • GPU type: NVIDIA GeForce RTX 3090
  • Peft: 0.12.0

Who can help?

No response

Information

  • The official example scripts
  • My own modified scripts

Tasks

  • An officially supported task in the examples folder
  • My own task or dataset (give details below)

Reproduction

import os
os.environ["CUDA_VISIBLE_DEVICES"] = "0"
from transformers import AutoModel
from peft import get_peft_model, LoraConfig

base_model = AutoModel.from_pretrained("facebook/opt-350m")
olora_config = LoraConfig(
    r=16,
    lora_alpha=32,
    lora_dropout=0.05,
    target_modules='all-linear',
    init_lora_weights='olora',
)
olora_model = get_peft_model(base_model, olora_config)
init_path = './tmp/init'
olora_model.save_pretrained(init_path) # Save the model *before* performing any training

# Train the model
# train(olora_model) # Your training loop

# Save the model after training, converting the OLoRA adapter to a plain LoRA adapter
olora_model.save_pretrained('./tmp/lora', path_initial_model_for_weight_conversion=init_path)
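For reference, a minimal sketch of how the converted adapter would then be loaded (my own illustration, not part of the original report; paths match the script above):

from transformers import AutoModel
from peft import PeftModel

# The converted adapter applies on top of the *original* base weights,
# so the unmodified pretrained model can be used directly.
base_model = AutoModel.from_pretrained("facebook/opt-350m")
model = PeftModel.from_pretrained(base_model, './tmp/lora')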

Expected behavior

The lora_r of the init adapter is 16.

{
  "alpha_pattern": {},
  "auto_mapping": null,
  "base_model_name_or_path": "facebook/opt-350m",
  "bias": "none",
  "fan_in_fan_out": false,
  "inference_mode": false,
  "init_lora_weights": true,
  "layer_replication": null,
  "layers_pattern": null,
  "layers_to_transform": null,
  "loftq_config": {},
  "lora_alpha": 32,
  "lora_dropout": 0.05,
  "megatron_config": null,
  "megatron_core": "megatron.core",
  "modules_to_save": null,
  "peft_type": "LORA",
  "r": 16,
  "rank_pattern": {},
  "revision": null,
  "target_modules": [
    "k_proj",
    "q_proj",
    "fc1",
    "out_proj",
    "project_out",
    "project_in",
    "v_proj",
    "fc2"
  ],
  "task_type": null,
  "use_dora": false,
  "use_rslora": false
}

But the converted one is 32.

{
  "alpha_pattern": {},
  "auto_mapping": {
    "base_model_class": "OPTModel",
    "parent_library": "transformers.models.opt.modeling_opt"
  },
  "base_model_name_or_path": "facebook/opt-350m",
  "bias": "none",
  "fan_in_fan_out": false,
  "inference_mode": true,
  "init_lora_weights": true,
  "layer_replication": null,
  "layers_pattern": null,
  "layers_to_transform": null,
  "loftq_config": {},
  "lora_alpha": 64,
  "lora_dropout": 0.05,
  "megatron_config": null,
  "megatron_core": "megatron.core",
  "modules_to_save": null,
  "peft_type": "LORA",
  "r": 32,
  "rank_pattern": {},
  "revision": null,
  "target_modules": [
    "k_proj",
    "q_proj",
    "fc1",
    "out_proj",
    "project_out",
    "project_in",
    "v_proj",
    "fc2"
  ],
  "task_type": null,
  "use_dora": false,
  "use_rslora": false
}

The model size is also doubled.
Is this expected?

@JaheimLee JaheimLee changed the title lora r is double when converting olora to lora. lora_r is double when converting olora to lora. Sep 18, 2024
@BenjaminBossan (Member)

Yes, this is expected. Methods like OLoRA modify the base weights too. To convert the OLoRA weights to LoRA weights, it must be ensured that the adapter still works on top of the original, unmodified base weights. This is only possible by folding the base-weight modification into the OLoRA weights, which doubles their size. The reason is not quite straightforward, but it's explained here (the explanation is for LoftQ, but the same idea applies to OLoRA).

Ping @tokenizer-decode for info.
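To illustrate the intuition, here is a minimal numerical sketch (my own illustration of the idea, not PEFT's actual conversion code; the scaling factor lora_alpha / r is omitted for simplicity). OLoRA replaces the base weight W with W - B_init @ A_init, so an adapter meant to apply to the original W has to absorb that difference, and stacking the two rank-r factor pairs yields a rank-2r adapter:

import torch

d, k, r = 64, 32, 16
W = torch.randn(d, k)                                  # original base weight
B_init, A_init = torch.randn(d, r), torch.randn(r, k)  # initial factors (real OLoRA derives these via QR)
W_mut = W - B_init @ A_init                            # mutated base weight stored by the OLoRA model

B, A = torch.randn(d, r), torch.randn(r, k)            # factors after training
target = W_mut + B @ A                                 # what the trained OLoRA model computes

# To reproduce the same result on top of the original W, stack the factors:
# B' @ A' = B @ A - B_init @ A_init, which has rank up to 2r.
B_conv = torch.cat([B, B_init], dim=1)                 # shape (d, 2r)
A_conv = torch.cat([A, -A_init], dim=0)                # shape (2r, k)
assert torch.allclose(W + B_conv @ A_conv, target, atol=1e-4)

Since the effective scaling is lora_alpha / r and r doubles, lora_alpha is doubled as well to keep the scaling unchanged, which is why the saved config shows r=32 and lora_alpha=64.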

@JaheimLee (Author)

Got it, thanks for your reply.

@JaheimLee (Author)

Found a new problem: after the conversion, r and lora_alpha are doubled in the config of the still-loaded model as well, not just in the saved config. So it would be better to reset them to r and alpha after saving is finished.

@JaheimLee JaheimLee reopened this Sep 19, 2024
BenjaminBossan added a commit to BenjaminBossan/peft that referenced this issue Sep 19, 2024
Resolves huggingface#2075

When saving PiSSA or OLoRA with the option to convert to normal LoRA,
the LoRA weight shapes change, which means that some values like r and
alpha need to be adjusted in the saved PEFT config. However, these
modifications should be limited to the saved config, while the loaded
config should stay the same.

This PR implements this change by creating a copy of the config before
modifying it.
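A minimal sketch of the idea behind the fix (hypothetical code for illustration; save_with_conversion and the "default" adapter name are my assumptions, not the actual PR diff):

import copy

def save_with_conversion(model, save_dir, init_path):
    # Only the *saved* config reflects the doubled rank and alpha;
    # the model's live config is left untouched.
    config_to_save = copy.deepcopy(model.peft_config["default"])
    config_to_save.r *= 2
    config_to_save.lora_alpha *= 2
    # ... convert the weights relative to init_path and write config_to_save ...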
@BenjaminBossan (Member)

Good point @JaheimLee, I created a PR to address that: #2077.

BenjaminBossan added a commit that referenced this issue Sep 20, 2024 (same commit message as above; resolves #2075)