curlora
v4.0.0
Muhammad Fawi
Code DOI:
Research preprint:
This repo contains the code of the CURLoRA research paper, a novel approach to fine-tuning large language models (LLMs) that leverages CUR matrix decomposition in the context of Low-Rank Adaptation (LoRA). Our method addresses two critical challenges in LLM fine-tuning: mitigating catastrophic forgetting during continual learning and reducing the number of trainable parameters. We propose a unique modification to the CUR decomposition process that makes adapting LLMs to new tasks more efficient and stable without compromising existing knowledge. Through experiments on multiple datasets, we demonstrate that CURLoRA outperforms standard LoRA in mitigating catastrophic forgetting. It maintains model stability and performance across tasks while significantly reducing the number of trainable parameters. Our results show that CURLoRA achieves superior accuracy and perplexity scores compared to LoRA, especially with limited data.
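As background, classic CUR approximates a weight matrix as the product of sampled columns C, sampled rows R, and a small linking matrix U. CURLoRA's specific modification (how C and R are selected, and training only a zero-initialized U so fine-tuning starts exactly at the pretrained weights) is detailed in the paper; the NumPy sketch below only illustrates the general shape of the factorization and is not the repo's implementation.

```python
import numpy as np

def cur_sketch(A, rank, seed=0):
    # Illustrative CUR sketch. Classic CUR samples columns/rows with
    # probabilities proportional to their squared norms; CURLoRA modifies
    # this selection -- see the paper for the exact scheme.
    rng = np.random.default_rng(seed)
    col_p = (A ** 2).sum(axis=0) / (A ** 2).sum()
    row_p = (A ** 2).sum(axis=1) / (A ** 2).sum()
    cols = rng.choice(A.shape[1], size=rank, replace=False, p=col_p)
    rows = rng.choice(A.shape[0], size=rank, replace=False, p=row_p)
    C, R = A[:, cols], A[rows, :]
    # Only U is trained in CURLoRA; zero init means the adapted layer
    # initially adds nothing to the frozen pretrained weight.
    U = np.zeros((rank, rank))
    return C, U, R

A = np.random.default_rng(1).normal(size=(16, 12))
C, U, R = cur_sketch(A, rank=4)
print(C.shape, U.shape, R.shape)   # (16, 4) (4, 4) (4, 12)
print(np.allclose(C @ U @ R, 0))   # True: zero-init U contributes nothing yet
```

Because C and R stay fixed, the trainable parameter count is just rank × rank per adapted layer.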
- CURLoRA.pdf: the research paper detailing the CURLoRA approach.
- code/: directory containing the CURLoRA and experiment implementations.
- code/curlora.py: the CURLoRA classes.
- code/utils.py: helper functions.
- code/lora.py: the LoRA classes.
- code/curlora_experiment.ipynb: Mistral 7B experiments (fine-tuning on MRPC, SST-2, and Sentiment140).
- code/curlora_experiment-gpt.ipynb: GPT2-Large experiments (fine-tuning on MRPC, SST-2, and Sentiment140).
- code/squad_gpt-curlora.ipynb: fine-tuning GPT2-Large for Q&A on the SQuAD dataset with CURLoRA and SFTTrainer.

First we install the requirements:
pip3 install -r code/requirements.txt

All CURLoRA helper functions and classes are defined in code/curlora.py and code/utils.py.
Load the model and apply CURLoRA
from transformers import AutoTokenizer, AutoModelForCausalLM
from utils import *

model_name = "gpt2-large"
model = AutoModelForCausalLM.from_pretrained(model_name)
model.to("cuda")  # this moves all existing layers to CUDA

# turn off grad for all layers
for param in model.parameters():
    param.requires_grad = False

# replace original Q,K,V layers with CURLoRA (GPT2-Large specific)
# refer to utils.py for a more general way
for name, module in model.named_modules():
    if isinstance(module, type(model.transformer.h[0].attn)):
        # rank = 24, alpha = 1
        module.c_attn = LinearWithCURLoRA(module.c_attn, 24, 1)

# now look at how many CURLoRA parameters will be trained
total_params = sum(p.numel() for p in model.parameters() if p.requires_grad)
print(f"Total trainable parameters after: {total_params:,}")

# make sure the CURLoRA layers are on CUDA as well
model.to("cuda")

You now have a model with CURLoRA layers applied to the attention layers (query, key, and value), and you can use it for fine-tuning or for normal inference.
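The freezing pattern above is what makes fine-tuning cheap: only the adapter parameters carry gradients, so an optimizer built over the requires_grad parameters touches nothing else. A minimal toy sketch of that behavior (plain nn.Linear stand-ins, not the repo's LinearWithCURLoRA class):

```python
import torch
import torch.nn as nn

# Freeze a "pretrained" layer, attach a small trainable stand-in adapter,
# and verify only the adapter receives gradients.
base = nn.Linear(8, 8)
for p in base.parameters():
    p.requires_grad = False

adapter = nn.Linear(8, 8, bias=False)  # stands in for the trainable factor
params = [p for p in adapter.parameters() if p.requires_grad]
opt = torch.optim.AdamW(params, lr=1e-3)

x = torch.randn(4, 8)
loss = (base(x) + adapter(x)).pow(2).mean()
loss.backward()
opt.step()

print(base.weight.grad is None)        # True: the frozen base gets no gradient
print(adapter.weight.grad is not None) # True: only the adapter is updated
```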
You may need to know what the layers are called so that they can be replaced correctly. For example, Q, K, and V in Mistral can be found as follows.
for name, module in model.named_children():
    if any(l in name for l in ["q_proj", "v_proj", "k_proj"]):
        setattr(model, name, LinearWithCURLoRA(module, rank, alpha))

Please note:
This project is licensed under the MIT License - see the LICENSE file for details.
If you find the CURLoRA research or code helpful, please consider citing them.
@software{Fawi_CURLoRA_Leveraging_CUR_2024,
  author = {Fawi, Muhammad},
  title = {{CURLoRA: Leveraging CUR Matrix Decomposition for
            Stable LLM Continual Fine-Tuning and Catastrophic
            Forgetting Mitigation}},
  month = jul,
  year = 2024,
  publisher = {Zenodo},
  version = {v4.0.0},
  doi = {10.5281/zenodo.12729738},
  url = {https://zenodo.org/doi/10.5281/zenodo.12729738}
}

Fawi, M. (2024). CURLoRA: Leveraging CUR Matrix Decomposition for Stable LLM Continual Fine-Tuning and Catastrophic Forgetting Mitigation (v4.0.0) [Computer software]. Zenodo. https://doi.org/10.5281/zenodo.12729738
@misc{fawi_2024_12730055,
  author = {Fawi, Muhammad},
  title = {{CURLoRA: Leveraging CUR Matrix Decomposition for
            Stable LLM Continual Fine-Tuning and Catastrophic
            Forgetting Mitigation}},
  month = jul,
  year = 2024,
  publisher = {Zenodo},
  doi = {10.5281/zenodo.12730055},
  url = {https://doi.org/10.5281/zenodo.12730055}
}

Fawi, M. (2024). CURLoRA: Leveraging CUR Matrix Decomposition for Stable LLM Continual Fine-Tuning and Catastrophic Forgetting Mitigation. Zenodo. https://doi.org/10.5281/zenodo.12730055
Contributions and ideas are much appreciated.