LLM Finetuning Toolkit

v0.2.3

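Overview

LLM Finetuning Toolkit is a config-based CLI tool for launching a series of LLM fine-tuning experiments on your data and gathering their results. From a single YAML config file, you control every element of a typical experimentation pipeline: prompts, open-source LLMs, optimization strategy, and LLM testing.

Installation

pipx (recommended)

pipx installs the package and its dependencies in a separate virtual environment.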

pipx install llm-toolkit

pip

pip install llm-toolkit

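Quick Start

This guide has 3 stages that help you get the most out of this toolkit.

Basic: Run your first LLM fine-tuning experiment
Intermediate: Run a custom experiment by changing components of the YAML configuration file
Advanced: Launch a series of fine-tuning experiments across different prompt templates, LLMs, and optimization techniques, all through a single YAML configuration file

Basic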

llmtune generate config
llmtune run ./config.yml

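The first command generates a starter config.yml file and saves it in the current working directory. It gives you a quick starting point and a base for further modification.

The second command then starts the fine-tuning process using the settings specified in that default YAML configuration file, config.yml.

Intermediate

The configuration file is the central piece that defines the behavior of the toolkit. It is written in YAML and consists of several sections that control different aspects of the process, such as data ingestion, model definition, training, inference, and quality assurance. We highlight some of the key sections below.

Flash Attention 2

To enable Flash Attention for supported models, first install flash-attn: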

pipx

pipx inject llm-toolkit flash-attn --pip-args=--no-build-isolation

pip

 pip install flash-attn --no-build-isolation

Then add the following to the config file.

model:
  torch_dtype: "bfloat16" # or "float16" if using an older GPU
  attn_implementation: "flash_attention_2"

Data Ingestion

An example of what the data ingestion section may look like:

data:
  file_type: "huggingface"
  path: "yahma/alpaca-cleaned"
  prompt: >-
    ### Instruction: {instruction}
    ### Input: {input}
    ### Output:
  prompt_stub: >-
    {output}
  test_size: 0.1 # Fraction of the data used for testing; if an integer, the number of samples
  train_size: 0.9 # Fraction of the data used for training; if an integer, the number of samples
  train_test_split_seed: 42

While the example above uses a public dataset from Hugging Face, the config file can also ingest your own data.

  file_type: "json"
  path: "<path to your data file>"

  file_type: "csv"
  path: "<path to your data file>"
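The test_size and train_size settings accept either a fraction or an integer, as the config comments note. A minimal sketch of how such a value could be turned into a sample count is shown here; the function name resolve_split_size and the dataset size are hypothetical, and the toolkit's own handling may differ.

```python
def resolve_split_size(size, total):
    """Turn a test_size/train_size setting into a sample count.

    Assumed rule (taken from the config comments): a float is a
    fraction of the dataset, an int is an absolute number of samples.
    """
    if isinstance(size, bool):
        raise TypeError("size must be a float or an int")
    if isinstance(size, float):
        if not 0.0 < size < 1.0:
            raise ValueError("fractional size must be between 0 and 1")
        return round(size * total)
    if isinstance(size, int):
        if not 0 < size <= total:
            raise ValueError("integer size must be between 1 and total")
        return size
    raise TypeError("size must be a float or an int")


total_rows = 1000  # hypothetical dataset size
print(resolve_split_size(0.1, total_rows))  # fraction: 10% of the rows
print(resolve_split_size(250, total_rows))  # integer: exactly 250 rows
```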

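The prompt fields define the instructions used to fine-tune the LLM. They read data from the dataset columns named inside {} brackets. In the example provided, the data file is expected to have the columns instruction, input, and output.
During fine-tuning, both prompt and prompt_stub are used. During testing, only the prompt section is given to the fine-tuned LLM as input.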
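The difference between the training text and the test input can be sketched in a few lines. This is an illustration only: the helper names, the sample row, and the single space used to join prompt and prompt_stub are assumptions, not the toolkit's actual implementation.

```python
# Template strings mirroring the prompt and prompt_stub config fields.
PROMPT = "### Instruction: {instruction} ### Input: {input} ### Output:"
PROMPT_STUB = "{output}"


def build_training_text(row):
    """Fine-tuning text: the filled prompt followed by the expected completion."""
    # Joining with one space is an assumption made for this sketch.
    return PROMPT.format(**row) + " " + PROMPT_STUB.format(**row)


def build_test_input(row):
    """Test input: the filled prompt only; the model must generate the rest."""
    return PROMPT.format(**row)


# Hypothetical dataset row with the three expected columns.
row = {"instruction": "Add the numbers.", "input": "2 and 3", "output": "5"}
print(build_training_text(row))
print(build_test_input(row))
```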

LLM Definition

model:
  hf_model_ckpt: "NousResearch/Llama-2-7b-hf"
  quantize: true
  bitsandbytes:
    load_in_4bit: true
    bnb_4bit_compute_dtype: "bf16"
    bnb_4bit_quant_type: "nf4"

# LoRA Params -------------------
lora:
  task_type: "CAUSAL_LM"
  r: 32
  lora_dropout: 0.1
  target_modules:
    - q_proj
    - v_proj
    - k_proj
    - o_proj
    - up_proj
    - down_proj
    - gate_proj

While the example above uses Llama2 7b, in theory any open-source LLM supported by Hugging Face can be used with this toolkit.

  hf_model_ckpt: "mistralai/Mistral-7B-v0.1"

  hf_model_ckpt: "tiiuae/falcon-7b"

The LoRA parameters, such as the rank r and the dropout, can be changed.

lora:
  r: 64
  lora_dropout: 0.25
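As a rough worked example of what changing r does, the sketch below counts the trainable LoRA parameters when all seven target_modules are adapted. It assumes the published Llama-2-7B shapes (hidden size 4096, MLP size 11008, 32 layers) and the standard LoRA construction of two low-rank matrices per adapted weight; the function name is hypothetical.

```python
# Assumed Llama-2-7B shapes: hidden size, MLP (intermediate) size, layer count.
HIDDEN, MLP, LAYERS = 4096, 11008, 32

# (input features, output features) of each targeted projection.
TARGET_SHAPES = {
    "q_proj": (HIDDEN, HIDDEN),
    "k_proj": (HIDDEN, HIDDEN),
    "v_proj": (HIDDEN, HIDDEN),
    "o_proj": (HIDDEN, HIDDEN),
    "up_proj": (HIDDEN, MLP),
    "gate_proj": (HIDDEN, MLP),
    "down_proj": (MLP, HIDDEN),
}


def lora_trainable_params(r):
    """LoRA adds an (r x d_in) and a (d_out x r) matrix per adapted weight."""
    per_layer = sum(r * (d_in + d_out) for d_in, d_out in TARGET_SHAPES.values())
    return per_layer * LAYERS


for rank in (32, 64):
    print(rank, lora_trainable_params(rank))
```

Under these assumptions the count grows linearly with r, so doubling the rank doubles the number of trainable parameters. The dropout setting does not affect the count.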

Quality Assurance

qa:
  llm_metrics:
    - length_test
    - word_overlap_test

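To make sure the fine-tuned LLM behaves as expected, you can add tests that check whether the desired behavior is achieved. For example, for an LLM fine-tuned on a summarization task, we may want to check that the generated summary is indeed shorter than the input text. We may also want to measure the word overlap between the original text and the generated summary.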
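The two checks described above can be sketched as plain functions. These are illustrative stand-ins with hypothetical names and definitions; they are not the toolkit's length_test or word_overlap_test implementations.

```python
def length_check(source, summary):
    """True when the summary is shorter (in characters) than the source text."""
    return len(summary) < len(source)


def word_overlap(source, summary):
    """Fraction of distinct summary words that also occur in the source."""
    source_words = set(source.lower().split())
    summary_words = set(summary.lower().split())
    if not summary_words:
        return 0.0
    return len(summary_words & source_words) / len(summary_words)


# Hypothetical input text and model output.
source = "the quick brown fox jumps over the lazy dog near the river bank"
summary = "quick fox jumps over dog"
print(length_check(source, summary))
print(word_overlap(source, summary))
```

A low overlap score can flag summaries that drift away from the source vocabulary, while the length check catches outputs that fail to compress the input at all.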

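Artifact Outputs

This config runs fine-tuning and saves the results under the directory ./experiment/[unique_hash]. Each unique configuration generates a unique hash, so the tool can automatically pick up where it left off. For example, if you need to exit in the middle of training, relaunching the script makes the program load the existing dataset already generated under that directory instead of building it again.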
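One way such a per-configuration hash could be derived is sketched below. The hashing scheme (SHA-256 over canonical JSON, truncated to 8 characters) and the function names are assumptions made for illustration; the toolkit's actual scheme is not described here.

```python
import hashlib
import json
from pathlib import Path


def config_hash(config):
    """Stable short hash of a config dict, independent of key order."""
    canonical = json.dumps(config, sort_keys=True, separators=(",", ":"))
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()[:8]


def experiment_dir(config, root="./experiment"):
    """Directory in which the artifacts of this configuration would live."""
    return Path(root) / config_hash(config)


def can_resume(config, root="./experiment"):
    """True when a previous run already created this experiment directory."""
    return experiment_dir(config, root).exists()


# Hypothetical, heavily trimmed config.
config = {"model": {"hf_model_ckpt": "NousResearch/Llama-2-7b-hf"}, "lora": {"r": 32}}
print(experiment_dir(config))
```

Because the keys are sorted before hashing, two configs that differ only in key order map to the same directory, while any change to a value yields a new one.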

After the script finishes running, you will see these distinct artifacts:

/dataset # generated pkl file in hf datasets format
/model # peft model weights in hf format
/results # csv of prompt, ground truth, and predicted values
/qa # csv of test results: e.g. vector similarity between ground truth and prediction

Once all the changes have been made to the YAML file, you can use it to run a custom fine-tuning experiment.

python toolkit.py --config-path <path to custom YAML file>

Advanced

Fine-tuning workflows typically involve running ablation studies across various LLMs, prompt designs, and optimization techniques. The configuration file can be changed to support running ablation studies.

Specify different prompt templates to experiment with while fine-tuning.

data:
  file_type: "huggingface"
  path: "yahma/alpaca-cleaned"
  prompt:
    - >-
      This is the first prompt template to iterate over
      ### Input: {input}
      ### Output:
    - >-
      This is the second prompt template
      ### Instruction: {instruction}
      ### Input: {input}
      ### Output:
  prompt_stub: >-
    {output}
  test_size: 0.1 # Fraction of the data used for testing; if an integer, the number of samples
  train_size: 0.9 # Fraction of the data used for training; if an integer, the number of samples
  train_test_split_seed: 42

Specify the various LLMs that you would like to experiment with.

model:
  hf_model_ckpt:
    [
      "NousResearch/Llama-2-7b-hf",
      "mistralai/Mistral-7B-v0.1",
      "tiiuae/falcon-7b",
    ]
  quantize: true
  bitsandbytes:
    load_in_4bit: true
    bnb_4bit_compute_dtype: "bf16"
    bnb_4bit_quant_type: "nf4"

Specify the different LoRA configurations that you would like to ablate over.

lora:
  r: [16, 32, 64]
  lora_dropout: [0.25, 0.50]
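To get a feel for how many runs such a config implies, the sketch below expands list-valued settings into individual experiments. It assumes every combination is run (a full Cartesian product); the function name, the flattened key layout, and the prompt placeholders are hypothetical.

```python
from itertools import product


def expand_grid(options):
    """Yield one flat settings dict per combination of list-valued options."""
    keys = list(options)
    # Scalars are treated as single-choice lists so they appear in every run.
    value_lists = [v if isinstance(v, list) else [v] for v in options.values()]
    for combo in product(*value_lists):
        yield dict(zip(keys, combo))


ablation = {
    "prompt": ["template_1", "template_2"],  # placeholders for the two templates
    "hf_model_ckpt": [
        "NousResearch/Llama-2-7b-hf",
        "mistralai/Mistral-7B-v0.1",
        "tiiuae/falcon-7b",
    ],
    "r": [16, 32, 64],
    "lora_dropout": [0.25, 0.50],
    "quantize": True,
}

runs = list(expand_grid(ablation))
print(len(runs))  # 2 prompts x 3 models x 3 ranks x 2 dropouts
print(runs[0])
```

Under this assumption the three ablation axes shown above already amount to 36 fine-tuning runs, which is worth keeping in mind when budgeting GPU time.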

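Extending

The toolkit has a modular, extensible architecture that lets developers customize and enhance its functionality to suit their specific needs. Each component of the toolkit, such as data ingestion, fine-tuning, inference, and quality assurance testing, is designed to be easy to extend.

Contributing

Open-source contributions to this toolkit are welcome and encouraged. If you would like to contribute, please see CONTRIBUTING.md.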

Additional information

Version: v0.2.3
Type: Other source code
Updated: 2025-04-16
Size: 9.94MB
Source: GitHub
