LLM Finetuning Toolkit

v0.2.3

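Overview

LLM Finetuning Toolkit is a config-based CLI tool for launching a series of LLM fine-tuning experiments on your data and gathering their results. From a single YAML config file, you control all elements of a typical experimentation pipeline: prompts, open-source LLMs, optimization strategy, and LLM testing.

Installation

pipx (recommended)

pipx installs the package and its dependencies in a separate virtual environment.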

pipx install llm-toolkit

pip

pip install llm-toolkit

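Quick Start

This guide contains 3 stages that will enable you to get the most out of this toolkit!

Basic: Run your first LLM fine-tuning experiment
Intermediate: Run a custom experiment by changing the components of the YAML configuration file
Advanced: Launch a series of fine-tuning experiments across different prompt templates, LLMs, and optimization techniques, all through one YAML configuration file

Basic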

llmtune generate config
llmtune run ./config.yml

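The first command generates a starter config.yml file and saves it in the current working directory. It gives users a quick way to get started and serves as a base for further modification.

The second command then starts the fine-tuning process using the settings specified in that config.yml file.

Intermediate

The configuration file is the central piece that defines the behavior of the toolkit. It is written in YAML and consists of several sections that control different aspects of the process, such as data ingestion, model definition, training, inference, and quality assurance. We highlight some of the key sections.

Flash Attention 2

To enable Flash Attention for supported models, first install flash-attn: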

pipx

pipx inject llm-toolkit flash-attn --pip-args=--no-build-isolation

pip

pip install flash-attn --no-build-isolation

Then, add the following to the config file.

model:
  torch_dtype: "bfloat16" # or "float16" if using older GPU
  attn_implementation: "flash_attention_2"

Data Ingestion

An example of what the data ingestion section may look like:

data:
  file_type: "huggingface"
  path: "yahma/alpaca-cleaned"
  prompt: >-
    ### Instruction: {instruction}
    ### Input: {input}
    ### Output:
  prompt_stub: "{output}"
  test_size: 0.1 # Proportion of test as % of total; if integer then # of samples
  train_size: 0.9 # Proportion of train as % of total; if integer then # of samples
  train_test_split_seed: 42

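While the example above uses a public dataset from Hugging Face, the config file can also ingest your own data.

  file_type: "json"
  path: "<path to your data file>"

  file_type: "csv"
  path: "<path to your data file>"

The prompt fields build the instructions used to fine-tune the LLM. They read data from the dataset columns named inside {} brackets. In the example provided, the data file is expected to have the columns instruction, input, and output.
During fine-tuning, both prompt and prompt_stub are used. During testing, however, only the prompt section is used as input to the fine-tuned LLM.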
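
The difference between the two phases can be sketched in a few lines of Python. This is only an illustration built on plain str.format, not the toolkit's internal code: the helper names build_training_text and build_inference_text are hypothetical, and joining prompt and prompt_stub by direct concatenation is an assumption.

```python
# Illustrative sketch only: plain str.format, not the toolkit's own code.
PROMPT = (
    "### Instruction: {instruction}\n"
    "### Input: {input}\n"
    "### Output:"
)
PROMPT_STUB = "{output}"


def build_training_text(row: dict) -> str:
    """Fine-tuning text: the filled prompt followed by the expected output.

    Direct concatenation is an assumption made for this sketch.
    """
    return PROMPT.format(**row) + PROMPT_STUB.format(**row)


def build_inference_text(row: dict) -> str:
    """Testing text: the filled prompt alone; the model supplies the output."""
    return PROMPT.format(**row)


# One dataset row with the three columns the templates refer to.
row = {
    "instruction": "Summarize the text.",
    "input": "The cat sat on the mat all afternoon.",
    "output": "A cat rested on a mat.",
}

print(build_training_text(row))
print(build_inference_text(row))
```

A row that lacks one of the referenced columns raises KeyError here, which is why the column names in the data file must match the names in the braces.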

LLM Definition

model:
  hf_model_ckpt: "NousResearch/Llama-2-7b-hf"
  quantize: true
  bitsandbytes:
    load_in_4bit: true
    bnb_4bit_compute_dtype: "bf16"
    bnb_4bit_quant_type: "nf4"

# LoRA Params -------------------
lora:
  task_type: "CAUSAL_LM"
  r: 32
  lora_dropout: 0.1
  target_modules:
    - q_proj
    - v_proj
    - k_proj
    - o_proj
    - up_proj
    - down_proj
    - gate_proj

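While the example above uses Llama2 7b, in theory any open-source LLM supported by Hugging Face can be used with this toolkit.

  hf_model_ckpt: "mistralai/Mistral-7B-v0.1"

  hf_model_ckpt: "tiiuae/falcon-7b"

The LoRA parameters, such as the rank r and the dropout, can be changed.

lora:
  r: 64
  lora_dropout: 0.25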

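Quality Assurance

qa:
  llm_metrics:
    - length_test
    - word_overlap_test

To ensure that the fine-tuned LLM behaves as expected, you can add tests that check whether the desired behavior is being achieved. For example, for an LLM fine-tuned for a summarization task, we may want to check that the generated summary is indeed shorter than the input text. We may also want to measure the overlap between the words in the original text and those in the generated summary.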
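
As a rough idea of what such checks compute, here is a minimal Python sketch of the two summarization checks described above. It is not the toolkit's implementation of length_test or word_overlap_test: the function names summary_is_shorter and word_overlap_ratio are hypothetical, and whitespace tokenization is a simplifying assumption.

```python
def summary_is_shorter(input_text: str, summary: str) -> bool:
    """True when the summary has fewer whitespace-separated words than the input."""
    return len(summary.split()) < len(input_text.split())


def word_overlap_ratio(input_text: str, summary: str) -> float:
    """Fraction of distinct summary words that also appear in the input text."""
    source_words = set(input_text.lower().split())
    summary_words = set(summary.lower().split())
    if not summary_words:
        return 0.0  # an empty summary shares nothing with the input
    return len(summary_words & source_words) / len(summary_words)


source = "the quick brown fox jumps over the lazy dog near the river"
summary = "a quick fox jumps over a dog"

print(summary_is_shorter(source, summary))
print(word_overlap_ratio(source, summary))
```

A low overlap ratio can hint that the model is inventing content, while a ratio near 1 with no length reduction suggests it is copying the input.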

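Artifact Outputs

This config runs fine-tuning and saves the results under the directory ./experiment/[unique_hash]. Each unique configuration generates a unique hash, so the tool can automatically pick up where it left off. For example, if you need to exit in the middle of training, relaunching the script makes the program load the existing dataset already generated under that directory instead of building it again.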
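
The idea of deriving a stable directory from a configuration can be sketched as follows. The source does not say how the toolkit computes its hash, so everything here is an assumption for illustration: the helper names experiment_dir and needs_dataset_build are hypothetical, as are the choice of SHA-256 over canonical JSON and the 12-character truncation.

```python
import hashlib
import json
from pathlib import Path


def experiment_dir(config: dict, root: str = "./experiment") -> Path:
    """Map a config to a directory name that depends only on its content."""
    # sort_keys gives a canonical form, so key order does not change the hash.
    canonical = json.dumps(config, sort_keys=True)
    digest = hashlib.sha256(canonical.encode("utf-8")).hexdigest()[:12]
    return Path(root) / digest


def needs_dataset_build(run_dir: Path) -> bool:
    """Skip dataset creation when a previous run already produced it."""
    return not (run_dir / "dataset").exists()


config_a = {"model": {"hf_model_ckpt": "NousResearch/Llama-2-7b-hf"}, "lora": {"r": 32}}
config_b = {"lora": {"r": 32}, "model": {"hf_model_ckpt": "NousResearch/Llama-2-7b-hf"}}
config_c = {"model": {"hf_model_ckpt": "NousResearch/Llama-2-7b-hf"}, "lora": {"r": 64}}

print(experiment_dir(config_a) == experiment_dir(config_b))  # same settings
print(experiment_dir(config_a) == experiment_dir(config_c))  # different rank
```

Because the directory depends only on the configuration content, rerunning the same config lands in the same place, and changing any setting starts a fresh experiment.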

After the script finishes running, you will see these distinct artifacts:

/dataset # generated pkl file in hf datasets format
/model # peft model weights in hf format
/results # csv of prompt, ground truth, and predicted values
/qa # csv of test results: e.g. vector similarity between ground truth and prediction

Once all the changes have been incorporated into the YAML file, you can use it to run a custom fine-tuning experiment!

python toolkit.py --config-path <path to custom YAML file>

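Advanced

Fine-tuning workflows typically involve running ablation studies across various LLMs, prompt designs, and optimization techniques. The configuration file can be altered to support running ablation studies.

Specify different prompt templates to experiment with while fine-tuning.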

data:
  file_type: "huggingface"
  path: "yahma/alpaca-cleaned"
  prompt:
    - >-
      This is the first prompt template to iterate over
      ### Input: {input}
      ### Output:
    - >-
      This is the second prompt template
      ### Instruction: {instruction}
      ### Input: {input}
      ### Output:
  prompt_stub: "{output}"
  test_size: 0.1 # Proportion of test as % of total; if integer then # of samples
  train_size: 0.9 # Proportion of train as % of total; if integer then # of samples
  train_test_split_seed: 42

Specify the various LLMs that you would like to experiment with.

model:
  hf_model_ckpt:
    [
      "NousResearch/Llama-2-7b-hf",
      "mistralai/Mistral-7B-v0.1",
      "tiiuae/falcon-7b",
    ]
  quantize: true
  bitsandbytes:
    load_in_4bit: true
    bnb_4bit_compute_dtype: "bf16"
    bnb_4bit_quant_type: "nf4"

Specify the different LoRA configurations that you would like to ablate over.

lora:
  r: [16, 32, 64]
  lora_dropout: [0.25, 0.50]
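
To get a feel for how many runs such list-valued settings imply, the sketch below expands them into individual experiment settings. It assumes every combination is run (a full Cartesian product), which the text above does not state explicitly. The expand_grid helper and the flattened sweep dictionary are hypothetical simplifications of the nested YAML.

```python
from itertools import product


def expand_grid(options: dict) -> list:
    """Expand list-valued options into one flat dict per combination."""
    keys = list(options)
    # Scalars are treated as single-choice lists so they appear in every run.
    value_lists = [v if isinstance(v, list) else [v] for v in options.values()]
    return [dict(zip(keys, combo)) for combo in product(*value_lists)]


# Flattened stand-in for the model and lora sections shown above.
sweep = {
    "hf_model_ckpt": [
        "NousResearch/Llama-2-7b-hf",
        "mistralai/Mistral-7B-v0.1",
        "tiiuae/falcon-7b",
    ],
    "r": [16, 32, 64],
    "lora_dropout": [0.25, 0.50],
    "quantize": True,
}

runs = expand_grid(sweep)
print(len(runs))  # 3 models * 3 ranks * 2 dropouts = 18
print(runs[0])
```

Under the same assumption, adding the two prompt templates from the data section would double the count to 36 runs, so it is worth estimating the grid size before launching a sweep.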

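Extending

The toolkit provides a modular and extensible architecture that allows developers to customize and enhance its functionality to suit their specific needs. Each component of the toolkit, such as data ingestion, fine-tuning, inference, and quality assurance testing, is designed to be easily extendable.

Contributing

Open-source contributions to this toolkit are welcome and encouraged. If you would like to contribute, please see CONTRIBUTING.md.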

Additional information

Version: v0.2.3
Type: other source code
Updated: 2025-04-16
Size: 9.94MB
Source: GitHub
