KaSA
1.0.0
High-quality synthetic instruction-following datasets generated by GPT-4o

We implement KaSA on top of LoRA in the official Hugging Face PEFT repository. The source code of our KaSA implementation can be found in peft/src/peft/tuners/lora/layer.py. Notably, our implementation is agnostic to the PEFT version: we obtained consistent results with both a recent release (0.13.1.dev0) and an older one (0.6.3.dev0), ruling out gains caused by implementation differences between versions.
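For intuition, the following is a minimal, self-contained PyTorch sketch of a KaSA-style linear adapter: the frozen base weight is truncated via SVD to discard its smallest (noisiest) singular components, and a low-rank update with explicitly learnable singular values is added on top. This is an illustration only; the class name KaSALinear, the svd_rank argument, and the initialization choices are assumptions and do not reproduce the actual code in peft/src/peft/tuners/lora/layer.py.

import torch
import torch.nn as nn

class KaSALinear(nn.Module):
    """Illustrative KaSA-style adapter around a frozen nn.Linear (not the PEFT code)."""

    def __init__(self, base: nn.Linear, r: int = 8, svd_rank: int = 8, alpha: int = 16):
        super().__init__()
        self.base = base
        for p in self.base.parameters():
            p.requires_grad = False  # keep the pretrained weight frozen

        # Knowledge-based truncation: drop the `svd_rank` smallest singular
        # components of the frozen weight before adding the adapter.
        W = base.weight.data                       # (out_features, in_features)
        U, S, Vh = torch.linalg.svd(W, full_matrices=False)
        k = S.numel() - svd_rank
        self.register_buffer("W_trunc", U[:, :k] @ torch.diag(S[:k]) @ Vh[:k, :])

        # LoRA-style factors with explicit, learnable singular values.
        self.lora_A = nn.Parameter(torch.randn(r, base.in_features) * 0.01)
        self.lora_B = nn.Parameter(torch.zeros(base.out_features, r))
        self.delta_sigma = nn.Parameter(torch.zeros(r))
        self.scaling = alpha / r

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        base_out = nn.functional.linear(x, self.W_trunc, self.base.bias)
        update = self.lora_B @ torch.diag(self.delta_sigma) @ self.lora_A
        return base_out + nn.functional.linear(x, update) * self.scaling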
Important
If you use the data or code from this repository, please consider citing the following paper:
@article{wang2024kasa,
  title   = {KaSA: Knowledge-Aware Singular-Value Adaptation of Large Language Models},
  author  = {Wang, Fan and Jiang, Juyong and Park, Chansung and Kim, Sunghun and Tang, Jing},
  journal = {arXiv preprint arXiv:2412.06071},
  year    = {2024}
}

Set up the environment and install the dependencies:

conda create -n kasa python=3.10
conda install pytorch==2.1.0 torchvision==0.16.0 torchaudio==2.1.0 pytorch-cuda=11.8 -c pytorch -c nvidia
# install peft from the local folder
cd peft
pip install -e .
# note the pinned package versions
pip install datasets==2.21.0
pip install numpy==1.26.4
pip install scipy
pip install scikit-learn
pip install sentencepiece

We run sequence classification with Hugging Face community models on the General Language Understanding Evaluation (GLUE) benchmark, covering six tasks: CoLA, SST-2, MRPC, STS-B, QNLI, and RTE. Details of the datasets are available at https://huggingface.co/datasets/nyu-mll/glue.
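As a quick check that the environment is set up, the pinned datasets package can load any of these GLUE configurations directly from the Hub, e.g. CoLA:

from datasets import load_dataset

# Load the CoLA configuration of GLUE (train / validation / test splits).
dataset = load_dataset("nyu-mll/glue", "cola")
print(dataset["train"][0])  # a single example: sentence, label, idx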
Here is an example of how to start fine-tuning roberta-base on the CoLA task:
cd runs
bash robert_base_cola.sh
The content of robert_base_cola.sh is shown below:
#!/bin/bash
cd ../
mkdir -p logs/roberta-base
# variables
CUDA_DEVICE=2
MODEL_NAME_OR_PATH="roberta-base"
DATASET="cola"
TASK="cola"
BATCH_SIZE=32
MAX_LENGTH=512
NUM_EPOCH=100
HEAD_LR=4e-4
MODULE_LR=4e-4
LORA_R=8
LORA_ALPHA=16
LORA_DROPOUT=0.0
BETA=0.0001
GEMMA=0.001
SEED=0
WEIGHT_DECAY=0.0
# run
LOG_FILE="logs/${MODEL_NAME_OR_PATH}/${MODEL_NAME_OR_PATH}_${TASK}_bs_${BATCH_SIZE}_maxlen_${MAX_LENGTH}_lora_r_${LORA_R}_lora_alpha_${LORA_ALPHA}_lora_dropout_${LORA_DROPOUT}_modulelr_${MODULE_LR}_headlr_${HEAD_LR}_beta_${BETA}_gemma_${GEMMA}_weight_decay_${WEIGHT_DECAY}_seed_${SEED}.log"
CUDA_VISIBLE_DEVICES=$CUDA_DEVICE python main.py \
    --model_name_or_path $MODEL_NAME_OR_PATH \
    --dataset $DATASET \
    --task $TASK \
    --max_length $MAX_LENGTH \
    --bs $BATCH_SIZE \
    --lora_r $LORA_R \
    --lora_alpha $LORA_ALPHA \
    --lora_dropout $LORA_DROPOUT \
    --num_epoch $NUM_EPOCH \
    --head_lr $HEAD_LR \
    --module_lr $MODULE_LR \
    --beta $BETA \
    --gemma $GEMMA \
    --weight_decay $WEIGHT_DECAY \
    --seed $SEED 2>&1 | tee $LOG_FILE
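For orientation, the snippet below is a schematic of how a task loss might be combined with two auxiliary regularization terms weighted by coefficients such as --beta and --gemma, reusing the factor names from the adapter sketch above. The specific regularizers shown (an L2 penalty on the learnable singular values and an orthogonality penalty on the low-rank factors) are placeholders; the exact terms KaSA optimizes are defined in the paper and in main.py.

import torch

def kasa_style_loss(task_loss: torch.Tensor,
                    lora_A: torch.Tensor,       # (r, in_features)
                    lora_B: torch.Tensor,       # (out_features, r)
                    delta_sigma: torch.Tensor,  # (r,)
                    beta: float, gemma: float) -> torch.Tensor:
    r = lora_A.shape[0]
    eye = torch.eye(r, device=lora_A.device)
    # Placeholder regularizer 1: keep the learnable singular values small.
    sigma_reg = delta_sigma.pow(2).sum()
    # Placeholder regularizer 2: push the low-rank factors toward orthogonality.
    ortho_reg = ((lora_A @ lora_A.T - eye).norm() ** 2
                 + (lora_B.T @ lora_B - eye).norm() ** 2)
    return task_loss + beta * sigma_reg + gemma * ortho_reg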
Load the PEFT model for inference:

from peft import AutoPeftModelForCausalLM
from transformers import AutoTokenizer
import torch

# Load the fine-tuned KaSA checkpoint (base model + adapter) and its tokenizer.
model = AutoPeftModelForCausalLM.from_pretrained("saves/kasa/checkpoint-52580").to("cuda")
tokenizer = AutoTokenizer.from_pretrained("saves/kasa/checkpoint-52580")
model.eval()

# Format a prompt and generate a completion on the GPU.
template = "### Context : {}\n### Completion : "
prompt = template.format("name : Blue Spice | Type : coffee shop | area : city centre")
inputs = tokenizer(prompt, return_tensors="pt")
outputs = model.generate(input_ids=inputs["input_ids"].to("cuda"), max_new_tokens=50)
print(tokenizer.batch_decode(outputs, skip_special_tokens=True)[0])

> "Blue Spice is a coffee shop located in the city centre."

Tip
The running logs and results of all our experiments are saved under the logs path. Below is an example:
epoch 0: {'matthews_correlation': 0.0}, current_best_corr: 0.0 train_loss: 0.5064952373504639
epoch 1: {'matthews_correlation': 0.4528085001256977}, current_best_corr: 0.4528085001256977 train_loss: 0.2968645691871643
epoch 2: {'matthews_correlation': 0.5314083843246411}, current_best_corr: 0.5314083843246411 train_loss: 0.3451506495475769
...
epoch 96: {'matthews_correlation': 0.6331219341866674}, current_best_corr: 0.6581805893879898 train_loss: 0.057534683495759964
epoch 97: {'matthews_correlation': 0.6206837048829764}, current_best_corr: 0.6581805893879898 train_loss: 0.057706814259290695
epoch 98: {'matthews_correlation': 0.6281691768918801}, current_best_corr: 0.6581805893879898 train_loss: 0.05744687840342522
epoch 99: {'matthews_correlation': 0.6256673855627156}, current_best_corr: 0.6581805893879898 train_loss: 0.0582236722111702

A full log file starts with the run configuration, followed by per-epoch progress and metrics:

model_name_or_path: roberta-base
dataset: cola
task: cola
peft: kasa
num_epochs: 100
bs: 32
lora_r: 8
lora_alpha: 16
lora_dropout: 0.0
head_lr: 0.0004
module_lr: 0.0004
max_length: 512
weight_decay: 0.0
warmup_ratio: 0.06
seed: 0
beta: 0.0001
gemma: 0.001
...
  0%|          | 0/33 [00:00<?, ?it/s]
  9%|▉         | 3/33 [00:00<00:01, 27.53it/s]
 21%|██        | 7/33 [00:00<00:00, 30.12it/s]
 30%|███       | 10/33 [00:00<00:00, 28.58it/s]
 39%|███▉      | 13/33 [00:00<00:00, 27.65it/s]
 48%|████▊     | 16/33 [00:00<00:00, 27.95it/s]
 58%|█████▊    | 19/33 [00:00<00:00, 25.45it/s]
 67%|██████▋   | 22/33 [00:00<00:00, 25.99it/s]
 76%|███████▌  | 25/33 [00:00<00:00, 24.67it/s]
 88%|████████▊ | 29/33 [00:01<00:00, 25.53it/s]
100%|██████████| 33/33 [00:01<00:00, 27.68it/s]
100%|██████████| 33/33 [00:01<00:00, 27.01it/s]
epoch 99: {'matthews_correlation': 0.6256673855627156}, current_best_corr: 0.6581805893879898 train_loss: 0.0582236722111702
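For CoLA, the matthews_correlation reported in the logs is the standard Matthews correlation coefficient; it can be recomputed from saved predictions with scikit-learn (already in the dependency list above). The arrays below are placeholders for the validation labels and the classifier's predictions:

from sklearn.metrics import matthews_corrcoef

# Placeholder labels / predictions; in practice these come from the CoLA
# validation split and the fine-tuned classifier.
y_true = [1, 0, 1, 1, 0, 1]
y_pred = [1, 0, 0, 1, 0, 1]
print(matthews_corrcoef(y_true, y_pred))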