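Flan-Alpaca-LoRA: Instruction Tuning from Humans and Machines with Low-Rank Adaptation

This repository trains google/flan-t5 on the Alpaca dataset with the low-rank adaptation (LoRA) training method. LoRA reduces the GPU memory required and speeds up training.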
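To make the idea behind low-rank adaptation concrete, here is a minimal pure-Python sketch of a single adapted linear layer. It is an illustration only, not the repository's training code: the class name `LoRALinear`, the `alpha` scaling argument, and the initialization values are assumptions chosen for the example. The frozen weight stays untouched, and only two small matrices are trained.

```python
import random


def matvec(matrix, vector):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


class LoRALinear:
    """Hypothetical toy layer: y = W x + (alpha / r) * B (A x)."""

    def __init__(self, weight, rank, alpha=1.0, seed=0):
        rng = random.Random(seed)
        d_out, d_in = len(weight), len(weight[0])
        self.weight = weight  # frozen pretrained weight, never updated
        # A (rank x d_in) starts with small random values.
        self.lora_a = [[rng.gauss(0.0, 0.01) for _ in range(d_in)]
                       for _ in range(rank)]
        # B (d_out x rank) starts at zero, so the adapter is a no-op at first.
        self.lora_b = [[0.0] * rank for _ in range(d_out)]
        self.scaling = alpha / rank

    def forward(self, x):
        base = matvec(self.weight, x)
        update = matvec(self.lora_b, matvec(self.lora_a, x))
        return [b + self.scaling * u for b, u in zip(base, update)]

    def trainable_params(self):
        # Only A and B are trained: rank * d_in + d_out * rank values.
        rank, d_in = len(self.lora_a), len(self.lora_a[0])
        d_out = len(self.lora_b)
        return rank * d_in + d_out * rank


layer = LoRALinear([[1.0, 2.0, 3.0], [4.0, 5.0, 6.0]], rank=1)
print(layer.forward([1.0, 1.0, 1.0]))
print(layer.trainable_params())
```

Because the number of trained values grows with `rank * (d_in + d_out)` rather than `d_in * d_out`, the optimizer state and gradients for a large layer stay small, which is where the memory saving comes from.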

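June 17, 2023: Added a notebook. You can now try out Flan-Alpaca-LoRA.

May 3, 2023: Trained Flan-T5-XL on the Alpaca-GPT4 dataset.

April 13, 2023: Trained Flan-T5-XL on the GPTeacher dataset (Instruct and Roleplay), which performs well.

April 5, 2023: Trained flan-t5-xxl using 8-bit quantization. The model fits on a single 3090 GPU. All models can be found on HuggingFace.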
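A rough back-of-the-envelope calculation shows why 8-bit quantization matters for the XXL model. The sketch below assumes about 11 billion parameters for flan-t5-xxl and 24 GB of memory on a 3090; it counts weight storage only and ignores activations, adapter gradients, and optimizer state, so treat the numbers as a lower bound.

```python
# Assumptions for illustration: ~11e9 parameters, 24 GB GPU memory.
PARAMS_XXL = 11e9
GPU_MEMORY_GB = 24


def weight_memory_gb(num_params, bits_per_param):
    """Memory needed to store the weights alone, in gigabytes."""
    return num_params * bits_per_param / 8 / 1e9


estimates = {bits: weight_memory_gb(PARAMS_XXL, bits) for bits in (32, 16, 8)}
for bits, gigabytes in estimates.items():
    headroom = GPU_MEMORY_GB - gigabytes
    print(f"{bits}-bit weights: {gigabytes:.0f} GB, headroom {headroom:.0f} GB")
```

Under these assumptions, 16-bit weights alone nearly fill the card, while 8-bit weights leave roughly half of it free for activations and the LoRA adapter.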

Model	adapter_params	Data	GPU	Time
flan-alpaca-lora-base	0.9M	alpaca-cleaned	3090	20 minutes
flan-alpaca-lora-large	2.4M	alpaca-cleaned	3090	50 minutes
flan-alpaca-lora-xl	4.7M	alpaca-cleaned	3090	2.5 hours
flan-alpaca-lora-xxl	9.4M	alpaca-cleaned	3090	10 hours
flan-gpteacher-lora-xl	4.7M	GPTeacher	3090	80 minutes
flan-alpaca-gpt4-lora-xl	4.7M	alpaca-gpt4	3090	3.25 hours
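The adapter sizes can be estimated with simple arithmetic. The sketch below is an assumption-laden illustration, not taken from the repository: it assumes LoRA rank 8 applied to the query and value projections of every attention block, and uses the published Flan-T5 hidden sizes and layer counts, with square d_model x d_model projections. Under those assumptions it gives roughly 0.9M, 2.4M, 4.7M, and 9.4M parameters for base, large, xl, and xxl.

```python
# (d_model, layers per stack) for each Flan-T5 size; assumed from the
# public model configs, not stated in this document.
FLAN_T5_SHAPES = {
    "base": (768, 12),
    "large": (1024, 24),
    "xl": (2048, 24),
    "xxl": (4096, 24),
}


def lora_adapter_params(d_model, num_layers, rank=8, targets_per_attention=2):
    """Count LoRA parameters, assuming q and v are adapted in every attention block."""
    # Each encoder layer has one attention block; each decoder layer has
    # two (self-attention and cross-attention).
    attention_blocks = num_layers * 1 + num_layers * 2
    adapted_matrices = attention_blocks * targets_per_attention
    # One adapted matrix adds A (rank x d_model) and B (d_model x rank).
    params_per_matrix = rank * (d_model + d_model)
    return adapted_matrices * params_per_matrix


adapter_sizes = {name: lora_adapter_params(*shape)
                 for name, shape in FLAN_T5_SHAPES.items()}
for name, count in adapter_sizes.items():
    print(f"flan-t5-{name}: {count / 1e6:.1f}M adapter parameters")
```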

Dependencies

torch==1.13.1
transformers==4.29.1
peft==0.3.0
bitsandbytes==0.38.1
accelerate==0.19.0

The latest versions of these packages should also work.

Training

The following command fine-tunes Flan-T5-base in only 20 minutes on a single 3090 GPU:

python train.py \
    --model_name_or_path google/flan-t5-base \
    --data_path ./alpaca_data_cleaned.json \
    --bf16 True \
    --output_dir ./ckpts/ \
    --num_train_epochs 3 \
    --per_device_train_batch_size 8 \
    --gradient_accumulation_steps 8 \
    --evaluation_strategy "no" \
    --save_strategy "no" \
    --learning_rate 5e-4 \
    --weight_decay 0. \
    --warmup_ratio 0.03 \
    --lr_scheduler_type "cosine" \
    --logging_steps 50 \
    --tf32 True
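To show how the batch-size and schedule flags above interact, the following sketch computes the effective batch size and an approximate learning-rate curve (linear warmup followed by cosine decay). It is a simplified model of the behavior, not the trainer's actual implementation; the function name `lr_at_step` and the dataset size of 50,000 examples are hypothetical values chosen for illustration.

```python
import math


def lr_at_step(step, total_steps, peak_lr=5e-4, warmup_ratio=0.03):
    """Approximate schedule: linear warmup, then cosine decay to zero."""
    warmup_steps = math.ceil(total_steps * warmup_ratio)
    if step < warmup_steps:
        return peak_lr * step / max(1, warmup_steps)
    progress = (step - warmup_steps) / max(1, total_steps - warmup_steps)
    return peak_lr * 0.5 * (1.0 + math.cos(math.pi * progress))


per_device_batch = 8   # --per_device_train_batch_size
grad_accum = 8         # --gradient_accumulation_steps
effective_batch = per_device_batch * grad_accum  # examples per optimizer step on one GPU

num_examples = 50_000  # hypothetical dataset size, not taken from the source
epochs = 3             # --num_train_epochs
steps_per_epoch = math.ceil(num_examples / effective_batch)
total_steps = steps_per_epoch * epochs
warmup_steps = math.ceil(total_steps * 0.03)

print(f"effective batch size: {effective_batch}")
print(f"optimizer steps: {total_steps} (warmup: {warmup_steps})")
for step in (0, warmup_steps, total_steps // 2, total_steps):
    print(f"step {step}: lr = {lr_at_step(step, total_steps):.2e}")
```

Gradient accumulation trades wall-clock time for memory: each optimizer step sees 64 examples, while only 8 need to be held on the GPU at once.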

Example usage:

import transformers
from peft import PeftModel

# Where peft_model_id should be the saving directory or huggingface model id
model_name = "google/flan-t5-large"
peft_model_id = "reasonwang/flan-alpaca-lora-large"
tokenizer = transformers.AutoTokenizer.from_pretrained(model_name)
base_model = transformers.AutoModelForSeq2SeqLM.from_pretrained(model_name)
peft_model = PeftModel.from_pretrained(base_model, peft_model_id)

# Input an instruction or any other questions.
inputs = tokenizer("List a few tips to get good scores in math.", return_tensors="pt")
outputs = peft_model.generate(**inputs, max_length=128, do_sample=True)
print(tokenizer.batch_decode(outputs, skip_special_tokens=True))

Additional information

Version: 1.0.0
Type: AI source code
Updated: 2025-09-03
Size: 13.55 MB
Source: GitHub
