Llama 2_Huggingface_4Bit_QLoRA

Update note

An updated version can be found in a new repo:

https://github.com/gmongaras/wizard_qlora_finetuning

llama-2_huggingface_4bit_qlora

A working example of 4-bit QLoRA fine-tuning for Falcon/Llama 2 models using HuggingFace.

To start fine-tuning, edit and run main.py.
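As a rough illustration of what a 4-bit QLoRA setup of this kind typically involves, the sketch below loads a base model in 4-bit and attaches LoRA adapters. It is not the content of main.py, which is not reproduced here. The helper names, the hyperparameter values, and the choice of target_modules are all assumptions made for illustration.

```python
def quantization_kwargs():
    """Keyword arguments for transformers.BitsAndBytesConfig (assumed values)."""
    return {
        "load_in_4bit": True,
        "bnb_4bit_quant_type": "nf4",
        "bnb_4bit_use_double_quant": True,
    }


def lora_kwargs(target_modules):
    """Keyword arguments for peft.LoraConfig (assumed values, not from main.py)."""
    return {
        "r": 16,
        "lora_alpha": 32,
        "lora_dropout": 0.05,
        "bias": "none",
        "task_type": "CAUSAL_LM",
        "target_modules": list(target_modules),
    }


def load_4bit_lora_model(model_path, target_modules):
    """Load a causal LM in 4-bit on GPU 0 and wrap it with LoRA adapters."""
    # Heavy dependencies are imported lazily so the helpers above can be
    # inspected without torch/transformers/peft installed.
    import torch
    from transformers import AutoModelForCausalLM, BitsAndBytesConfig
    from peft import LoraConfig, get_peft_model, prepare_model_for_kbit_training

    bnb_config = BitsAndBytesConfig(
        bnb_4bit_compute_dtype=torch.float16, **quantization_kwargs()
    )
    model = AutoModelForCausalLM.from_pretrained(
        model_path,
        quantization_config=bnb_config,
        device_map={"": 0},  # single GPU only
    )
    model = prepare_model_for_kbit_training(model)
    return get_peft_model(model, LoraConfig(**lora_kwargs(target_modules)))
```

For a Llama 2 checkpoint one might call load_4bit_lora_model("./llama-2", ["q_proj", "v_proj"]); for Falcon the attention projection is named "query_key_value". Which modules to adapt is a tuning choice, not something fixed by this README.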

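Once fine-tuning is complete, you should have checkpoints in ./outputs. Before running inference, the LoRA weights can be merged with the original weights, which makes inference faster and lowers the GPU requirements during inference. To do this, run the merge_weights.py script with your paths.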
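The merge step can be sketched roughly as follows. This is an illustration using PEFT's merge_and_unload, not the actual merge_weights.py. The function names are hypothetical, and the checkpoint-<step> directory naming is an assumption based on the HuggingFace Trainer's usual layout.

```python
from pathlib import Path


def latest_checkpoint(output_dir):
    """Return the checkpoint-<step> directory with the highest step, or None."""
    candidates = []
    for path in Path(output_dir).glob("checkpoint-*"):
        suffix = path.name.split("-", 1)[1]
        if path.is_dir() and suffix.isdigit():
            candidates.append((int(suffix), path))
    if not candidates:
        return None
    return max(candidates, key=lambda item: item[0])[1]


def merge_lora(base_model_path, adapter_path, merged_path):
    """Fold LoRA adapter weights into the base model and save the result."""
    # Imported lazily so latest_checkpoint works without these packages.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer
    from peft import PeftModel

    # Load the base model in half precision (not 4-bit) for the merge.
    base = AutoModelForCausalLM.from_pretrained(
        base_model_path, torch_dtype=torch.float16
    )
    merged = PeftModel.from_pretrained(base, adapter_path).merge_and_unload()
    merged.save_pretrained(merged_path)
    # Save the tokenizer alongside so the merged directory is self-contained.
    AutoTokenizer.from_pretrained(base_model_path).save_pretrained(merged_path)
```

A call might look like merge_lora("./llama-2", latest_checkpoint("./outputs"), "./merged"), where "./merged" is a placeholder output path.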

Finally, you can run generate.py for example generation with the merged model.

Requirements

The Python requirements for running the scripts are listed in requirements.txt.

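You should also download the Falcon 7B weights from https://huggingface.co/tiiuae/falcon-7b and put the files in the ./tiiuae/falcon-7b directory, or download the Llama 2 weights from https://huggingface.co/meta-llama/Llama-2-7b-hf and put them in ./llama-2.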
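If you prefer to script the download, one possible approach (an assumption, not something the repo itself provides) is snapshot_download from the huggingface_hub package. The WEIGHT_DIRS mapping and download_weights helper below are hypothetical names. The Llama 2 repository is gated, so it requires an accepted license and a logged-in HuggingFace account.

```python
# Repo id -> local directory, following the layout described above.
WEIGHT_DIRS = {
    "tiiuae/falcon-7b": "./tiiuae/falcon-7b",
    "meta-llama/Llama-2-7b-hf": "./llama-2",
}


def download_weights(repo_id):
    """Download one of the supported models into its expected directory."""
    # Imported lazily; requires `pip install huggingface_hub`.
    from huggingface_hub import snapshot_download

    return snapshot_download(repo_id=repo_id, local_dir=WEIGHT_DIRS[repo_id])
```

For example, download_weights("tiiuae/falcon-7b") fetches the Falcon files into ./tiiuae/falcon-7b.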

Multiple GPUs

The script does not support multi-GPU for 4-bit fine-tuning. If I find a way to do this, I will update the script.

GPU requirements

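The base model requires about 6 GB of memory.
Fine-tuning memory depends on the adapter size, batch size, maximum sequence length, and so on. With the current configuration, memory usage is about 8 GB.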
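As a back-of-the-envelope check on these figures, the storage needed for the weights alone scales as parameters × bits / 8. The sketch below uses a nominal 7e9 parameter count, which is an assumption (the exact count differs slightly between Falcon 7B and Llama 2 7B). It ignores layers kept in higher precision, quantization constants, activations, and CUDA context, so it gives a lower bound only. That overhead likely accounts for much of the gap between this bound and the roughly 6 GB observed.

```python
def weight_memory_gb(num_params, bits_per_param):
    """Memory for the raw weights only, in GB (1 GB = 1e9 bytes)."""
    return num_params * bits_per_param / 8 / 1e9


NOMINAL_PARAMS = 7e9  # assumed round figure for a "7B" model

four_bit_gb = weight_memory_gb(NOMINAL_PARAMS, 4)     # 3.5 GB
half_prec_gb = weight_memory_gb(NOMINAL_PARAMS, 16)   # 14.0 GB

print(f"4-bit weights:  {four_bit_gb:.1f} GB")
print(f"16-bit weights: {half_prec_gb:.1f} GB")
```

The comparison shows why 4-bit loading matters here: the same weights in 16-bit precision would need about four times as much memory before any training state is added.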

Issues

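If you get a shape error during training, there is a problem with bitsandbytes and/or PEFT. The best fix is to uninstall them completely and reinstall them from source: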

python -m pip uninstall bitsandbytes transformers accelerate peft -y
python -m pip install git+https://github.com/huggingface/transformers.git git+https://github.com/huggingface/peft.git git+https://github.com/huggingface/accelerate.git git+https://github.com/timdettmers/bitsandbytes.git -U
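To confirm which versions are installed before and after reinstalling, a small check such as the following can help. It is a minimal sketch using only the standard library. The installed_versions helper is a hypothetical name, not part of the repo.

```python
from importlib import metadata

PACKAGES = ("bitsandbytes", "transformers", "accelerate", "peft")


def installed_versions(packages=PACKAGES):
    """Map each package name to its installed version, or None if missing."""
    versions = {}
    for name in packages:
        try:
            versions[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            versions[name] = None
    return versions


if __name__ == "__main__":
    for name, version in installed_versions().items():
        print(f"{name}: {version or 'not installed'}")
```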

If you get the error "CUDA Setup failed despite GPU being available. Please run the following command to get more information", you need to build bitsandbytes from source and place it in the bitsandbytes site-packages directory, following https://github.com/oobabooga/text-generation-webui/issues/147.

Additional information

Version: 1.0.0
Type: AI source code
Updated: 2025-09-07
Size: 7.02KB
Source: GitHub
