baichuan Qlora Tuning Download - baichuan Qlora Tuning Source code download

baichuan Qlora Tuning

AI Source Code

1.0.0

Download

baichuan-Qlora-Tuning

Based on qlora, the instructions are fine-tuned to the baichuan-7B big model.

rely

 mpi4py
transformers == 4.28 . 1
peft == 0.3 . 0
icetk
deepspeed == 0.9 . 2
accelerate
cpm_kernels
sentencepiece == 0.1 . 99
peft = 0.3 . 0
torch = 2.0 . 0

The latest version is OK.

Directory structure

 --data
----msra
------train.txt
------eval.txt
--checkpoint
----baichuan
------adapter_model
--model_hub
----baichuan-7B
--baichuan_qlora.py  # 训练
--dataset.py  # 处理数据
--predict.py  # 交互式闲聊
--test_baichuan.py  # 测试baichuan模型
--train.log  # 训练日志

General steps

Data format

The data formats in train.txt and eval.txt are the same, and each behavior is a sample, which is specifically:

{ "instruct" : "你现在是一个实体识别模型，你需要提取文本里面的人名、地名、机构名，如果存在结果，返回'实体_实体类型'，不同实体间用n分隔。如果没有结果，回答'没有'。" , "query" : "文本：因有关日寇在京掠夺文物详情，藏界较为重视，也是我们收藏北京史料中的要件之一。" , "answer" : "日_地名n京_地名n北京_地名" }

Instruct is the instruction, query is the text, and answer is the result. It can be constructed by itself based on your own data.

The names of these three columns can be defined in baichuan_qlora.py.

train

Go to the hugging face to download the relevant files of baichuan-7B to model_hub/baichuan-7B. After the download is completed, you can use python test_baichuan.pt to test the model. After building the dataset, run it directly:

 python baichaun_qlora . py

predict

After the training is completed, you can use the following instructions to make predictions:

 python predict . py - - model_name "baichuan" - - base_model "./model_hub/baichuan-7B" - - tokenizer_path "./model_hub/baichuan-7B" - - lora_model "./checkpoint/baichuan/adapter_model" - - with_prompt - - interactive

Prediction results:

加载模型耗时： 0.6068947672843933分钟
loading peft model
Start inference with instruction mode .
== == == == == == == == == == == == == == == == == == == == == == == == == == == == == == == == == == == == == == == == == == =
+ 当前使用的模型是： baichuan
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
+ 该模式下仅支持单轮问答，无多轮对话能力。
== == == == == == == == == == == == == == == == == == == == == == == == == == == == == == == == == == == == == == == == == == =
Input :你现在是一个实体识别模型，你需要提取文本里面的人名、地名、机构名，如果存在结果，返回'实体_实体类型' ，不同实体间用n分隔。如果没有结果，回答'没有' 。文本：我们是受到郑振铎先生、阿英先生著作的启示，从个人条件出发，瞄准现代出版史研究的空白，重点集藏解放区、国民党毁禁出版物。
Response :  郑振铎_人名
阿英_人名


Input :你现在是一个实体识别模型，你需要提取文本里面的人名、地名、机构名，如果存在结果，返回'实体_实体类型' ，不同实体间用n分隔。如果没有结果，回答'没有' 。文本：藏书家、作家姜德明先生在1997年出版的书话专集《文林枝叶》中以“爱书的朋友”为题，详细介绍了我们夫妇的藏品及三口之家以书为友、好乐清贫的逸闻趣事。
Response :  姜德明_人名


Input :你现在是一个实体识别模型，你需要提取文本里面的人名、地名、机构名，如果存在结果，返回'实体_实体类型' ，不同实体间用n分隔。如果没有结果，回答'没有' 。文本：去年，我们又被评为“北京市首届家庭藏书状元明星户”。
Response :  北京_地名

It has certain effects, but is not very good. It may require adjustment of parameters and training for longer time.

Predicted video memory:

Replenish

**How to train your own data? **Just build the format required for the above data, and then modify the address of the data in baichuan_qlora, and of course, you can also modify some other parameters.
How to use other models? The difference between other models lies in two aspects: 1. The input format of the data is different. 2. The model loading method is different. For chatglm, llama, belle, you can refer to what you did before: https://github.com/taishan1994/qlora-chinese-LLM

refer to

https://github.com/wp931120/baichuan_sft_lora
https://github.com/baichuan-inc/baichuan-7B
https://github.com/artidoro/qlora/
https://github.com/taishan1994/qlora-chinese-LLM

Expand

Additional Information

Version 1.0.0
Type AI Source Code
Update Time 2025-09-11
size 18.16MB
From Github

Related Applications

GitHub sgrebnov/cordova plugin background download

2024-11-05
Wa ch ull navra maza navsacha 2 2024 ull ovie Fr e Online On Strea ings

2024-11-03
Wa ch navra maza navsacha 2 2024 ull ovie Online For Fr e Strea ings At Home

2024-11-03
Wa ch the greatest of all time 2024 ull ovie Online For Fr e Strea ings At Home

2024-11-02
wolfs 2024 f llmo ie f lmyz lla dow load ree 7 0p 4 0p a d 10 0p

2024-11-01
GitHub actions/download artifact

2024-11-01

Recommended for You

chat.petals.dev

Other source code

1.0.0
GPT Prompt Templates

Other source code

1.0.0
GPTyped

Other source code

GPTyped 1.0.5
ML stack

AI Source Code

1.0.0
awesome free chatgpt

AI Source Code

1.0.0
pywin_contextmenu

AI Source Code

Version update
Google Dorks

Other source code

1.0
shepherd

Other source code

v6.1.6-react-shepherd: Prepare Release (#3063)
mongo express

Other source code

v1.1.0-rc-3

Related Information All