fine tunning Download - fine tunning Source code download

fine tunning

AI Source Code

1.0.0

Download

fine-tunning

From model deployment to model fine-tuning, this project is a training camp study combined with the training camp project, self-understanding, digestion and summary, and innovative applications. Xiaobai can star/fork

Quick introduction to large language models (theoretical learning and fine-tuning practice)

Build a development environment

Python v3.10+
Python Environment Management Miniconda
Python interactive development environment Jupyter Lab
Hugging Face Transformers
Audio processing toolkit ffmpeg

Install Python dependency package

Please use the requirements.txt file for Python dependency package installation:

pip install -r requirements.txt

About GPU drivers and CUDA versions

Generally, the GPU driver and CUDA versions are required to meet the installed versions of PyTorch and TensorFlow.

Most newly released large language models use the newer PyTorch v2.0+ version, which Pytorch officially believes is 11.8 and matched GPU driver versions. For details, please refer to the CUDA minimum version requested reply provided by Pytorch.

In short, it is recommended to install the current latest CUDA 12.3 version directly. For details, please refer to the official Nvidia installation package.

After the installation is complete, use nvidia-smi directive to view the version:

nvidia-smi

Fri Mar  1 11:16:55 2024
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 529.08       Driver Version: 529.08       CUDA Version: 12.0     |
| -------------------------------+----------------------+----------------------+
| GPU  Name            TCC/WDDM | Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp  Perf  Pwr:Usage/Cap |         Memory-Usage | GPU-Util  Compute M. |
|                               |                      |               MIG M. |
| ===============================+======================+====================== |
|   0  NVIDIA GeForce ... WDDM  | 00000000:01:00.0 Off |                  N/A |
| N/A   45C    P8     6W /  30W |      0MiB /  4096MiB |      0%      Default |
|                               |                      |                  N/A |
+-------------------------------+----------------------+----------------------+

+-----------------------------------------------------------------------------+
| Processes:                                                                  |
|  GPU   GI   CI        PID   Type   Process name                  GPU Memory |
|        ID   ID                                                   Usage      |
| ============================================================================= |
+-----------------------------------------------------------------------------+

In order to use the OpenAI API, you need to get an API key from the OpenAI console. Once you have the key, you can set it as an environment variable:

For Unix-based systems such as Ubuntu or MacOS, you can run the following command in the terminal:

 export OPENAI_API_KEY= '你的-api-key '

For Windows, you can use the following command in the command prompt:

 set OPENAI_API_KEY=你的-api-key

About requirements, you can download it according to the situation

pip install -r requirements.txt

Transformers development environment construction

introduce

Development environment construction includes several parts

Miniconda
Jupyter Lab
Hugging Face Transformers, when you need to try multiple models, it is recommended to install both tensorflow and pytorch.
Other dependency packages

Miniconda

Miniconda is a Python environment management tool that can be used to create and manage multiple Python environments. It is a lightweight alternative to Anaconda and does not include any IDE tools. Miniconda can download the installation package from the official website. You can also download it from the mirror website:

Installation of Miniconda environment

 # 下载 Miniconda 安装包
$ wget https://mirrors.tuna.tsinghua.edu.cn/anaconda/miniconda/Miniconda3-latest-Linux-x86_64.sh
# 也可以使用curl命令下载
$ curl -O https://mirrors.tuna.tsinghua.edu.cn/anaconda/miniconda/Miniconda3-latest-Linux-x86_64.sh
# 安装 Miniconda
$ bash Miniconda3-latest-Linux-x86_64.sh

During the installation process, some questions need to be answered, such as the installation path, whether to add Miniconda to environment variables, etc. After the installation is completed, the terminal needs to be restarted to make the environment variable take effect.

You can use the following command to verify that Miniconda is installed successfully:

$ conda --version

Configure Miniconda

Miniconda configuration files are stored in ~/.condarc. You can modify them manually by referring to the document, or you can use the conda config command to modify them.

In order to speed up package downloads, you can configure and use domestic mirror sources:

 # 配置清华镜像
$ conda config --add channels https://mirrors.tuna.tsinghua.edu.cn/anaconda/pkgs/free/
$ conda config --add channels https://mirrors.tuna.tsinghua.edu.cn/anaconda/pkgs/main/
$ conda config --add channels https://mirrors.tuna.tsinghua.edu.cn/anaconda/cloud/conda-forge/
$ conda config --add channels https://mirrors.tuna.tsinghua.edu.cn/anaconda/cloud/pytorch/
$ conda config --set show_channel_urls yes
# 查看~/.condarc配置
$ conda config --show-sources
channels:
  - https://mirrors.tuna.tsinghua.edu.cn/anaconda/cloud/pytorch/
  - https://mirrors.tuna.tsinghua.edu.cn/anaconda/cloud/conda-forge/
  - https://mirrors.tuna.tsinghua.edu.cn/anaconda/pkgs/main/
  - https://mirrors.tuna.tsinghua.edu.cn/anaconda/pkgs/free/
  - defaults
show_channel_urls: True

To accelerate the download of anaconda package, you can use mamba or micromamba instead of conda. These two tools are substitutes for conda and will cache the package version information and do not need to be checked every time the package is installed. This can effectively improve conda-forge and other larger ones. The method to install mamba or micromamba is as follows:

 # 安装mamba
$ conda install -n base -c conda-forge mamba
# 安装micromamba
$ conda install -n base -c conda-forge micromamba

Then you can use the mamba or micromamba command instead of the conda command.

Create a virtual environment

 # 创建虚拟环境，指定 Python 版本为 3.11
(base) $ conda create -n transformers python=3.11
# 激活 openai 环境
$ conda activate transformers

If there is no special description below, all of them will be carried out in the newly created openai environment here.

Jupyter Lab

Jupyter Lab is an interactive development environment that can run in a browser. It supports a variety of programming languages, including Python, R, Julia, etc. Jupyter Lab is provided by conda-forge. Please configure the image first and then install it using the following command:

(transformers) $ conda install jupyterlab

Hugging Face Transformers

Hugging Face Transformers is a natural language processing toolkit based on PyTorch and TensorFlow, which provides a large number of pre-trained models that can be used to complete a variety of NLP tasks. Hugging Face Transformers can be installed via conda:

(transformers) $ conda install -c huggingface transformers

Installation documentation: Hugging Face Transformers

Install tensorflow

Transformers need to use tensorflow for actual model reasoning. The following command installs the CPU and GPU versions of tensorflow:

(transformers) $ pip install tensorflow

If you are using a Mac, you can install Metal plug-in for the M1/M2 chip, and you can also try some smaller models:

(transformers) $ pip install tensorflow-metal

Installation documentation:

tensorflow
tensorflow-metal

Install pytorch

Transformers need to use pytorch for actual model reasoning. The pytorch and conda-forge image sources used have been configured in the previous step. You can use the following command to install the Pytorch version corresponding to the CUDA version:

 # Linux
# CUDA 11.8
(transformers) $ conda install pytorch torchvision torchaudio pytorch-cuda=11.8 -c nvidia
# CUDA 12.1
(transformers) $ conda install pytorch torchvision torchaudio pytorch-cuda=12.1 -c nvidia

# Mac
(transformers) $ conda install pytorch::pytorch torchvision torchaudio

Installation documentation: pytorch

Install other dependency packages

When processing images, audio and other data, other dependencies need to be used, including:

tqdm, eprogress progress bar
ffmpeg, ffmpeg-python audio processing tools
pillow image processing tool

(transformers) $ conda install tqdm iprogress ffmpeg ffmpeg-python pillow

Wish you progress in your studies

Expand

Additional Information

Version 1.0.0
Type AI Source Code
Update Time 2025-09-12
size 46.56MB
From Github

Related Applications

GitHub sgrebnov/cordova plugin background download

2024-11-05
Wa ch ull navra maza navsacha 2 2024 ull ovie Fr e Online On Strea ings

2024-11-03
Wa ch navra maza navsacha 2 2024 ull ovie Online For Fr e Strea ings At Home

2024-11-03
Wa ch the greatest of all time 2024 ull ovie Online For Fr e Strea ings At Home

2024-11-02
wolfs 2024 f llmo ie f lmyz lla dow load ree 7 0p 4 0p a d 10 0p

2024-11-01
GitHub the via/releases

2024-11-01

Recommended for You

chat.petals.dev

Other source code

1.0.0
GPT Prompt Templates

Other source code

1.0.0
GPTyped

Other source code

GPTyped 1.0.5
ML stack

AI Source Code

1.0.0
awesome free chatgpt

AI Source Code

1.0.0
pywin_contextmenu

AI Source Code

Version update
Google Dorks

Other source code

1.0
shepherd

Other source code

v6.1.6-react-shepherd: Prepare Release (#3063)
mongo express

Other source code

v1.1.0-rc-3

Related Information All