sentiment_analysis_fine_grain下載sentiment_analysis_fine

sentiment_analysis_fine_grain

Ai源碼

1.0.0

下載

介紹

使用此存儲庫，您將能夠使用Bert訓練多標籤分類

部署BERT進行在線預測。

您還可以找到有關如何與中文使用Bert的簡短教程：Bert Short Chinese教程

您可以從AI Challenger中找到對穀物情緒的介紹

基本思想

在這裡添加一些東西。

實驗新型號

有關更多信息，請檢查模型/bert_cnn_fine_grain_model.py

表現

模型	textcnn（無預告片）	textcnn（預處理）	bert（base_model_zh）	bert（base_model_zh，在語料庫上的預訓練）
F1得分	0.678	0.685	在這裡添加一個號碼	在這裡添加一個號碼

注意：F1分數在驗證集上報告

用法

BERT用於多標籤Classificaiton [用於微調和預訓練的數據]

 export BERT_BASE_DIR=BERT_BASE_DIR/chinese_L-12_H-768_A-12
export TEXT_DIR=TEXT_DIR
nohup python run_classifier_multi_labels_bert.py   
  --task_name=sentiment_analysis   
  --do_train=true   
  --do_eval=true  
  --data_dir=$TEXT_DIR   
  --vocab_file=$BERT_BASE_DIR/vocab.txt   
  --bert_config_file=$BERT_BASE_DIR/bert_config.json  
  --init_checkpoint=$BERT_BASE_DIR/bert_model.ckpt   
  --max_seq_length=512   
  --train_batch_size=4   
  --learning_rate=2e-5   
  --num_train_epochs=3   
  --output_dir=./checkpoint_bert &

1.首先，您需要從Google下載預訓練的模型，然後放入文件夾（egbert_base_dir）

 chinese_L-12_H-768_A-12 from <a href='https://storage.googleapis.com/bert_models/2018_11_03/chinese_L-12_H-768_A-12.zip'>bert</a>

2.第二，您需要擁有培訓數據（例如Train.TSV）和驗證數據（例如Dev.TSV），然後將其放在

 folder(e.g.TEXT_DIR ). you can also download data from here <a href='https://pan.baidu.com/s/1ZS4dAdOIAe3DaHiwCDrLKw'>data to train bert for AI challenger-Sentiment Analysis</a>.
  
 it contains processed data you can run for both fine-tuning on sentiment analysis and pre-train with Bert. 
  
 it is generated by following this notebook step by step:
  
 preprocess_char.ipynb 
  
 you can generate data by yourself as long as data format is compatible with 
  
 processor SentimentAnalysisFineGrainProcessor(alias as sentiment_analysis); 


 data format:  label1,label2,label3t here is sentence or sentencest
 
 it only contains two columns, the first one is target(one or multi-labels), the second one is input strings.
  
 no need to tokenized.
 
 sample:"0_1,1_-2,2_-2,3_-2,4_1,5_-2,6_-2,7_-2,8_1,9_1,10_-2,11_-2,12_-2,13_-2,14_-2,15_1,16_-2,17_-2,18_0,19_-2 浦东五莲路站，老饭店福瑞轩属于上海的本帮菜，交通方便，最近又重新装修，来拨草了，饭店活动满188元送50元钱，环境干净，简单。朋友提前一天来预订包房也没有订到，只有大堂，五点半到店基本上每个台子都客满了，都是附近居民，每道冷菜量都比以前小，味道还可以，热菜烤茄子，炒河虾仁，脆皮鸭，照牌鸡，小牛排，手撕腊味花菜等每道菜都很入味好吃，会员价划算，服务员人手太少，服务态度好，要能团购更好。可以用支付宝方便"
 
 check sample data in ./BERT_BASE_DIR folder 

 for more detail, check create_model and SentimentAnalysisFineGrainProcessor from run_classifier.py

基於開源模型的預訓練BERT模型，然後執行分類任務

生成原始數據：[在這裡添加一些東西]
確保每行都是句子。在每個文檔之間都有一個空白行。
您可以從zip文件中找到生成的數據。
```
 use write_pre_train_doc() from preprocess_char.ipynb 
```

使用以下方式生成用於培訓階段的數據：

 export BERT_BASE_DIR=./BERT_BASE_DIR/chinese_L-12_H-768_A-12
nohup python create_pretraining_data.py 
--input_file=./PRE_TRAIN_DIR/bert_*_pretrain.txt 
--output_file=./PRE_TRAIN_DIR/tf_examples.tfrecord 
--vocab_file=$BERT_BASE_DIR/vocab.txt 
--do_lower_case=True 
--max_seq_length=512 
--max_predictions_per_seq=60 
--masked_lm_prob=0.15 
--random_seed=12345 
--dupe_factor=5 nohup_pre.out &

帶有生成數據的預訓練模型：
python run_pretraining.py
微調
python run_classifier.py

textcnn

下載情感分析的緩存文件（令牌處於單詞級別）
訓練模型：
python train_cnn_fine_grain.py

 cache file of TextCNN model was generate by following steps from preprocess_word.ipynb. 
 
 it contains everything you need to run TextCNN.
 
 it include: processed train/validation/test set; vocabulary of word; a dict map label to index. 
 
 take train_valid_test_vocab_cache.pik and put it under folder of preprocess_word/
 
 raw data are also included in this zip file.

預訓練文本

帶有蒙版語言模型的預訓練文本
python train_cnn_lm.py
textcnn的微調
python train_cnn_fine_grain.py

部署BERT進行在線預測

 with session and feed style you can easily deploy BERT.

與伯特的在線預測，從這裡檢查更多

參考

雙向編碼器表示來自變形金剛的語言理解
Google-Research/Bert
Pengshuang/ai-comp
AI挑戰者2018
句子分類的捲積神經網絡

展開

附加信息

版本 1.0.0
類型 Ai源碼
更新時間 2025-09-06
大小 3.31MB
來自於 Github

相關應用

OpenCore_NO_ACPI_Build

2024-11-13
nspanel_pro_tools_apk

2024-11-12
YuQue_Book_Download

2024-11-12
zkwork_aleo_gpu_worker

2024-11-11
nextcloud_share_url_downloader

2024-11-01
麗華資料分析引擎免費版3.0_搜尋_導航_採集_輿情_排行_api

2022-06-28

爲您推薦

chat.petals.dev

其他源碼

1.0.0
GPT Prompt Templates

其他源碼

1.0.0
GPTyped

其他源碼

GPTyped 1.0.5
ML stack

Ai源碼

1.0.0
awesome free chatgpt

Ai源碼

1.0.0
pywin_contextmenu

Ai源碼

Version update
Google Dorks

其他源碼

1.0
shepherd

其他源碼

v6.1.6-react-shepherd: Prepare Release (#3063)
mongo express

其他源碼

v1.1.0-rc-3

相關資訊全部