K-BERT

Source code and datasets for "K-BERT: Enabling Language Representation with Knowledge Graph", implemented on top of the UER framework.

News

EasyNLP has integrated K-BERT. For details, see "EasyNLP集成K-BERT算法，借助知识图谱实现更优Finetune" ("EasyNLP integrates the K-BERT algorithm, using knowledge graphs for better fine-tuning").

Requirements

Software:

Python3
PyTorch >= 1.0
argparse == 1.1

Prepare

Download google_model.bin from here and save it to the models/ directory.
Download CnDbpedia.spo from here and save it to the brain/kgs/ directory.
Optional: download the datasets for evaluation from here, unzip them, and place them in the datasets/ directory.

The directory tree of K-BERT:

K-BERT
├── brain
│   ├── config.py
│   ├── __init__.py
│   ├── kgs
│   │   ├── CnDbpedia.spo
│   │   ├── HowNet.spo
│   │   └── Medical.spo
│   └── knowgraph.py
├── datasets
│   ├── book_review
│   │   ├── dev.tsv
│   │   ├── test.tsv
│   │   └── train.tsv
│   ├── chnsenticorp
│   │   ├── dev.tsv
│   │   ├── test.tsv
│   │   └── train.tsv
│    ...
│
├── models
│   ├── google_config.json
│   ├── google_model.bin
│   └── google_vocab.txt
├── outputs
├── uer
├── README.md
├── requirements.txt
├── run_kbert_cls.py
└── run_kbert_ner.py
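
Before training, it can help to confirm that the downloaded files landed where the tree above expects them. The following is a minimal sketch, not part of the K-BERT repository: the helper name `find_missing_files` and the `REQUIRED_FILES` list are assumptions made for illustration, and the list only covers the files named in the preparation steps and the models/ folder of the tree.

```python
from pathlib import Path

# Files named in the preparation steps and the directory tree above.
# Datasets are optional in the text, so they are not checked here.
REQUIRED_FILES = [
    "models/google_model.bin",
    "models/google_config.json",
    "models/google_vocab.txt",
    "brain/kgs/CnDbpedia.spo",
]


def find_missing_files(repo_root):
    """Return the required paths (relative to repo_root) that are not regular files."""
    root = Path(repo_root)
    return [rel for rel in REQUIRED_FILES if not (root / rel).is_file()]


if __name__ == "__main__":
    # Run from the K-BERT directory; "." is the assumed repository root.
    missing = find_missing_files(".")
    if missing:
        print("Missing files:")
        for rel in missing:
            print("  " + rel)
    else:
        print("All required files are in place.")
```

If a different knowledge graph is used (for example HowNet.spo), the corresponding entry in the list would need to be changed.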

K-BERT for text classification

Classification example

Run an example on Book review with CnDbpedia:

CUDA_VISIBLE_DEVICES='0' nohup python3 -u run_kbert_cls.py \
    --pretrained_model_path ./models/google_model.bin \
    --config_path ./models/google_config.json \
    --vocab_path ./models/google_vocab.txt \
    --train_path ./datasets/book_review/train.tsv \
    --dev_path ./datasets/book_review/dev.tsv \
    --test_path ./datasets/book_review/test.tsv \
    --epochs_num 5 --batch_size 32 --kg_name CnDbpedia \
    --output_model_path ./outputs/kbert_bookreview_CnDbpedia.bin \
    > ./outputs/kbert_bookreview_CnDbpedia.log &
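
The same run can also be started from Python, which is convenient when sweeping over datasets or knowledge graphs. The sketch below only reuses the flags and paths from the command above; the helper names `build_cls_command` and `launch` are hypothetical and not part of the repository. `start_new_session=True` is used as a rough stand-in for `nohup` and applies to POSIX systems only.

```python
import os
import subprocess


def build_cls_command(dataset_dir, kg_name, output_stem, epochs_num=5, batch_size=32):
    """Build the argv list for run_kbert_cls.py using the flags from the example."""
    return [
        "python3", "-u", "run_kbert_cls.py",
        "--pretrained_model_path", "./models/google_model.bin",
        "--config_path", "./models/google_config.json",
        "--vocab_path", "./models/google_vocab.txt",
        "--train_path", f"{dataset_dir}/train.tsv",
        "--dev_path", f"{dataset_dir}/dev.tsv",
        "--test_path", f"{dataset_dir}/test.tsv",
        "--epochs_num", str(epochs_num),
        "--batch_size", str(batch_size),
        "--kg_name", kg_name,
        "--output_model_path", f"./outputs/{output_stem}.bin",
    ]


def launch(command, log_path, gpu="0"):
    """Start the command in the background, writing stdout and stderr to log_path."""
    env = dict(os.environ, CUDA_VISIBLE_DEVICES=gpu)
    with open(log_path, "w") as log_file:
        # The child keeps its own copy of the log file descriptor,
        # so closing our handle on exit from the with-block is safe.
        return subprocess.Popen(
            command,
            stdout=log_file,
            stderr=subprocess.STDOUT,
            env=env,
            start_new_session=True,  # POSIX only: detach from the terminal session
        )
```

For example, `launch(build_cls_command("./datasets/book_review", "CnDbpedia", "kbert_bookreview_CnDbpedia"), "./outputs/kbert_bookreview_CnDbpedia.log")` would mirror the shell command when run from the K-BERT directory.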

Results:

Best accuracy in dev : 88.80%
Best accuracy in test: 87.69%

Options of run_kbert_cls.py:

usage: [--pretrained_model_path] - Path to the pre-trained model parameters.
       [--config_path] - Path to the model configuration file.
       [--vocab_path] - Path to the vocabulary file.
       --train_path - Path to the training dataset.
       --dev_path - Path to the validation dataset.
       --test_path - Path to the testing dataset.
       [--epochs_num] - The number of training epochs.
       [--batch_size] - Batch size of the training process.
       [--kg_name] - The name of the knowledge graph: "HowNet", "CnDbpedia" or "Medical".
       [--output_model_path] - Path to the output model.
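
In the option list above, bracketed options are optional and the three dataset paths are required. As an illustration of that convention, here is a sketch of an equivalent `argparse` parser. It is not the script's actual parser: the default values are assumptions copied from the example command, `./outputs/kbert_model.bin` is a hypothetical placeholder, and restricting `--kg_name` with `choices` is a choice made for this sketch.

```python
import argparse


def build_parser():
    """Sketch of a parser matching the option list above (defaults are assumptions)."""
    parser = argparse.ArgumentParser(description="K-BERT classification options (sketch)")
    # Optional in the usage text (shown in brackets); defaults taken from the example.
    parser.add_argument("--pretrained_model_path", default="./models/google_model.bin")
    parser.add_argument("--config_path", default="./models/google_config.json")
    parser.add_argument("--vocab_path", default="./models/google_vocab.txt")
    # Required in the usage text (no brackets).
    parser.add_argument("--train_path", required=True)
    parser.add_argument("--dev_path", required=True)
    parser.add_argument("--test_path", required=True)
    parser.add_argument("--epochs_num", type=int, default=5)
    parser.add_argument("--batch_size", type=int, default=32)
    parser.add_argument("--kg_name", choices=["HowNet", "CnDbpedia", "Medical"],
                        default="CnDbpedia")
    # Hypothetical placeholder path, not a default documented by the project.
    parser.add_argument("--output_model_path", default="./outputs/kbert_model.bin")
    return parser


# Parse an explicit argument list instead of sys.argv so the sketch runs anywhere.
args = build_parser().parse_args([
    "--train_path", "./datasets/book_review/train.tsv",
    "--dev_path", "./datasets/book_review/dev.tsv",
    "--test_path", "./datasets/book_review/test.tsv",
    "--kg_name", "HowNet",
])
print(args.kg_name, args.epochs_num, args.batch_size)
```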

Classification benchmarks

Accuracy (dev/test %) on different datasets:

Dataset	HowNet	CnDbpedia
Book_review	88.75/87.75	88.80/87.69
ChnSentiCorp	95.00/95.50	94.42/95.25
Shopping	97.01/96.92	96.94/96.73
Weibo	98.22/98.33	98.29/98.33
LCQMC	88.97/87.14	88.91/87.20
XNLI	77.11/77.07	76.99/77.43
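
The differences between the two knowledge graphs in this table are small, so a side-by-side comparison is easier to read programmatically. The sketch below copies the numbers from the table and reports, per dataset, which knowledge graph has the higher test accuracy; the function names are illustrative only.

```python
# Accuracy pairs (dev, test) in percent, copied from the benchmark table.
RESULTS = {
    "Book_review":  {"HowNet": (88.75, 87.75), "CnDbpedia": (88.80, 87.69)},
    "ChnSentiCorp": {"HowNet": (95.00, 95.50), "CnDbpedia": (94.42, 95.25)},
    "Shopping":     {"HowNet": (97.01, 96.92), "CnDbpedia": (96.94, 96.73)},
    "Weibo":        {"HowNet": (98.22, 98.33), "CnDbpedia": (98.29, 98.33)},
    "LCQMC":        {"HowNet": (88.97, 87.14), "CnDbpedia": (88.91, 87.20)},
    "XNLI":         {"HowNet": (77.11, 77.07), "CnDbpedia": (76.99, 77.43)},
}


def best_kg_on_test(results):
    """Map each dataset to the KG name(s) with the highest test accuracy (ties kept)."""
    best = {}
    for dataset, by_kg in results.items():
        top = max(test for _, test in by_kg.values())
        best[dataset] = sorted(kg for kg, (_, test) in by_kg.items() if test == top)
    return best


def mean_test_accuracy(results, kg):
    """Unweighted mean of the test accuracy of one KG across all datasets."""
    return sum(by_kg[kg][1] for by_kg in results.values()) / len(results)


for dataset, winners in best_kg_on_test(RESULTS).items():
    print(f"{dataset}: {' / '.join(winners)}")
for kg in ("HowNet", "CnDbpedia"):
    print(f"{kg}: mean test accuracy {mean_test_accuracy(RESULTS, kg):.2f}%")
```

On these numbers, HowNet has the higher test accuracy on Book_review, ChnSentiCorp and Shopping, CnDbpedia on LCQMC and XNLI, and the two tie on Weibo.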

K-BERT for named entity recognition (NER)

NER example

Run an example on the msra_ner dataset with CnDbpedia:

CUDA_VISIBLE_DEVICES='0' nohup python3 -u run_kbert_ner.py \
    --pretrained_model_path ./models/google_model.bin \
    --config_path ./models/google_config.json \
    --vocab_path ./models/google_vocab.txt \
    --train_path ./datasets/msra_ner/train.tsv \
    --dev_path ./datasets/msra_ner/dev.tsv \
    --test_path ./datasets/msra_ner/test.tsv \
    --epochs_num 5 --batch_size 16 --kg_name CnDbpedia \
    --output_model_path ./outputs/kbert_msraner_CnDbpedia.bin \
    > ./outputs/kbert_msraner_CnDbpedia.log &

Results:

The best in dev : precision=0.957, recall=0.962, f1=0.960
The best in test: precision=0.953, recall=0.959, f1=0.956
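
The reported F1 is the harmonic mean of precision and recall. As a quick sanity check, the sketch below recomputes it from the rounded precision and recall printed above; the function name `f1_score` is chosen for this example and is not taken from the repository.

```python
def f1_score(precision, recall):
    """Harmonic mean of precision and recall; 0.0 when both are zero."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Precision and recall copied from the reported dev and test results.
dev_f1 = f1_score(0.957, 0.962)
test_f1 = f1_score(0.953, 0.959)
print(f"dev  F1 = {dev_f1:.4f}")
print(f"test F1 = {test_f1:.4f}")
```

Because the inputs are already rounded to three decimals, the recomputed values can differ from the reported F1 in the last digit; both agree with the reported numbers to within 0.001.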

Options of run_kbert_ner.py:

usage: [--pretrained_model_path] - Path to the pre-trained model parameters.
       [--config_path] - Path to the model configuration file.
       [--vocab_path] - Path to the vocabulary file.
       --train_path - Path to the training dataset.
       --dev_path - Path to the validation dataset.
       --test_path - Path to the testing dataset.
       [--epochs_num] - The number of training epochs.
       [--batch_size] - Batch size of the training process.
       [--kg_name] - The name of the knowledge graph.
       [--output_model_path] - Path to the output model.

K-BERT for domain-specific tasks

Experimental results on domain-specific tasks (Precision/Recall/F1):

KG	finance_qa	law_qa	finance_ner	medicine_ner
HowNet	0.805/0.888/0.845	0.842/0.903/0.871	0.860/0.888/0.874	0.935/0.939/0.937
CN-DBpedia	0.814/0.881/0.846	0.814/0.942/0.874	0.860/0.887/0.873	0.935/0.937/0.936
MedicalKG	-	-	-	0.944/0.943/0.944

Acknowledgement

This work is a joint study with the support of Peking University and Tencent Inc.

If you use this code, please cite this paper:

@inproceedings{weijie2019kbert,
  title={{K-BERT}: Enabling Language Representation with Knowledge Graph},
  author={Weijie Liu and Peng Zhou and Zhe Zhao and Zhiruo Wang and Qi Ju and Haotang Deng and Ping Wang},
  booktitle={Proceedings of AAAI 2020},
  year={2020}
}

Additional information

Version: 1.0.0
Type: Other source code
Updated: 2025-04-17
Size: 12.08MB
Source: GitHub
