COPEN下載 - COPEN源代碼下載

COPEN

Ai源碼

1.0.0

下載

哥倫

EMNLP 2022論文的數據集和代碼“哥倫：探測預訓練的語言模型中的概念知識”。 Copen是一種概念知識門檻基準，旨在分析預訓練的語言模型（PLM）的概念理解能力。具體而言，哥倫族由三個任務組成：

概念相似性判斷（CSJ）。給定查詢實體和幾個候選實體，CSJ任務要求選擇最相似的候選實體與查詢實體。
概念性財產判斷（CPJ）。鑑於描述概念屬性的陳述，PLM需要判斷該陳述是否為真。
上下文（CIC）中的概念化。給定句子，句子中提到的實體以及實體的幾個概念鏈，PLM需要根據實體的上下文選擇最合適的概念。

對不同尺寸和類型的PLM類型的廣泛實驗表明，現有的PLM系統地缺乏概念知識，並且遭受了各種虛假相關性。我們認為，這是實現PLM中類似人類認知的關鍵瓶頸。需要更多的概念意識的目標或架構來開發概念知識淵博的PLM。

codalab

要獲得測試結果，您需要將結果提交給Codalab。

1。快速開始

代碼存儲庫基於Pytorch和Transformers 。請使用以下命令安裝所有必要的依賴性。 pip install -r requirements.txt

2。下載數據集

將副基準放置在Tsinghua Cloud上，請使用以下命令下載數據集並將其放置在預言路徑中。

 cd data/
wget --content-disposition https://cloud.tsinghua.edu.cn/f/f0b33fb429fa4575aa7f/ ? dl=1
unzip copen_data.zip
mkdir task1/data
mkdir task2/data
mkdir task3/data
mv copen_data/task1/ * task1/data
mv copen_data/task2/ * task2/data
mv copen_data/task3/ * task3/data

3。預處理數據集

探測

 cd task1
python probing_data_processor.py
cd ../
cd task2
python probing_data_processor.py
cd ../
cd task3
python probing_data_processor.py
cd ../

微調

python processor_utils.py task1 mc 
python processor_utils.py task2 sc
python processor_utils.py task3 mc

4。運行

探測

 cd code/probing
bash task1/run.sh 0 bert bert-base-uncased
bash task2/run.sh 0 bert bert-base-uncased
bash task3/run.sh 0 bert bert-base-uncased

微調

 cd code/finetuning
cd task1/ 
bash ../run.sh 0 bert bert-base-uncased task1 mc 42
cd task2/ 
bash ../run.sh 0 bert bert-base-uncased task2 sc 42
cd task3/ 
bash ../run.sh 0 bert bert-base-uncased task3 mc 42

5。引用

如果我們的代碼或基準對您有所幫助，請引用我們：

 @inproceedings{peng2022copen,
  title={COPEN: Probing Conceptual Knowledge in Pre-trained Language Models},
  author={Peng, Hao and Wang, Xiaozhi and Hu, Shengding and Jin, Hailong and Hou, Lei and Li, Juanzi and Liu, Zhiyuan and Liu, Qun},
  booktitle={Proceedings of EMNLP},
  year={2022}
}

展開

附加信息

版本 1.0.0
類型 Ai源碼
更新時間 2025-09-09
大小 10.16MB
來自於 Github

相關應用

ML stack

2025-07-01
awesome free chatgpt

2025-01-04
pywin_contextmenu

2025-08-31
promptl

2025-02-17
tick.chat

2025-09-16
FastLoRAChat

2025-09-03

爲您推薦

chat.petals.dev

其他源碼

1.0.0
GPT Prompt Templates

其他源碼

1.0.0
GPTyped

其他源碼

GPTyped 1.0.5
ML stack

Ai源碼

1.0.0
awesome free chatgpt

Ai源碼

1.0.0
pywin_contextmenu

Ai源碼

Version update
Google Dorks

其他源碼

1.0
shepherd

其他源碼

v6.1.6-react-shepherd: Prepare Release (#3063)
mongo express

其他源碼

v1.1.0-rc-3

相關資訊全部