TempoSum
1.0.0
该存储库包含本文的数据,LMS可以推广到将来的数据吗?关于文本摘要的经验分析
@inproceedings{temposum23,
title="{Can LMs Generalize to Future Data? An Empirical Analysis on Text Summarization}",
author = {Chi Seng Cheang and
Hou Pong Chan and
Derek F. Wong and
Xuebo Liu and
Zhaocong Li and
Yanming Sun and
Shudong Liu and
Lidia S. Chao},
booktitle = "Proceedings of the Conference on Empirical Methods in Natural Language Processing ({EMNLP})",
month = {December},
year = "2023",
publisher = "Association for Computational Linguistics",
}
运行以下命令以从HuggingFace数据集库加载数据集。
import datasets
# BBC in-distribution test set
dataset = datasets . load_dataset ( 'chiseng-cheang/TempoSum' , 'BBC_in-distribution' )
# BBC future test set
dataset = datasets . load_dataset ( 'chiseng-cheang/TempoSum' , 'BBC_future' )
# CNN in-distribution test set
dataset = datasets . load_dataset ( 'chiseng-cheang/TempoSum' , 'CNN_in-distribution' )
# CNN future test set
dataset = datasets . load_dataset ( 'chiseng-cheang/TempoSum' , 'CNN_future' )所有数据集也可在以下网址找到:https://drive.google.com/drive/folders/1bdetfqoea8gd240h78pgxbo68e53ea9e?usp = sharing