Summarization Papers
1.0.0
نظمتها Xiachong Feng.
Yichong Huang ، Haozheng Yang ، Jiaan Wang
مسار التعلم تلخيص (مع رابط) 

ICLR 2023 [PDF] [رمز]COLING 2022 [PDF]TACL 2022 [PDF]ACM Computing Surveys [PDF]IJCAI 2022, Survey Track [PDF]IJCAI21 [PDF]ICICT21 [PDF]Journal of King Saud University - Computer and Information Sciences [PDF]IJCAI20 [PDF]EMNLP 2022 Demo [PDF] [Demo]EMNLP 2021 [pdf] [demo]EMNLP 2021 Demo Track [pdf] [Demo]EMNLP 2022NAACL19 [pdf] [code]EMNLP 2022 [pdf] [code]Findings of ACL 2021 [pdf] [code]ACL2021 [pdf] [code]Findings of ACL 2021 [pdf] [code]EMNLP20 [pdf]COLING20 Short [pdf] [code]COLING20 [pdf]Findings of EMNLP [pdf]EMNLP20 Short [pdf] [code]EMNLP20 [pdf] [code]EMNLP20 [pdf] [code]EMNLP20 [pdf]Findings of EMNLP20 [pdf]ACL20 [pdf] [code]EMNLP19 [pdf]EMNLP19 [pdf] [code]EMNLP19 Workshop [pdf]EMNLP19 Short [pdf]ACL19 [pdf] [code]EMNLP18 [pdf] [code] NAACL21 [pdf] [code]Findings of EMNLP20 [pdf] [code]ACL19 [pdf]EMNLP19 [pdf] [code] | بطاقة تعريف | اسم | وصف | ورق | مؤتمر |
|---|---|---|---|---|
| 1 | CNN-DailyMail | أخبار | Abstractive Text Summarization using Sequence-to-sequence RNNs and Beyond | SIGNLL16 |
| 2 | New York Times | أخبار | The New York Times Annotated Corpus | |
| 3 | DUC | أخبار | The Effects Of Human Variation In DUC Summarization Evaluation | |
| 4 | Gigaword | أخبار | A Neural Attention Model For Abstractive Sentence Summarization | EMNLP15 |
| 5 | غرفة الأخبار | أخبار | Newsroom: A Dataset of 1.3 Million Summaries with Diverse Extractive Strategies | NAACL18 |
| 6 | Xsum | أخبار | Don't Give Me the Details, Just the Summary! Topic-Aware Convolutional Neural Networks for Extreme Summarization | EMNLP18 |
| 7 | Multi-News | Multi-document News | Multi-News: a Large-Scale Multi-Document Summarization Dataset and Abstractive Hierarchical Model | ACL19 |
| 8 | SAMSum | Multi-party conversation | SAMSum Corpus: A Human-annotated Dialogue Dataset for Abstractive Summarization | EMNLP19 |
| 9 | AMI | مقابلة | The AMI Meeting Corpus: A pre-announcement. | |
| 10 | ICSI | مقابلة | The ICSI Meeting Corpus | |
| 11 | MSMO | Multi-modal | MSMO: Multimodal Summarization with Multimodal Output | EMNLP18 |
| 12 | How2 | Multi-modal | How2: A Large-scale Dataset for Multimodal Language Understanding | NIPS18 |
| 13 | ScisummNet | Scientific paper | ScisummNet: A Large Annotated Corpus and Content-Impact Models for Scientific Paper Summarization with Citation Networks | AAAI19 |
| 14 | PubMed, ArXiv | Scientific paper | A Discourse-Aware Attention Model for Abstractive Summarization of Long Documents | NAACL18 |
| 15 | TALKSUMM | Scientific paper | TALKSUMM: A Dataset and Scalable Annotation Method for Scientific Paper Summarization Based on Conference Talks | ACL19 |
| 16 | BillSum | قانوني | BillSum: A Corpus for Automatic Summarization of US Legislation | EMNLP19 |
| 17 | LCSTS | Chinese Weibo | LCSTS: A Large Scale Chinese Short Text Summarization Dataset | EMNLP15 |
| 18 | WikiHow | Online Knowledge Base | WikiHow: A Large Scale Text Summarization Dataset | |
| 19 | Concept-map-based MDS Corpus | Educational Multi-document | Bringing Structure into Summaries : Crowdsourcing a Benchmark Corpus of Concept Maps | EMNLP17 |
| 20 | WikiSum | Wikipedia Multi-document | Generating Wikipedia By Summarizing Long Sequence | ICLR18 |
| واحد وعشرون | GameWikiSum | Game Multi-document | GameWikiSum : a Novel Large Multi-Document Summarization Dataset | LREC20 |
| إثنان وعشرون | En2Zh CLS, Zh2En CLS | Cross-Lingual | NCLS: Neural Cross-Lingual Summarization | EMNLP19 |
| ثلاثة وعشرين | Timeline Summarization Dataset | Baidu timeline | Learning towards Abstractive Timeline Summarization | IJCAI19 |
| 24 | Reddit TIFU | online discussion | Abstractive Summarization of Reddit Posts with Multi-level Memory Networks | NAACL19 |
| 25 | TripAtt | مراجعة | Attribute-aware Sequence Network for Review Summarization | EMNLP19 |
| 26 | Reader Comments Summarization Corpus | Comments-based Weibo | Abstractive Text Summarization by Incorporating Reader Comments | AAAI19 |
| 27 | BIGPATENT | براءة اختراع | BIGPATENT: A Large-Scale Dataset for Abstractive and Coherent Summarization | ACL19 |
| 28 | Curation Corpus | أخبار | Curation Corpus for Abstractive Text Summarisation | |
| 29 | MATINF | Multi-task | MATINF: A Jointly Labeled Large-Scale Dataset for Classification, Question Answering and Summarization | ACL20 |
| 30 | MLSUM | Multi-Lingual Summarization Dataset | MLSUM: The Multilingual Summarization Corpus | EMNLP20 |
| 31 | Dialogue(Debate) | Argumentative Dialogue Summary Corpus | Using Summarization to Discover Argument Facets in Online Idealogical Dialog | NAACL15 |
| 32 | WCEP | News Multi-document | A Large-Scale Multi-Document Summarization Dataset from the Wikipedia Current Events Portal | ACL20 Short |
| 33 | ArgKP | Argument-to-key Point Mapping | From Arguments to Key Points: Towards Automatic Argument Summarization | ACL20 |
| 34 | CRD3 | حوار | Storytelling with Dialogue: A Critical Role Dungeons and Dragons Dataset | 2020 |
| 35 | Gazeta | Russian news | Dataset for Automatic Summarization of Russian News | |
| 36 | عقل | English news recommendation, Summarization, Classification, Entity | MIND: A Large-scale Dataset for News Recommendation | ACL20 |
| 37 | public_meetings | french meeting(test set) | Align then Summarize: Automatic Alignment Methods for Summarization Corpus Creation | LREC |
| 38 | Enron | بريد إلكتروني | Building a Dataset for Summarization and Keyword Extraction from Emails | 2014 |
| 39 | Columbia | بريد إلكتروني | Summarizing Email Threads | 2004 |
| 40 | BC3 | بريد إلكتروني | A publicly available annotated corpus for supervised email summarization | |
| 41 | WikiLingua | Cross-Lingual | WikiLingua- A New Benchmark Dataset for Cross-Lingual Abstractive Summarization | Findings of EMNLP20 |
| 42 | LcsPIRT | Chinese Dialogue | Global Encoding for Long Chinese Text Summarization | TALLIP |
| 43 | CLTS,CLTS-plus | Chinese News | CLTS: A New Chinese Long Text Summarization Dataset CLTS+: A New Chinese Long Text Summarization Dataset with Abstractive Summaries | NLPCC20 |
| 44 | VMSMO | Multi-modal | VMSMO: Learning to Generate Multimodal Summary for Video-based News Articles | EMNLP20 |
| 45 | Multi-XScience | Multi-document | Multi-XScience: A Large-scale Dataset for Extreme Multi-document Summarization of Scientific Articles | EMNLP20 short |
| 46 | SCITLDR | Scientific Document | TLDR: Extreme Summarization of Scientific Documents | Findings of EMNLP20 |
| 47 | scisumm-corpus | Scientific Document | ||
| 48 | QBSUM | Query-Based Chinese | QBSUM: a Large-Scale Query-Based Document Summarization Dataset from Real-world Applications | Computer Speech & Language |
| 49 | qMDS | Query-Based Multi-Document | AQuaMuSe: Automatically Generating Datasets for Query-Based Multi-Document Summarization | |
| 50 | Liputan6 | Indonesian | Liputan6: A Large-scale Indonesian Dataset for Text Summarization | AACL20 |
| 51 | SportsSum | Sports Game | Generating Sports News from Live Commentary: A Chinese Dataset for Sports Game Summarization | AACL20 |
| 52 | WikiAsp | Aspect-based | WikiAsp: A Dataset for Multi-domain Aspect-based Summarization | Transaction of the ACL |
| 53 | DebateSum | دعوى | DebateSum:A large-scale argument mining and summarization dataset | ARGMIN 2020 |
| 54 | Open4Business | عمل | Open4Business (O4B): An Open Access Dataset for Summarizing Business Documents | Workshop on Dataset Curation and Security-NeurIPS 2020 |
| 55 | OrangeSum | فرنسي | BARThez: a Skilled Pretrained French Sequence-to-Sequence Model | |
| 56 | Medical Conversation | medical conversation | Summarizing Medical Conversations via Identifying Important Utterances | COLING20 |
| 57 | SumTitles | movie dialogue | SumTitles: a Summarization Dataset with Low Extractiveness | COLING20 |
| 58 | BANS | bengali news | Bengali Abstractive News Summarization (BANS): A Neural Attention Approach | TCCE-2020 |
| 59 | e-commerce | E-commerce | On the Faithfulness for E-commerce Product Summarization | COLING20 |
| 60 | TWEETSUM | تغريد | TWEETSUM: Event-oriented Social Summarization Dataset | COLING20 |
| 61 | فضاء | رأي | Extractive Opinion Summarization in Quantized Transformer Spaces | TACL |
| 62 | pn-summary | Persian | Leveraging ParsBERT and Pretrained mT5 for Persian Abstractive Text Summarization | csicc2021 |
| 63 | E-commerce1 desensitized | حوار | Topic-Oriented Spoken Dialogue Summarization for Customer Service with Saliency-Aware Topic Modeling | AAAI21 |
| 64 | E-commerce2 desensitized | حوار | Unsupervised Summarization for Chat Logs with Topic-Oriented Ranking and Context-Aware Auto-Encoders | AAAI21 |
| 65 | BengaliSummarization | Bengali | Unsupervised Abstractive Summarization of Bengali Text Documents | EACL21 |
| 66 | MediaSum | حوار | MediaSum: A Large-scale Media Interview Dataset for Dialogue Summarization | NAACL21 |
| 67 | Healthline and BreastCancer | multi-document | Nutri-bullets: Summarizing Health Studies by Composing Segments | AAAI21 |
| 68 | GOVREPORT | Long Government reports | Efficient Attentions for Long Document Summarization | NAACL21 |
| 69 | SSN | Scientific Paper | Enhancing Scientific Papers Summarization with Citation Graph | AAAI21 |
| 70 | MTSamples | طبي | Towards objectively evaluating the quality of generated medical summaries | |
| 71 | QMSum | Meeting, Query | QMSum: A New Benchmark for Query-based Multi-domain Meeting Summarization | NAACL21 |
| 72 | MS2 | Medical, Multi-Document | MS2: Multi-Document Summarization of Medical Studies | |
| 73 | SummScreen | Television Series | SummScreen: A Dataset for Abstractive Screenplay Summarization | ACL 2022 |
| 74 | SciDuet | Scientific Papers and Slides | D2S: Document-to-Slide Generation Via Query-Based Text Summarization | NAACL21 |
| 75 | MultiHumES | Multilingual | MultiHumES: Multilingual Humanitarian Dataset for Extractive Summarization | EACL21 |
| 76 | DialSumm | حوار | DialSumm: A Real-Life Scenario Dialogue Summarization Dataset | Findings of ACL21 |
| 77 | BookSum | Book, Long-form | BookSum: A Collection of Datasets for Long-form Narrative Summarization | |
| 78 | CLES | Chinese Weibo | A Large-Scale Chinese Long-Text Extractive Summarization Corpus | ICASSP |
| 79 | FacetSum | Scientific Paper | Bringing Structure into Summaries: a Faceted Summarization Dataset for Long Scientific Documents | ACL2021 short |
| 80 | ConvoSumm | حوار | ConvoSumm: Conversation Summarization Benchmark and Improved Abstractive Summarization with Argument Mining | ACL2021 |
| 81 | AgreeSum | Multi-document with entailment annotations | AgreeSum: Agreement-Oriented Multi-Document Summarization | Findings of ACL2021 |
| 82 | En2De | Cross-Lingual En2De | Cross-Lingual Abstractive Summarization with Limited Parallel Resources | ACL 2021 |
| 83 | VT-SSum | Spoken | VT-SSum: A Benchmark Dataset for Video Transcript Segmentation and Summarization | |
| 84 | AESLC | بريد إلكتروني | This Email Could Save Your Life: Introducing the Task of Email Subject Line Generation | ACL 2019 |
| 85 | XL-Sum | Cross-lingual | XL-Sum: Large-Scale Multilingual Abstractive Summarization for 44 Languages | Findings of ACL2021 |
| 86 | TES 2012-2016 | سقسقة | TSSuBERT: Tweet Stream Summarization Using BERT | |
| 87 | PENS | Personalized Headline | PENS: A Dataset and Generic Framework for Personalized News Headline Generation | ACL 2021 |
| 88 | XSum Hallucination Annotations | Factuality | On Faithfulness and Factuality in Abstractive Summarization | ACL 2020 |
| 89 | factuality-datasets | Factuality | Annotating and Modeling Fine-grained Factuality in Summarization | NAACL 2021 |
| 90 | صريح | Factuality | Understanding Factuality in Abstractive Summarization with FRANK: A Benchmark for Factuality Metrics | NAACL 2021 |
| 91 | TRIPOD | فيلم | Movie Summarization via Sparse Graph Construction | AAAI 2021 |
| 92 | AdaptSum | Low-Resource | AdaptSum: Towards Low-Resource Domain Adaptation for Abstractive Summarization | NAACL 2021 |
| 93 | PTS | منتج | Multi-Source Pointer Network for Product Title Summarization | CIKM 2018 |
| 94 | RAMDS | Reader-Aware | Reader-Aware Multi-Document Summarization: An Enhanced Model and The First Dataset | EMNLP 2017 Workshop |
| 95 | court judgment | court judgment | How to Write Summaries with Patterns? Learning towards Abstractive Summarization through Prototype Editing | EMNLP 2019 |
| 96 | ADEGBTS | gaze behaviors | A Dataset for Exploring Gaze Behaviors in Text Summarization | ACM MMSys'20 |
| 97 | MeQSum | طبي | On the Summarization of Consumer Health Questions | ACL 2019 |
| 98 | OpoSum | رأي | Summarizing Opinions: Aspect Extraction Meets Sentiment Prediction and They Are Both Weakly Supervised | EMNLP 2018 |
| 99 | MM-AVS | Multi-modal | Multi-modal Summarization for Video-containing Documents | NAACL 2021 |
| 100 | WikiCatSum | multi-doc | Generating Summaries with Topic Templates and Structured Convolutional Decoders | ACL 2019 |
| 101 | SDF-TLS | الجدول الزمني | Summarize Dates First: A Paradigm Shift in Timeline Summarization | SIGIR 2021 |
| 102 | RWS-Cit | *Automatic generation of related work through summarizing citations | 2017 | |
| 103 | MTLS | الجدول الزمني | Multi-TimeLine Summarization (MTLS): Improving Timeline Summarization by Generating Multiple Summaries | ACL 2021 |
| 104 | EMAILSUM | بريد إلكتروني | EmailSum: Abstractive Email Thread Summarization | ACL 2021 |
| 105 | WikiSum | WikiHow | WikiSum: Coherent Summarization Dataset for Efficient Human-Evaluation | ACL 2021 Short |
| 106 | SumPubMed | PubMed Scientific Article | SumPubMed: Summarization Dataset of PubMed Scientific Articles | ACL 2021 Student Research Workshop |
| 107 | MLGSum | Multi-lingual | Contrastive Aligned Joint Learning for Multilingual Summarization | ACL 2021 Findings |
| 108 | SMARTPHONE,COMPUTER | منتج | CUSTOM: Aspect-Oriented Product Summarization for E-Commerce | |
| 109 | CSDS | Customer Service Dialogue | CSDS: A Fine-grained Chinese Dataset for Customer Service Dialogue Summarization | EMNLP 2021 |
| 110 | persian-dataset | persian | ARMAN: Pre-training with Semantically Selecting and Reordering of Sentences for Persian Abstractive Summarization | |
| 111 | StreamHover | spoken livestream | StreamHover: Livestream Transcript Summarization and Annotation | EMNLP 2021 |
| 112 | CNewSum | أخبار | CNewSum: A Large-scale Chinese News Summarization Dataset with Human-annotated Adequacy and Deducibility Level | NLPCC 2021 |
| 113 | MiRANews | news, factual | MiRANews: Dataset and Benchmarks for Multi-Resource-Assisted News Summarization | EMNLP 2021 Findings |
| 114 | HowSumm | query multi-doc | HowSumm: A Multi-Document Summarization Dataset Derived from WikiHow Articles | |
| 115 | SportsSum2.0 | الرياضة | SportsSum2.0: Generating High-Quality Sports News from Live Text Commentary | |
| 116 | CoCoSum | opinion multi-ref | Comparative Opinion Summarization via Collaborative Decoding | |
| 117 | MReD | Controllable | MReD: A Meta-Review Dataset for Controllable Text Generation | |
| 118 | MSˆ2 | Multi-Document, Medical | MSˆ2: Multi-Document Summarization of Medical Studies | EMNLP 2021 |
| 119 | MassiveSumm | MassiveSumm: a very large-scale, very multilingual, news summarisation dataset | EMNLP 2021 | |
| 120 | XWikis | multilingual | Models and Datasets for Cross-Lingual Summarisation | EMNLP 2021 |
| 121 | SUBSUME | Intent, subjective | SUBSUME: A Dataset for Subjective Summary Extraction from Wikipedia Documents | EMNLP 2021 newsum |
| 122 | TLDR9+ | TLDR9+: A Large Scale Resource for Extreme Summarization of Social Media Posts | EMNLP 2021 newsum | |
| 123 | 20 Minuten | الألمانية | A New Dataset and Efficient Baselines for Document-level Text Simplification in German | EMNLP 2021 newsum |
| 124 | WSD | multi-lingual | A Novel Wikipedia based Dataset for Monolingual and Cross-Lingual Summarization | EMNLP 2021 newsum |
| 125 | TEDSummary | خطاب | Attention-based Multi-hypothesis Fusion for Speech Summarization | |
| 126 | SummaC Benchmark | Factual, NLI | SummaC: Re-Visiting NLI-based Models for Inconsistency Detection in Summarization | |
| 127 | ForumSum | محادثة | ForumSum: A Multi-Speaker Conversation Summarization Dataset | EMNLP 2021 Findings |
| 128 | K-SportsSum | الرياضة | Knowledge Enhanced Sports Game Summarization | WSDM 2022 |
| 129 | Test-Amazon | Opinion, New test for Amazon reviews | Unsupervised Opinion Summarization as Copycat-Review Generation | ACL 2020 |
| 130 | Test-Amazon-Yelp | Opinion, New test for Amazon(180) and Yelp(300) | Few-Shot Learning for Opinion Summarization | EMNLP 2020 |
| 131 | AmaSum | رأي | Learning Opinion Summarizers by Selecting Informative Reviews | EMNLP 2021 |
| 132 | CrossSum | Cross lingual | CrossSum: Beyond English-Centric Cross-Lingual Abstractive Text Summarization for 1500+ Language Pairs | |
| 133 | HCSCL-MSDataset | Multi-modal | Hierarchical Cross-Modality Semantic Correlation Learning Model for Multimodal Summarization | AAAI 2022 |
| 134 | Klexikon | الألمانية | Klexikon: A German Dataset for Joint Summarization and Simplification | |
| 135 | TODSum | Customer Service | TODSum: Task-Oriented Dialogue Summarization with State Tracking | |
| 136 | TWEETSUMM | Customer Service | TWEETSUMM - A Dialog Summarization Dataset for Customer Service | Findings of EMNLP 2021 |
| 137 | PeerSum | Multi-document, Scientific | PeerSum: A Peer Review Dataset for Abstractive Multi-document Summarization | |
| 138 | Celebrity TS, Event TS, Wiki TS | Timeline, person, event | Follow the Timeline! Generating Abstractive and Extractive Timeline Summary in Chronological Order | TOSI 2022 |
| 139 | Chart-to-Text | جدول | Chart-to-Text: A Large-Scale Benchmark for Chart Summarization | |
| 140 | GovReport-QS | Long Document | HIBRIDS: Attention with Hierarchical Biases for Structure-aware Long Document Summarization | ACL 2022 |
| 141 | EntSUM | كيان | EntSUM: A Data Set for Entity-Centric Summarization | ACL 2022 |
| 142 | ALLSIDES | Framing Bias | NeuS: Neutral Multi-News Summarization for Mitigating Framing Bias | ACL 2022 |
| 143 | GRAPHELSUMS | graph | Summarization with Graphical Elements | |
| 144 | Annotated-Wikilarge-Newsela | Factuality | Evaluating Factuality in Text Simplification | ACL 2022 |
| 145 | WikiMulti | Cross-lingual | WikiMulti: a Corpus for Cross-Lingual Summarization | |
| 146 | Welsh | Introducing the Welsh Text Summarisation Dataset and Baseline Systems | ||
| 147 | SuMe | Biomedical | SuMe: A Dataset Towards Summarizing Biomedical Mechanisms | LREC 2022 |
| 148 | CiteSum | CiteSum: Citation Text-guided Scientific Extreme Summarization and Low-resource Domain Adaptation | ||
| 148 | MSAMSum | حوار | MSAMSum: Towards Benchmarking Multi-lingual Dialogue Summarization | ACL 2022 DialDoc |
| 149 | SQuALITY | Long-Document | SQuALITY: Building a Long-Document Summarization Dataset the Hard Way | EMNLP 2022 |
| 150 | X-SCITLDR | X-SCITLDR: Cross-Lingual Extreme Summarization of Scholarly Documents | JCDL 2022 | |
| 151 | NEWTS | أخبار | NEWTS: A Corpus for News Topic-Focused Summarization | |
| 152 | EntSUM | كيان | EntSUM: A Data Set for Entity-Centric Extractive Summarization | ACL 2022 |
| 153 | ASPECTNEWS | ASPECTNEWS: Aspect-Oriented Summarization of News Documents | ACL 2022 | |
| 154 | RNSum | Commit Logs | RNSum: A Large-Scale Dataset for Automatic Release Note Generation via Commit Logs Summarization | ACL 2022 |
| 155 | AnswerSumm | query multi-doc | AnswerSumm: A Manually-Curated Dataset and Pipeline for Answer Summarization | NAACL 2022 |
| 156 | CHQ-Summ | CHQ-Summ: A Dataset for Consumer Healthcare Question Summarization | ||
| 157 | Multi-LexSum | multi-doc | Real-World Summaries of Civil Rights Lawsuits at Multiple Granularities | |
| 158 | DACSA | Catalan and Spanish | DACSA: A large-scale Dataset for Automatic summarization of Catalan and Spanish newspaper Articles | NAACL 2022 |
| 159 | BigSurvey | Academic Multi-doc | Generating a Structured Summary of Numerous Academic Papers: Dataset and Method | IJCAI 2022 |
| 160 | CSL | Chinese, Academic | CSL: A Large-scale Chinese Scientific Literature Dataset | COLING 2022 |
| 161 | PCC Summaries | الألمانية | Extractive Summarisation for German-language Data: A Text-level Approach with Discourse Features | COLING 2022 |
| 162 | LipKey | abstractive summaries, absent keyphrases, and titles | LipKey: A Large-Scale News Dataset for Absent Keyphrases Generation and Abstractive Summarization | COLING 2022 |
| 163 | PLOS | Lay summary of biomedical journal articles | Making Science Simple: Corpora for the Lay Summarisation of Scientific Literature | EMNLP 2022 |
| 164 | eLife | Lay summary of biomedical journal articles | Making Science Simple: Corpora for the Lay Summarisation of Scientific Literature | EMNLP 2022 |
| 165 | ECTSum | Long Earnings Call Transcripts | ECTSum: A New Benchmark Dataset For Bullet Point Summarization of Long Earnings Call Transcripts | EMNLP 2022 |
| 166 | EUR-Lex-Sum | Multi- and Cross-lingual Legal | EUR-Lex-Sum: A Multi- and Cross-lingual Dataset for Long-form Summarization in the Legal Domain | EMNLP 2022 |
| 167 | CrisisLTLSum | الجدول الزمني | CrisisLTLSum: A Benchmark for Local Crisis Event Timeline Extraction and Summarization | |
| 168 | LANS( upon request ) | عربي | LANS: Large-scale Arabic News Summarization Corpus | |
| 169 | MACSUM | Controllable News Dialogue | MACSUM: Controllable Summarization with Mixed Attributes | |
| 170 | NarraSum | رواية | NarraSum: A Large-Scale Dataset for Abstractive Narrative Summarization | EMNLP Findings 2022 |
| 171 | LoRaLay | Long Scientific Visual | LoRaLay: A Multilingual and Multimodal Dataset for Long Range and Layout-Aware Summarization | EACL 2023 |
| 172 | HunSum-1 | المجري | HunSum-1: an Abstractive Summarization Dataset for Hungarian | |
| 173 | MCLS | ultimodal Cross-Lingual | Assist Non-native Viewers: Multimodal Cross-Lingual Summarization for How2 Videos | EMNLP 2022 |
| 174 | JDDC 2.1 | multimodal | JDDC 2.1: A Multimodal Chinese Dialogue Dataset with Joint Tasks of Query Rewriting, Response Generation, Discourse Parsing, and Summarization | EMNLP 2022 |
| 175 | CroCoSum | Code-switched Cross-lingual | CroCoSum: A Benchmark Dataset for Cross-Lingual Code-Switched Summarization | |
| 176 | unarXive | scholarly | unarXive: a large scholarly data set with publications' full-text, annotated in-text citations, and links to metadata | Scientometrics 2020 |
| 177 | TempoSum | TempoSum: Evaluating the Temporal Generalization of Abstractive Summarization | ||
| 178 | VCSUM | مقابلة | VCSUM: A Versatile Chinese Meeting Summarization Dataset | ACL Findings 2023 |
| 179 | MeetingBank | مقابلة | MeetingBank: A Benchmark Dataset for Meeting Summarization | ACL 2023 |
ACL 2023 [pdf] [data]EMNLP 2022 [pdf] [data]Findings of EMNLP 2021 [pdf] [data]EMNLP 2021 Findings [pdf] [data]EMNLP 2021 [pdf] [data]ACL 2021 [pdf] [data]Findings of ACL21 [pdf] [data]ACL 2021 [pdf] [code]NAACL21 [pdf] [code]NAACL21 [pdf] [data]ACL20 [pdf] [data]COLING20 [pdf] [code]COLING20 [pdf] [code]EMNLP 2021 [pdf][code]ACL 2022 [pdf] [data]EMNLP19 [pdf] [data]Findings of ACL 2022 [pdf]ACL 2021 [pdf] [data]ACL 2020 [pdf] [code] [bib]AACL20 [pdf]ACL 2019 [pdf] [data] [bib]LREC 2014 [pdf]AAAI 2008 [pdf]WWW 2007 [pdf]ACL 2004 [pdf]NAACL 2004 [pdf] [bib]Recent advances in natural language processing III: selected papers from RANLP [pdf]Proceedings of the 8th international conference on Intelligent user interfaces [pdf]Proceedings of the ACL 2001 Workshop on Computational Natural Language Learning (ConLL) 2001 [pdf] [bib]Findings of ACL 2023 [pdf]ACL 2023 [pdf] [code]Findings of ACL 2023 [pdf] [data]ICASSP 2023 [pdf]AACL-IJCNLP 2022 [pdf] [demo]Interspeech 2022 [pdf]LREC 2022 [pdf] [data]Findings of NAACL 2022 [pdf]ACL 2022 [pdf] [code]EMNLP 2021 Findings [pdf]EMNLP 2021| newsum [pdf]Findings of EMNLP 2021 Short [pdf]AAAI 2022 [pdf] [code]SummDial@SIGDial 2021 [pdf]SIGIR 2021 [pdf]INTERSPEECH 2020 [pdf] [code]SIGDIAL 2008 [pdf]INTERSPEECH 2010 [pdf]2008 IEEE Spoken Language Technology Workshop [pdf]2009 IEEE International Conference on Acoustics, Speech and Signal Processing [pdf]SLT 2008 [pdf]WWW 2015 [pdf]ACL 2017 workshop [pdf] [bib]ACL18 [pdf] [code]RCIS 2020 [pdf]IWANN 2019 [pdf]Proceedings of the 2015 ACM Symposium on Document Engineering, DocEng' 2015 [pdf]LREC 2020 [pdf] [bib]IJCAI21 [pdf] [code]COLING20 Short [pdf] [code]KSEM 2020 [pdf] [code]SLT18 [pdf] [code]Findings of EMNLP20 [pdf] [code] [unofficial-code]WWW19 [pdf]ACL19 [pdf]GIFT18 [pdf]ICMI16 [pdf]ICME 2003 [pdf]EACL21 [pdf]SPECOM 2020 [pdf]SIGDIAL 2012 [pdf]NAACL21 [pdf] [data]ACL 2013 [pdf]ACL 2011 [pdf]SIGDIAL 2009 [pdf] [bib]ACL2021 [pdf] [code]Shen Gao, Xin Cheng, Mingzhe Li, Xiuying Chen, Jinpeng Li, Dongyan Zhao, Rui Yan [pdf] [code]COLING 2022 [pdf]EMNLP 2021 Findings [pdf] [code]EMNLP 2021| newsum [pdf]EMNLP 2021 [pdf]EMNLP 2021 [pdf][code]EMNLP 2021 Findings [pdf] [code]EMNLP 2021 Findings [pdf]EMNLP 2021 [pdf] [code]Interspeech 2021 [pdf]Knowledge-Based Systems [pdf]ACL 2021 Student Research Workshop [pdf] [tool] [data]SIGDIAL 2021 [pdf]CCL 2021 [pdf]ICASSP21 [pdf]Findings of ACL 2021 [pdf]ACL-Findings 2021 [pdf] [code]NAACL21 [pdf] [code]TACL 2021 [pdf]COLING20 [pdf]EMNLP20 [pdf] [code]EMNLP19 [pdf] [data]KDD 2022 [pdf]KDD 2022 ADS Track [pdf]Findings of EMNLP 2021 [pdf]NAACL | NLPMC 2021 [pdf1] [pdf2]ACL 2021 [pdf] [code]COLING 2020 [pdf] [code] [bib]Findings of EMNLP 2020 [pdf] [bib]Advanced Information Systems Engineering Workshops 2020 [pdf] [bib]ACL 2020 Short [pdf] [bib]NAACL 2019 [pdf] [bib]LREC 2020 [pdf] [bib]ASRU 2019 [pdf]ACL 2022 [pdf] [code]NAACL 2022 [pdf]Findings of ACL 2022 [pdf] [code]TASLP [pdf]EMNLP 2021 [pdf] [data]SIGIR 2021 [pdf]AAAI21 [pdf]AAAI21 [pdf] [code]AAAI21 [pdf] [code]KDD19 [pdf]NAACL 2022 [pdf] [code]NAACL21 [pdf] [code]ACL2010 Workshop [pdf]EMNLP 2022 [pdf] [code]EMNLP 2022 [pdf]EMNLP 2022 [pdf]EACL 2023 [pdf] [code]Findings of EACL 2023 [pdf] [code]Findings of ACL 2023EMNLP 2022 [pdf] [code]Transcript Understanding Workshop at COLING 2022 [pdf]COLING 2022 [pdf]COLING 2022 [pdf] [code]COLING 2022 [pdf]EMNLP 2022 [pdf] [code]LREC 2022 [pdf]Findings of NAACL 2022 [pdf]NAACL 2022 Industry Track [pdf]NAACL 2022 Student Research Workshop [pdf] [code]NAACL 2022 [pdf] [code]NAACL 2022 [pdf] [code]NAACL 2022 [pdf]WIT Workshop @ ACL2022 [pdf] [code]ACL 2022 DialDoc Workshop [pdf] [data]Findings of NAACL 2022 [pdf] [code]EMNLP 2021| newsum [pdf]AKBC 2021 [pdf] [code]EMNLP 2021 [pdf] [code]EMNLP 2021 Short [pdf]UIST 2021 [pdf]ACL 2021 [pdf] [code]ACL 2021 [pdf] [code]TREC 2020 Podcasts Track [pdf]CIKM 2019 [pdf]WSDM 2018 [pdf]SemDial 2017 [pdf]SIGDIAL 2016 [pdf]COLING 2004 [pdf] [bib] Switchboard dialogues EACL 2023 [pdf] [code]EACL 2023 [pdf] [code]EMNLP 2022 [pdf] [code]EMNLP 2022 [pdf] [code]COLING 2022 [pdf] [code]COLING 2022 [pdf] [code]EMNLP 2022 [pdf]COLING2022 [pdf] [code]SustaiNLP at EMNLP 2020 [pdf]COLING 2022 [pdf] [code]ACM Computing Surveys [pdf]ACL 2022 [pdf] [code]AAAI 2022 [pdf]EMNLP 2022 [pdf]EMNLP 2022EMNLP 2022 [pdf] [data]ACL 2022 [pdf] [code] [data]ACL 2022 [pdf]ACL 2022 [pdf] [code]EMNLP 2021 Findings [pdf]EMNLP 2021 short paper [pdf] [code]ACL 2021 short [pdf] [data]NAACL21 [pdf] [code]ACL 2021 [pdf]EACL 2021 [pdf]EACL21 [pdf] [code]AAAI 2021 [pdf] [code]NAACL 2021 [pdf] [code] [data]ACL 2021 Student Research Workshop [pdf]SDP EMNLP 2020 [pdf]SDU21 [pdf] [code]AACL20 [pdf [code]EMNLP20 [pdf]COLING20 [pdf]EMNLP20 Short [pdf] [data]IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING [pdf]Findings of EMNLP20 [pdf] [data]EMNLP19 [pdf] [code]AAAI19 [pdf] [data]ACL19 [pdf] [data]NAACL18 [pdf] [data] Toolkit: factsumm
EACL 2023 [pdf] [code]EACL 2023 [pdf] [code]EMNLP 2022 [pdf] [code]EMNLP 2022 [pdf] [code]EMNLP 2022 [pdf]EACL 2023 [pdf] [code]AACL 2022 [pdf]EMNLP 2022 [pdf] [code]EMNLP 2022 [pdf] [code]EMNLP 2022 [pdf]EMNLP 2022 [pdf] [data]EMNLP 2022 [pdf] [code]EMNLP 2022 [pdf] [code]COLING 2022 [pdf] [code]EMNLP 2022 [pdf]Findings of NAACL 2022 [pdf]Findings of NAACL 2022 [pdf] [code]Findings of NAACL 2022 [pdf] [code]NAACL 2022 [pdf] [code]NAACL 2022 findings [pdf]TACL 2022 Volume 10 [pdf] [code]ACL 2022 [pdf] [code]NAACL 2022 [pdf] [code]NAACL 2022 Findings [pdf] [code]ACL 2022 [pdf] [code]NAACL 2022 [pdf] [code]NAACL 2022 [pdf]NAACL 2022 [pdf] [code]EMNLP 2021 Findings [pdf] [code]NAACL 2022 [pdf]EMNLP2021 Findings [pdf] [data]EMNLP 2021 [pdf] [code]ACL 2022 [pdf] [code]EMNLP 2021 Findings [pdf] [code]ACL 2021 Proceedings of The 4th Workshop on e-Commerce and NLP [pdf]Findings of ACL 2021 [pdf] [data]ACL 2021 [pdf]ACL 2021 [pdf] [code]EACL21 [pdf] [code]NAACL21 [pdf] [code]NAACL21 [pdf] [code]NAACL21 [pdf] [code]NAACL21 [pdf]EACL21 [pdf] [code]COLING20 [pdf] [code]NAACL21 [pdf] [code]EMNLP | Eval4NLP 20 [pdf]Findings of ACL 2021 [pdf]EMNLP20 short [pdf] [code]EMNLP20 [pdf]EMNLP20 [pdf]EMNLP20 [pdf] [code]Findings of EMNLP [pdf]ACL20 [pdf] [data]ACL20 [pdf]ACL20 [pdf]ACL20 [pdf] [code]ACL20 [pdf] [code]ACL20 [pdf]NIPS19 [pdf]KDD19 [pdf]ACL19 [pdf] [data]COLING18 [pdf] [code]AAAI18 [pdf]COLING 2022 [pdf] [code]EMNLP 2021 [pdf] [code]AAAI 2022 [pdf]](https://arxiv.org/abs/2109.03481)Findings of EMNLP 2021 Short [pdf]ACL 2021 short [pdf] [code]ICLR 2021 [pdf]AAAI 2019 [pdf] [code]EMNLP 2020 [pdf] [code]EMNLP 2019 [pdf] [code] COLING 2022 [pdf] [code]COLING 2022 [pdf] [code]COLING 2022 [pdf] [code]APWeb-WAIM2022 [pdf]NAACL 2022 [pdf] [code]NAACL 2022 [pdf] [code]NAACL 2022 [pdf] [code]NAACL 2022 [pdf] [code]NAACL 2022 [pdf]AAAI 2022 [pdf] [code]ECIR 2022 [pdf] [code]AAAI 2022 [pdf]EMNLP 2021| newsum [pdf]EMNLP 2021 New Frontiers in Summarization Workshop [pdf]EMNLP 2021 [pdf] [code]EMNLP 2021 [pdf] [code]ACL 2021 [pdf] [code]Findings of ACL 2021 [pdf]EACL21 [pdf] [code]ACL 2021 demo [pdf] [data]Findings of ACL 2021 Oleg Vasilyev, John Bohannon [pdf]EACL21 [pdf] [code]COLING20 [pdf] [bib]EMNLP20 [pdf] [code]ACL19 [pdf] [code] AAAI 2023 [pdf] [code]EMNLP 2022 [pdf] [code]COLING 2022 [pdf] [code]COLING 2022 [pdf] [code]IJCAI 2022 [pdf] [data]NAACL 2022 [pdf] [code]NAACL 2022 [pdf] [code]ACL 2022 [pdf]ACL 2022 [pdf] [code]ACL 2022 [pdf] [code]NAACL 2022 [pdf] [code]NAACL 2022 [pdf] [code]Findings of ACL 2022 [pdf] [code]ACL 2022 [pdf] [code]ACL 2022 [pdf] [code]EMNLP 2021 [pdf] [data]EMNLP 2021 [pdf] [code]Findings of EMNLP 2021 [pdf]EMNLP 2021|newsum [pdf]ACL 2021 [pdf] [data]ACL 2021 Findings [pdf]ACL 2021 Findings [pdf] [code]ACL 2021 [pdf] [code]Findings of ACL 2021 [pdf] [data]NAACL21 [pdf] [code]NAACL21 [pdf] [code]EACL 2021 [pdf]AAAI21 [pdf] [code]COLING20 [pdf]Findings of EMNLP [pdf] [code]EMNLP20 [pdf] [code] [code]COLING20 Short [pdf] [code]EMNLP20 [pdf] [code]ACL20 [pdf] [code]ACL20 [pdf]ACL20 [pdf] [code]ACL20 [pdf] [code]WWW20 [pdf] [code]EMNLP19 [pdf]ACL19 [pdf] [code]ACL19 [pdf] [code]ICML19 [pdf] [code]ICLR18 [pdf] [code]EMNLP18 [pdf] [code]CoNLL17 [pdf]AAAI17 [pdf]COLING16 [pdf]ACL04 [pdf] ACL 2023 [pdf]EMNLP 2022 [pdf] [code]TACL 2022 [pdf]ACL 2022 DialDoc Workshop [pdf] [data]ACL 2022 [pdf]ACL 2022 [pdf] [code]AAAI 2022 [pdf] [code]EMNLP 2021| newsum [pdf]EMNLP 2021 [pdf] [data]EMNLP 2021 [pdf] [code]ACL 2021 Findings [pdf] [data]Findings of ACL 2021 [pdf] [data]Findings of ACL 2021 [pdf] [code]Findings of ACL 2021 [pdf]ACL 2021 [pdf] [code]EACL21 [pdf] [code]EACL21 [pdf] [data]AACL20 [pdf]Findings of EMNLP20 [pdf] [data]ACL20 workshop [pdf] [code]ACL20 [pdf]ACL20 [pdf] [code]AAAI20 [pdf] [code]AAAI 2020 [pdf] [code]EMNLP19 workshop [pdf] [data]EMNLP19 [pdf] [code]ACL19 [pdf] [code]NAACL19 [pdf]ACIIDS19 [pdf]TASLP18 [pdf]NLDB18 [pdf]TASLP16 [pdf]EMNLP15 [pdf]MultiLing13 [pdf]ACL11 [pdf]ACL10 [pdf]LREC08 [pdf]Findings ACL 2023 [pdf]IJCAI 2023 [pdf]EMNLP 2022 [pdf] [data]AAAI 2022 [pdf]AAAI 2022 [pdf] [data]EMNLP 2021 [pdf] [code]SIGIR 2021 [pdf]ACL21 [pdf] [code]ICMR21 [pdf]COLING20 [pdf]EMNLP20 [pdf]EMNLP20 Workshop [pdf] [code]EMNLP20 [pdf] [data]ECIR20 [pdf]AAAI20 [pdf] [code]AAAI20 [pdf]AAAI20 [pdf]ACL19 [pdf]ACL19 [pdf]EMNLP18 [pdf] [data]EMNLP18 [pdf]IJCAI18 [pdf]NIPS18 [pdf] [data]GIFT18 [pdf]EMNLP17 [pdf]ICMI16 [pdf]LREC12 [pdf]LREC04 [pdf]ICME03 [pdf] EMNLP 2022 [pdf] [code]COLING20 [pdf] [code] [bib]SIGIR20 [pdf] [code]IJCAI18 [pdf]IEEE17 [pdf]ICSP16 [pdf] ACL 2023 [pdf] [code]EMNLP 2021 Findings [pdf] [code]EMNLP 2021 [pdf] [code]SIGIR 2021 [pdf]ICML 2021 [pdf] [code]COLING20 [pdf] [code]Findings of EMNLP [pdf] [code]EMNLP20 [pdf] [code]EMNLP20 [pdf]Findings of EMNLP20 [pdf]ACL20 Short [pdf]ICML20 [pdf] [code]LREC20 [pdf]ICML20 [pdf] [code]EMNLP19 [pdf] [code]ACL19 [pdf]ICML19 [pdf] [code]NIPS19 [pdf] [code]ACL19 [pdf] [code]EMNLP 2022 [pdf] [code]EMNLP 2022 [pdf] [code]Findings of EMNLP 2022 [pdf]ACL 2022 [pdf] [code]ACL 2022 [pdf] [code] [data]EMNLP 2021 [pdf] [code]TACL [pdf]NAACL21 [pdf] [code]EACL 2021 [pdf]TACL 2021 [pdf] [code]Findings of ACL 2021 (short) [pdf] [code]ACL-Findings 2021 [pdf] [code]NAACL21 [pdf]NAACL21 short [pdf] [code]AAAI20 [pdf]AAAI2020 [pdf] [code]COLING20 [pdf]COLING20 [pdf] [code]EMNLP20 Short [pdf] [code]ACL20 [pdf]ACL20 [pdf] [code]ACL19 [pdf] [code]ACL19 [pdf] [code]NAACL19 [pdf] [code]NAACL18 [pdf]ACL2018 Workshop [pdf]ACL18 [pdf]EMNLP18 [pdf] [code]ICLR18 [pdf] [code]EMNLP16 [pdf] [code] EACL 2023 [pdf] [code]EMNLP 2022 [pdf]GEM at EMNLP 2022 [pdf] [code]EMNLP 2022 [pdf] [code]EMNLP 2022 [pdf] [code]COLING 2022 [pdf] [code]COLING 2022 [pdf] [code]COLING 2022 [pdf]Findings of NAACL 2022 [pdf]NAACL 2022 Student Research Workshop [pdf] [code]ACL 2022 [pdf] [code]ACL 2022 [pdf] [code]IEEE INDICON 2021 [pdf]ENIAC 2021 [pdf]EMNLP 2021| newsum [pdf]EMNLP 2021| newsum [pdf]EMNLP 2021| newsum [pdf] [code]EMNLP 2021| newsum [pdf] [code]EMNLP 2021 [pdf] [code]EMNLP 2021 [pdf]Journal of Data Science [pdf]ACL 2022 [pdf] [code]ACL21 [pdf]NAACL21 [pdf] [code]NAACL21 [pdf]EACL21 [pdf]Journal [pdf]NAACL21 short [pdf] [code]NAACL21 [pdf] [code]AAAI21 [pdf] [code]EMNLP20 [pdf] [code]ICSCSP20 [pdf]EMNLP20 [pdf]EMNLP20 [pdf]EMNLP20 short [pdf] [code]AACL20 [pdf] [code]IJCAI20 [pdf] [code]IJCAI20 [pdf]IJCAI20 Special Track on AI in FinTech [pdf]ICML20 [pdf]AAAI20 [pdf] [code]AAAI20 [pdf]KDD Converse 2020 [pdf]EMNLP19 [pdf] [code]EMNLP19 [pdf] [code]EMNLP19 [pdf] [code]EMNLP19 [pdf] [code]EMNLP19 [pdf] [code]ACL19 [pdf] [code]ACL19 [pdf] [code]ACL19 [pdf] [code]RANLP19 [pdf] [code]EMNLP18 [pdf]EMNLP18 [pdf]EMNLP18 [pdf] [code]ACL18 [pdf]ACL18 [pdf]ADMA18 [pdf]NAACL18 [pdf]ACL17 [pdf] [code]ACL17 [pdf]ACL17 [pdf]NAACL15 [pdf]ENLG13 [pdf] EMNLP 2022 [pdf] [code]AAAI 2022 [pdf]EMNLP 2021 short [pdf]EMNLP20 [pdf] [code]COLING20 [pdf]ACL20 [pdf] [code]ICLR19 [pdf] [code]ACL19 [pdf] [code]EMNLP19 [pdf]CoNLL17 [pdf]ACL17 [pdf] ICASSP 2023 [pdf] [code]EMNLP 2022 [pdf] [code]COLING 2022 [pdf] [code]ACL 2022 [[pdf] [code]ACL 2022 [pdf] [code]Findings of EMNLP 2021 [pdf]ACL 2021 Findings [pdf] [code]AAAI21 [pdf] [code]COLING20 [pdf] [code]EMNLP20 [pdf] [code]LREC20 [pdf] [code]ACL19 [pdf] [code]ACL19 [pdf] [code]ACL20 [pdf] [code]ICML19 [pdf] [code]NAACL19 [pdf] [code]EMNLP18 [pdf] [code]ACL18 [pdf] [code] NAACL19 [pdf] [code]EMNLP17 [pdf] [code] TOIS [pdf] [code]NAACL 2022 [pdf] [data]ACL 2022 [pdf] [code]TOIS [pdf] [data]ACL 2021 [pdf] [data]SIGIR 2021 [pdf] [data]ACL20 [pdf] [code]IJCAI19 [pdf] [data] EACL 2023 Findings [pdf]EMNLP 2022 [pdf] [code]TACL 2022 [pdf]Findings of NAACL 202 [pdf] [code]SIGIR 2021 [pdf]EMNLP 2021 Findings [pdf] [code]EMNLP 2021| newsum [pdf] [data]EMNLP 2021 [pdf] [code]EMNLP 2021 [pdf] [code]