medicat
1.0.0
Medicat是医学图像,标题,子图 - 掩饰注释和内联文字参考的数据集。此处提供了访问说明。
数字和标题是从PubMed Central中的开放访问文章中提取的,相应的参考文本是从S2orc得出的。
数据集由:
sample/可用数据示例。
示例数据输入:
{
"pdf_hash": "57c9ad0f4aab133f96d40992c46926fabc901ffa",
"fig_key": "Figure1",
"fig_uri": "2-Figure1-1.png",
"s2_caption": "Figure 1. (A) Barium enema and (B) endoscopic image of the high-grade distal colonic obstruction caused by a 5-cm anastomotic stricture.",
"s2orc_caption": "Figure 1. (A) Barium enema and (B) endoscopic image of the high-grade distal colonic obstruction caused by a 5-cm anastomotic stricture.",
"s2orc_references": [
"Computed tomography (CT) showed a distal large bowel obstruction, and a barium enema revealed a high-grade stenosis proximal to the anastomotic site in the recto-sigmoid region (Figure 1 ).",
"Flexible sigmoidoscopy revealed a tight, fibrotic, benign-appearing anastomotic stricture 15 cm from the anal verge ( Figure 1) ."
],
"radiology": false,
"scope": true,
"predicted_type": "Medical images",
"oa_info": {
"doi": "10.14309/crj.2014.54",
"doi_url": "https://doi.org/10.14309/crj.2014.54",
"oa": {
"is_oa": true,
"oa_status": "gold",
"journal_is_oa": true,
"journal_is_in_doaj": true,
"license": "cc-by-nc-nd",
"provenance": "unpaywall"
}
}
}
相应的图位于figures/57c9ad0f4aab133f96d40992c46926fabc901ffa_2-Figure1-1.png {pdf_hash}_{fig_uri} ))。
请填写此表格以供访问。如果您没有在5天后收到指向数据集的链接,请联系[email protected]查询。有时,访问电子邮件也会在垃圾邮件框中结束,因此请先在电子邮件之前先检查此处。
请参阅与我们论文相关的代码的code目录。 code/README.md包括有关如何使用此代码的其他信息。
如果使用此数据集,请引用:
@inproceedings{subramanian-2020-medicat,
title={{MedICaT: A Dataset of Medical Images, Captions, and Textual References}},
author={Sanjay Subramanian, Lucy Lu Wang, Sachin Mehta, Ben Bogin, Madeleine van Zuylen, Sravanthi Parasa, Sameer Singh, Matt Gardner, and Hannaneh Hajishirzi},
year={2020},
booktitle={Findings of EMNLP},
}
Medicat中的每个源文件的许可不同。 Medicat中包含的文章具有开放访问许可证(请参阅CC和UPW)或在公共领域中。每个文章的许可在数据集的关联条目中提供。使用时请遵守这些许可证。 Medicat数据集仅可用于非商业用途。
电子邮件: {sanjays, lucyw}@allenai.org