medicat
1.0.0
Medicat是醫學圖像,標題,子圖 - 掩飾註釋和內聯文字參考的數據集。此處提供了訪問說明。
數字和標題是從PubMed Central中的開放訪問文章中提取的,相應的參考文本是從S2orc得出的。
數據集由:
sample/
可用數據示例。
示例數據輸入:
{
"pdf_hash": "57c9ad0f4aab133f96d40992c46926fabc901ffa",
"fig_key": "Figure1",
"fig_uri": "2-Figure1-1.png",
"s2_caption": "Figure 1. (A) Barium enema and (B) endoscopic image of the high-grade distal colonic obstruction caused by a 5-cm anastomotic stricture.",
"s2orc_caption": "Figure 1. (A) Barium enema and (B) endoscopic image of the high-grade distal colonic obstruction caused by a 5-cm anastomotic stricture.",
"s2orc_references": [
"Computed tomography (CT) showed a distal large bowel obstruction, and a barium enema revealed a high-grade stenosis proximal to the anastomotic site in the recto-sigmoid region (Figure 1 ).",
"Flexible sigmoidoscopy revealed a tight, fibrotic, benign-appearing anastomotic stricture 15 cm from the anal verge ( Figure 1) ."
],
"radiology": false,
"scope": true,
"predicted_type": "Medical images",
"oa_info": {
"doi": "10.14309/crj.2014.54",
"doi_url": "https://doi.org/10.14309/crj.2014.54",
"oa": {
"is_oa": true,
"oa_status": "gold",
"journal_is_oa": true,
"journal_is_in_doaj": true,
"license": "cc-by-nc-nd",
"provenance": "unpaywall"
}
}
}
相應的圖位於figures/57c9ad0f4aab133f96d40992c46926fabc901ffa_2-Figure1-1.png
{pdf_hash}_{fig_uri}
))。
請填寫此表格以供訪問。如果您沒有在5天后收到指向數據集的鏈接,請聯繫[email protected]查詢。有時,訪問電子郵件也會在垃圾郵件框中結束,因此請先在電子郵件之前先檢查此處。
請參閱與我們論文相關的代碼的code
目錄。 code/README.md
包括有關如何使用此代碼的其他信息。
如果使用此數據集,請引用:
@inproceedings{subramanian-2020-medicat,
title={{MedICaT: A Dataset of Medical Images, Captions, and Textual References}},
author={Sanjay Subramanian, Lucy Lu Wang, Sachin Mehta, Ben Bogin, Madeleine van Zuylen, Sravanthi Parasa, Sameer Singh, Matt Gardner, and Hannaneh Hajishirzi},
year={2020},
booktitle={Findings of EMNLP},
}
Medicat中的每個源文件的許可不同。 Medicat中包含的文章具有開放訪問許可證(請參閱CC和UPW)或在公共領域中。每個文章的許可在數據集的關聯條目中提供。使用時請遵守這些許可證。 Medicat數據集僅可用於非商業用途。
電子郵件: {sanjays, lucyw}@allenai.org