跳到主要导航 跳到搜索 跳到主要内容

Boosting text extraction from biomedical images using text region detection

  • Oak Ridge National Laboratory
  • Yale University

科研成果: 书/报告/会议事项章节会议稿件同行评审

7 引用 (Scopus)

摘要

In this paper, we show that domain-optimized text detection in biomedical images is important for boosting text extraction recall via off-the-shelf OCR engines. Methodologically, we contrast OCR performance when processing raw biomedical images, compared to preprocessing those images, and performing OCR on detected image text regions only. To quantify OCR extraction results, we rely on a gold standard image text corpus with manually identified image text strings. To demonstrate the positive effect on biomedical image retrieval, we apply image text detection and extraction to a large corpus of biomedical images in the Yale Image Finder system. We show that improved text extraction results in the retrieval of a larger number of relevant images for a set of domain-relevant keyword searches.

源语言英语
主期刊名Proceedings of the 2011 Biomedical Sciences and Engineering Conference
主期刊副标题Image Informatics and Analytics in Biomedicine, BSEC 2011
DOI
出版状态已出版 - 2011
已对外发布
活动2011 Biomedical Sciences and Engineering Conference: Image Informatics and Analytics in Biomedicine, BSEC 2011 - Knoxville, TN, 美国
期限: 15 3月 201117 3月 2011

出版系列

姓名Proceedings of the 2011 Biomedical Sciences and Engineering Conference: Image Informatics and Analytics in Biomedicine, BSEC 2011

会议

会议2011 Biomedical Sciences and Engineering Conference: Image Informatics and Analytics in Biomedicine, BSEC 2011
国家/地区美国
Knoxville, TN
时期15/03/1117/03/11

学术指纹

探究 'Boosting text extraction from biomedical images using text region detection' 的科研主题。它们共同构成独一无二的指纹。

引用此