Generic text summarization using relevance measure and latent semantic analysis

  • NEC Corporation

Research output: Contribution to journal, Conference article, Peer-reviewed

707 Citations (Scopus)

Abstract

In this paper, we propose two generic text summarization methods that create text summaries by ranking and extracting sentences from the original documents. The first method uses standard IR methods to rank sentence relevances, while the second method uses the latent semantic analysis technique to identify semantically important sentences, for summary creations. Both methods strive to select sentences that are highly ranked and different from each other. This is an attempt to create a summary with a wider coverage of the document's main content and less redundancy. Performance evaluations on the two summarization methods are conducted by comparing their summarization outputs with the manual summaries generated by three independent human evaluators. The evaluations also study the influence of different VSM weighting schemes on the text summarization performances. Finally, the causes of the large disparities in the evaluators' manual summarization results are investigated, and discussions on human text summarization patterns are presented.
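The abstract gives no algorithmic detail, so the following is only a minimal sketch of one plausible reading of the first (relevance-measure) method: score each sentence by the inner product of its term-frequency vector with the document vector, pick the top sentence, then drop its terms from the document so that later picks cover different content. The function names, the regex tokenizer, and the raw term-frequency weighting are all assumptions made for illustration; they are not taken from the paper.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase and keep alphabetic tokens (an illustrative simplification)."""
    return re.findall(r"[a-z]+", text.lower())


def summarize(sentences, k):
    """Greedily pick up to k sentences that are relevant and mutually different."""
    # Raw term-frequency vector per sentence (assumed weighting; no stemming
    # or stop-word removal is applied in this sketch).
    vectors = [Counter(tokenize(s)) for s in sentences]

    # Document vector: the sum of all sentence vectors.
    doc = Counter()
    for v in vectors:
        doc.update(v)

    def score(i):
        # Inner product of sentence i with the *current* document vector.
        # Counter returns 0 for terms that have already been removed.
        return sum(tf * doc[term] for term, tf in vectors[i].items())

    candidates = set(range(len(sentences)))
    chosen = []
    while candidates and len(chosen) < k:
        # sorted() makes ties resolve to the earliest sentence.
        best = max(sorted(candidates), key=score)
        if score(best) == 0:
            break  # every remaining sentence only repeats covered terms
        chosen.append(best)
        candidates.remove(best)
        # Drop the covered terms so the next pick favors new content.
        for term in vectors[best]:
            doc.pop(term, None)

    # Return the selected sentences in their original document order.
    return [sentences[i] for i in sorted(chosen)]
```

Removing the selected sentence's terms is what penalizes redundancy here: a near-duplicate of an already chosen sentence scores close to zero on the next round. Swapping in a different weighting scheme would only change how `vectors` and `doc` are built.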

Original language: English
Pages (from-to): 19-25
Number of pages: 7
Journal: SIGIR Forum (ACM Special Interest Group on Information Retrieval)
DOI
Publication status: Published - 2001
Externally published
Event: 24th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval - New Orleans, LA, United States
Duration: 9 Sep 2001 to 13 Sep 2001
