Skip to main navigation Skip to search Skip to main content

Generic text summarization using relevance measure and latent semantic analysis

  • NEC Corporation

Research output: Contribution to journalConference articlepeer-review

707 Scopus citations

Abstract

In this paper, we propose two generic text summarization methods that create text summaries by ranking and extracting sentences from the original documents. The first method uses standard IR methods to rank sentence relevances, while the second method uses the latent semantic analysis technique to identify semantically important sentences, for summary creations. Both methods strive to select sentences that are highly ranked and different from each other. This is an attempt to create a summary with a wider coverage of the document's main content and less redundancy. Performance evaluations on the two summarization methods are conducted by comparing their summarization outputs with the manual summaries generated by three independent human evaluators. The evaluations also study the influence of different VSM weighting schemes on the text summarization performances. Finally, the causes of the large disparities in the evaluators' manual summarization results are investigated, and discussions on human text summarization patterns are presented.

Original languageEnglish
Pages (from-to)19-25
Number of pages7
JournalSIGIR Forum (ACM Special Interest Group on Information Retrieval)
DOIs
StatePublished - 2001
Externally publishedYes
Event24th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval - New Orleans, LA, United States
Duration: 9 Sep 200113 Sep 2001

Keywords

  • Generic Text Summarization
  • Latent Semantic Analysis
  • Relevance Measure

Fingerprint

Dive into the research topics of 'Generic text summarization using relevance measure and latent semantic analysis'. Together they form a unique fingerprint.

Cite this