바로가기메뉴

본문 바로가기 주메뉴 바로가기

logo

  • P-ISSN1013-0799
  • E-ISSN2586-2073
  • KCI

An Experimental Study on Automatic Summarization of Multiple News Articles

Journal of the Korean Society for Information Management / Journal of the Korean Society for Information Management, (P)1013-0799; (E)2586-2073
2006, v.23 no.1, pp.83-98
https://doi.org/10.3743/KOSIM.2006.23.1.083


Abstract

This study proposes a template-based method of automatic summarization of multiple news articles using the semantic categories of sentences. First, the semantic categories for core information to be included in a summary are identified from training set of documents and their summaries. Then, cue words for each slot of the template are selected for later classification of news sentences into relevant slots. When a news article is input, its event/accident category is identified, and key sentences are extracted from the news article and filled in the relevant slots. The template filled with simple sentences rather than original long sentences is used to generate a summary for an event/accident. In the user evaluation of the generated summaries, the results showed the 54.1% recall ratio and the 58.1% precision ratio in essential information extraction and 11.6% redundancy ratio.

keywords
복수문헌 자동요약, 뉴스기사 자동요약, 템플리트, 슬롯 단서어, 의미범주, multi-document summarization, news article summarization, template, slot cue word, semantic category, multi-document summarization, news article summarization, template, slot cue word, semantic category

Reference

1.

(2005.). 정보검색연구. , -.

2.

(2000). Multi-document Summarization by Visualizing Topical Content. , 79-88.

3.

(g.1982). An overview of the FRUMP system eds. Strategies for Natural Language Processing. 149-176.. , -.

4.

(2). New methods in automatic extracting. , 264-285.

5.

(2000). Multi-document summarization by sentence extraction. , 40-48.

6.

(1999). Summarizing similarities and differences among related documents. 1, 35-67.

7.

(1995). Generating summaries of muliple news articles. , 74-82.

8.

(1999). Development and evaluation of a statistically-based document summarization system. , 61-70.

9.

(2000). Centroid-based summarization of multiple documents: sentence extraction, utility-based evaluation, and user studies. , 21-30.

10.

(2000). Multi-document summarization: methodologies and evaluations. , -.

11.

(1997). A comparative study on feature selection in text categorization. , -.

12.

(1997). A comparative study on feature selection in text categorization. , -.

Journal of the Korean Society for Information Management