A supervised aggregation framework for multi-document summarization

Yulong Pei, Wenpeng Yin, Qifeng Fan, Lianen Huang

Research output: Contribution to conferencePaperpeer-review

11 Scopus citations

Abstract

In most summarization approaches, sentence ranking plays a vital role. Most previous work explored different features and combined them into unified ranking methods. However, it would be imprecise to rank sentences from a single point of view because contributions from the features are onefold in these methods. In this paper, a novel supervised aggregation approach for summarization is proposed which combines different summarization methods including Lex- PageRank, LexHITS, manifold-ranking method and DivRank. Human labeled data are used to train an optimization model which combines these multiple summarizers and then the weights assigned to each individual summarizer are learned. Experiments are conducted on DUC2004 data set and the results demonstrate the effectiveness of the supervised aggregation method compared with typical ensemble approaches. In addition, we also investigate the influence of training data construction and component diversity on the summarization results.

Original languageEnglish (US)
Pages2225-2242
Number of pages18
StatePublished - 2012
Event24th International Conference on Computational Linguistics, COLING 2012 - Mumbai, India
Duration: Dec 8 2012Dec 15 2012

Other

Other24th International Conference on Computational Linguistics, COLING 2012
Country/TerritoryIndia
CityMumbai
Period12/8/1212/15/12

All Science Journal Classification (ASJC) codes

  • Computational Theory and Mathematics
  • Language and Linguistics
  • Linguistics and Language

Fingerprint

Dive into the research topics of 'A supervised aggregation framework for multi-document summarization'. Together they form a unique fingerprint.

Cite this