TY - GEN
T1 - LDA-based topic formation and topic-sentence reinforcement for graph-based multi-document summarization
AU - Gao, Dehong
AU - Li, Wenjie
AU - Ouyang, You
AU - Zhang, Renxian
PY - 2012/12/31
Y1 - 2012/12/31
N2 - In recent years graph-based ranking algorithms have attracted much attention in document summarization. This paper introduces our recent work on applying a topic model, namely LDA, in graph-based summarization. In the proposed approach, LDA is used to automatically identify a set of semantic topics from the documents to be summarized. The identified topics are then used to construct a bipartite graph to represent the documents. Topic-sentence reinforcement is implemented to calculate the salience scores of topics and sentences simultaneously. By incorporating the information embedded in the topics, the sentence ranking result can be improved. Experiments are conducted on the DUC 2004 data set to evaluate the effectiveness of the proposed approach.
AB - In recent years graph-based ranking algorithms have attracted much attention in document summarization. This paper introduces our recent work on applying a topic model, namely LDA, in graph-based summarization. In the proposed approach, LDA is used to automatically identify a set of semantic topics from the documents to be summarized. The identified topics are then used to construct a bipartite graph to represent the documents. Topic-sentence reinforcement is implemented to calculate the salience scores of topics and sentences simultaneously. By incorporating the information embedded in the topics, the sentence ranking result can be improved. Experiments are conducted on the DUC 2004 data set to evaluate the effectiveness of the proposed approach.
KW - Graph-based sentence ranking
KW - Latent Dirichlet Allocation
KW - Multi-document summarization
UR - http://www.scopus.com/inward/record.url?scp=84871571292&partnerID=8YFLogxK
U2 - 10.1007/978-3-642-35341-3_33
DO - 10.1007/978-3-642-35341-3_33
M3 - Conference article published in proceeding or book
SN - 9783642353406
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 376
EP - 385
BT - Information Retrieval Technology - 8th Asia Information Retrieval Societies Conference, AIRS 2012, Proceedings
T2 - 8th Asia Information Retrieval Societies Conference, AIRS 2012
Y2 - 17 December 2012 through 19 December 2012
ER -