TY - GEN
T1 - Extracting Temporal Patterns From Large-Scale Text Corpus
AU - Liu, Yu
AU - Hua, Wen
AU - Zhou, Xiaofang
N1 - Publisher Copyright:
© 2019, Springer Nature Switzerland AG.
PY - 2019
Y1 - 2019
N2 - Knowledge, in practice, is time-variant and many relations are only valid for a certain period of time. This phenomenon highlights the importance of designing temporal patterns, i.e., indicating phrases and their temporal meanings, for temporal knowledge harvesting. However, pattern design is extremely laborious and time consuming even for a single relation. Therefore, in this work, we study the problem of temporal pattern extraction by automatically analysing a large-scale text corpus with a small number of seed temporal facts. The problem is challenging considering the ambiguous nature of natural language and the huge amount of documents we need to analyse in order to obtain highly representative temporal patterns. To this end, we introduce various techniques, including corpus annotation, pattern generation, scoring and clustering, to reduce ambiguity in the text corpus and improve both accuracy and coverage of the extracted patterns. We conduct extensive experiments on real world datasets and the experimental results verify the effectiveness of our proposals.
AB - Knowledge, in practice, is time-variant and many relations are only valid for a certain period of time. This phenomenon highlights the importance of designing temporal patterns, i.e., indicating phrases and their temporal meanings, for temporal knowledge harvesting. However, pattern design is extremely laborious and time consuming even for a single relation. Therefore, in this work, we study the problem of temporal pattern extraction by automatically analysing a large-scale text corpus with a small number of seed temporal facts. The problem is challenging considering the ambiguous nature of natural language and the huge amount of documents we need to analyse in order to obtain highly representative temporal patterns. To this end, we introduce various techniques, including corpus annotation, pattern generation, scoring and clustering, to reduce ambiguity in the text corpus and improve both accuracy and coverage of the extracted patterns. We conduct extensive experiments on real world datasets and the experimental results verify the effectiveness of our proposals.
KW - Temporal knowledge harvesting
KW - Temporal patterns
KW - Text mining
UR - http://www.scopus.com/inward/record.url?scp=85061075971&partnerID=8YFLogxK
U2 - 10.1007/978-3-030-12079-5_2
DO - 10.1007/978-3-030-12079-5_2
M3 - Conference article published in proceeding or book
AN - SCOPUS:85061075971
SN - 9783030120788
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 17
EP - 30
BT - Databases Theory and Applications - 30th Australasian Database Conference, ADC 2019, Proceedings
A2 - Chang, Lijun
A2 - Cao, Xin
A2 - Gan, Junhao
PB - Springer Verlag
T2 - 30th Australasian Database Conference, ADC 2019
Y2 - 29 January 2019 through 1 February 2019
ER -