NMTF-LTM: Towards an Alignment of Semantics for Lifelong Topic Modeling

Zhiqi Lei, Hai Liu, Jiaxing Yan, Yanghui Rao, Qing Li

Research output: Journal article publicationJournal articleAcademic researchpeer-review

3 Citations (Scopus)

Abstract

Aiming at mining high quality topics by accumulating and utilizing semantic knowledge for a stream of documents, lifelong topic modeling (LTM) has attracted more and more attentions recently. However, the permutation of topics may change over time, resulting in a semantic misalignment between the topic representations of document chunks across the stream. Such a misalignment deteriorates the model performances of various downstream tasks, while it has been overlooked by the existing lifelong topic models. Towards addressing the misalignment of semantics, we formulate LTM as a problem of non-negative matrix tri-factorization (NMTF) and propose a consolidation framework (i.e., NMTF-LTM) to enforce an alignment in a mapped topic space. In addition, a distributed parallel algorithm, namely PNMTF-LTM, is developed to meet the real-time requirement for large-scale stream processing. Empirical results show that our method can not only obtain a superior alignment of semantics without loss of topic quality, but also achieve effective speedup when deployed to a high performance computing cluster.

Original languageEnglish
Pages (from-to)10616-10632
Number of pages17
JournalIEEE Transactions on Knowledge and Data Engineering
Volume35
Issue number10
DOIs
Publication statusPublished - 1 Oct 2023

Keywords

  • Lifelong topic modeling
  • non-negative matrix tri-factorization
  • parallel computing
  • semantic alignment

ASJC Scopus subject areas

  • Information Systems
  • Computer Science Applications
  • Computational Theory and Mathematics

Fingerprint

Dive into the research topics of 'NMTF-LTM: Towards an Alignment of Semantics for Lifelong Topic Modeling'. Together they form a unique fingerprint.

Cite this