A novel structural similarity measure on XML data for integrated document management

K. L. Ng, T. Y. Ng

Research output: Journal article publicationJournal articleAcademic researchpeer-review

Abstract

XML has emerged as a standard for data representation on the Web. Driven by advanced Internet technologies and the growth of e-business activities, large amounts of information has already been created in XML format and stored in document management systems. However, many document management systems have not been fully utilizing XML structures to support effective document searching. In this paper, we discuss an efficient and effective approach to be layered on top of or embedded within a document management system in order to support searching XML documents with proximate querying based on structural similarity. As a core component, we propose a novel structural similarity measure and demonstrate it through extensive experiments that our measure brings significant improvement over previous methods, in terms of accuracy and effectiveness.
Original languageEnglish
Pages (from-to)42-52
Number of pages11
JournalJournal of Computer Information Systems
Volume48
Issue number1
Publication statusPublished - 1 Sep 2007

ASJC Scopus subject areas

  • Information Systems
  • Education
  • Computer Networks and Communications

Cite this