Automatic facet extraction based on multidimensional semantic index

Xiao Wei, Xiangfeng Luo, Qing Li

Research output: Chapter in book / Conference proceeding › Conference article published in proceeding or book › Academic research › peer-review

2 Citations (Scopus)

Abstract

Faceted search over web pages requires accurate facets. However, extracting facets accurately is difficult because web pages are unstructured and lack explicit facet information, which makes facet extraction a key step in faceted search. This paper proposes a method for automatically extracting facets from unstructured web pages to improve faceted search on the web. A Multidimensional Semantic Index (MDSI) of the web pages is constructed by mining various kinds of semantic relations among the words in the pages, yielding a semantically rich index. Within the MDSI, the semantic indexes of the different dimensions are bridged by mining the semantic mappings between them. Facets are then extracted by analyzing these semantic mapping relations. To validate the proposed method, two datasets are constructed; the experimental results show that the method is feasible and comparatively precise.

Original language: English
Title of host publication: Proceedings - 2012 8th International Conference on Semantics, Knowledge and Grids, SKG 2012
Pages: 64-71
Number of pages: 8
Publication status: Published - 1 Dec 2012
Externally published: Yes
Event: 2012 8th International Conference on Semantics, Knowledge and Grids, SKG 2012 - Beijing, China
Duration: 22 Oct 2012 → 24 Oct 2012

Publication series

Name: Proceedings - 2012 8th International Conference on Semantics, Knowledge and Grids, SKG 2012

Conference

Conference: 2012 8th International Conference on Semantics, Knowledge and Grids, SKG 2012
Country/Territory: China
City: Beijing
Period: 22/10/12 → 24/10/12

Keywords

  • facet extraction
  • faceted search
  • multidimensional semantic index
  • semantic mapping

ASJC Scopus subject areas

  • Computer Networks and Communications
