Skip to main navigation Skip to search Skip to main content

Active Ensemble Learning for Knowledge Graph Error Detection

  • Junnan Dong
  • , Qinggang Zhang
  • , Xiao Huang
  • , Qiaoyu Tan
  • , Daochen Zha
  • , Zhao Zihao

Research output: Chapter in book / Conference proceedingConference article published in proceeding or bookAcademic researchpeer-review

Abstract

Knowledge graphs (KGs) could effectively integrate a large number of real-world assertions, and improve the performance of various applications, such as recommendation and search. KG error detection has been intensively studied since real-world KGs inevitably contain erroneous triples. While existing studies focus on developing a novel algorithm dedicated to one or a few data characteristics, we explore advancing KG error detection by assembling a set of state-of-the-art (SOTA) KG error detectors. However, it is nontrivial to develop a practical ensemble learning framework for KG error detection. Existing ensemble learning models heavily rely on labels, while it is expensive to acquire labeled errors in KGs. Also, KG error detection itself is challenging since triples contain rich semantic information and might be false because of various reasons. To this end, we propose to leverage active learning to minimize human efforts. Our proposed framework - KAEL, could effectively assemble a set of off-the-shelf error detection algorithms, by actively using a limited number of manual annotations. It adaptively updates the ensemble learning policy in each iteration based on active queries, i.e., the answers from experts. After all annotation budget is used, KAEL utilizes the trained policy to identify remaining suspicious triples. Experiments on real-world KGs demonstrate that we can achieve significant improvement when applying KAEL to assemble SOTA error detectors. KAEL also outperforms SOTA ensemble learning baselines significantly.

Original languageEnglish
Title of host publicationWSDM 2023 - Proceedings of the 16th ACM International Conference on Web Search and Data Mining
PublisherAssociation for Computing Machinery, Inc
Pages877-885
Number of pages9
ISBN (Electronic)9781450394079
DOIs
Publication statusPublished - 27 Feb 2023
Event16th ACM International Conference on Web Search and Data Mining, WSDM 2023 - Singapore, Singapore
Duration: 27 Feb 20233 Mar 2023

Publication series

NameWSDM 2023 - Proceedings of the 16th ACM International Conference on Web Search and Data Mining

Conference

Conference16th ACM International Conference on Web Search and Data Mining, WSDM 2023
Country/TerritorySingapore
CitySingapore
Period27/02/233/03/23

Keywords

  • ensemble learning
  • knowledge graph refinement
  • knowledge graphs

ASJC Scopus subject areas

  • Computer Networks and Communications
  • Computer Science Applications
  • Software

Fingerprint

Dive into the research topics of 'Active Ensemble Learning for Knowledge Graph Error Detection'. Together they form a unique fingerprint.

Cite this