Unlocking Large Language Model Power in Industry: Privacy-Preserving Collaborative Creation of Knowledge Graph

Liqiao Xia, Junming Fan, Ajith Parlikad, Xiao Huang, Pai Zheng

Research output: Journal article publicationJournal articleAcademic researchpeer-review

Abstract

Semantic expertise remains a reliable foundation for industrial decision-making, while Large Language Models (LLMs) can augment the often limited empirical knowledge by generating domain-specific insights, though the quality of this generative knowledge is uncertain. Integrating LLMs with the collective wisdom of multiple stakeholders could enhance the quality and scale of knowledge, yet this integration might inadvertently raise privacy concerns for stakeholders. In response to this challenge, Federated Learning (FL) is harnessed to improve the knowledge base quality by cryptically leveraging other stakeholders' knowledge, where knowledge base is represented in Knowledge Graph (KG) form. Initially, a multi-field hyperbolic (MFH) graph embedding method vectorizes entities, furnishing mathematical representations in lieu of solely semantic meanings. The FL framework subsequently encrypted identifies and fuses common entities, whereby the updated entities' embedding can refine other private entities' embedding locally, thus enhancing the overall KG quality. Finally, the KG complement method refines and clarifies triplets to improve the overall quality of the KG. An experiment assesses the proposed approach across different industrial KGs, confirming its effectiveness as a viable solution for collaborative KG creation, all while maintaining data security.

Original languageEnglish
Pages (from-to)1-15
Number of pages15
JournalIEEE Transactions on Big Data
DOIs
Publication statusPublished - 26 Dec 2024

Keywords

  • Federated Learning
  • Graph Embedding
  • Industrial 4.0
  • Knowledge Graph
  • Large Language Models

ASJC Scopus subject areas

  • Information Systems
  • Information Systems and Management

Fingerprint

Dive into the research topics of 'Unlocking Large Language Model Power in Industry: Privacy-Preserving Collaborative Creation of Knowledge Graph'. Together they form a unique fingerprint.

Cite this