Towards Query Pricing on Incomplete Data

Xiaoye Miao, Yunjun Gao, Lu Chen, Huanhuan Peng, Jianwei Yin, Qing Li

Research output: Journal article publicationJournal articleAcademic researchpeer-review

14 Citations (Scopus)

Abstract

Data have significant economic or social value in many application fields including science, business, governance, etc. This naturally leads to the emergence of many data markets such as GBDEx and YoueData. As a result, the data trade through data markets has started to receive attentions from both industry and academia. During the data buying and selling, how to price the data is an indispensable problem. However, pricing incomplete data is more challenging, even though incomplete data exist pervasively in a vast lot of real-life scenarios. In this paper, we attempt to explore the pricing problem for queries over incomplete data. We propose a sophisticated pricing mechanism, termed as iDBPricer, which takes a series of essential factors into consideration, including the data contribution/usage, data completeness, and query quality. We present two novel price functions, namely, the usage, and completeness-aware price function (UCA price for short) and the quality, usage, and completeness-aware price function (QUCA price for short). Moreover, we develop efficient algorithms for deriving the query prices. Extensive experiments using both real and benchmark datasets demonstrate iDBPricer is of excellent performance in terms of effectiveness and scalability, compared with the state-of-the-art price functions.

Original languageEnglish
Pages (from-to)4024-4036
Number of pages13
JournalIEEE Transactions on Knowledge and Data Engineering
Volume34
Issue number8
DOIs
Publication statusPublished - 1 Aug 2022

Keywords

  • data pricing
  • Data trade
  • incomplete data
  • query quality

ASJC Scopus subject areas

  • Information Systems
  • Computer Science Applications
  • Computational Theory and Mathematics

Fingerprint

Dive into the research topics of 'Towards Query Pricing on Incomplete Data'. Together they form a unique fingerprint.

Cite this