Abstract
Data have significant economic or social value in many application fields including science, business, governance, etc. This naturally leads to the emergence of many data markets such as GBDEx and YoueData. As a result, the data trade through data markets has started to receive attentions from both industry and academia. During the data buying and selling, how to price the data is an indispensable problem. However, pricing incomplete data is more challenging, even though incomplete data exist pervasively in a vast lot of real-life scenarios. In this paper, we attempt to explore the pricing problem for queries over incomplete data. We propose a sophisticated pricing mechanism, termed as iDBPricer, which takes a series of essential factors into consideration, including the data contribution/usage, data completeness, and query quality. We present two novel price functions, namely, the usage, and completeness-aware price function (UCA price for short) and the quality, usage, and completeness-aware price function (QUCA price for short). Moreover, we develop efficient algorithms for deriving the query prices. Extensive experiments using both real and benchmark datasets demonstrate iDBPricer is of excellent performance in terms of effectiveness and scalability, compared with the state-of-the-art price functions.
Original language | English |
---|---|
Pages (from-to) | 4024-4036 |
Number of pages | 13 |
Journal | IEEE Transactions on Knowledge and Data Engineering |
Volume | 34 |
Issue number | 8 |
DOIs | |
Publication status | Published - 1 Aug 2022 |
Keywords
- data pricing
- Data trade
- incomplete data
- query quality
ASJC Scopus subject areas
- Information Systems
- Computer Science Applications
- Computational Theory and Mathematics