Abstract
Many sequential algorithms have been proposed for mining of association rules. However, very little work has been done in mining association rules in distributed databases. A direct application of sequential algorithms to distributed databases is not effective, because it requires a large amount of communication overhead. In this study, an efficient algorithm, DMA, is proposed. It generates a small number of candidate sets and requires only O(n) messages for support count exchange for each candidate set, where n is the number of sites in a distributed database. The algorithm has been implemented on an experimental test bed and its performance is studied. The results show that DMA has superior performance when comparing with the direct application of a popular sequential algorithm in distributed databases.
Original language | English |
---|---|
Pages (from-to) | 911-922 |
Number of pages | 12 |
Journal | IEEE Transactions on Knowledge and Data Engineering |
Volume | 8 |
Issue number | 6 |
DOIs | |
Publication status | Published - 1 Dec 1996 |
Keywords
- Association rule
- Data mining
- Distributed algorithm
- Distributed data mining
- Distributed database
- Knowledge discovery
- Partitioned database
ASJC Scopus subject areas
- Information Systems
- Computer Science Applications
- Computational Theory and Mathematics