Abstract
Introducing data compression concept to large databases has been proposed for many years. In this project, we propose a new algorithm for the compression of large databases. Our goal is to optimize the I/O effort for finding association rules. The algorithm partitions the databases into two parts and all transactions will be compressed with the help of a reference transaction found in the small partition. We also compared the proposed compression algorithms with a normal compression algorithm - the binary compression. Empirical evaluation shows that the proposed algorithm performs well both in reducing the storage space and the I/O process required to find the large itemsets for association rules.
Original language | English |
---|---|
Title of host publication | Annual Conference of the North American Fuzzy Information Processing Society - NAFIPS |
Pages | 441-446 |
Number of pages | 6 |
Publication status | Published - 1 Dec 2001 |
Event | Joint 9th IFSA World Congress and 20th NAFIPS International Conference - Vancouver, BC, Canada Duration: 25 Jul 2001 → 28 Jul 2001 |
Conference
Conference | Joint 9th IFSA World Congress and 20th NAFIPS International Conference |
---|---|
Country/Territory | Canada |
City | Vancouver, BC |
Period | 25/07/01 → 28/07/01 |
ASJC Scopus subject areas
- General Computer Science
- General Mathematics