TY - GEN
T1 - Speeding up similarity queries over large Chinese calligraphic character databases using data grid
AU - Zhuang, Yi
AU - Zhuang, Yueting
AU - Li, Qing
AU - Wu, Fei
PY - 2007/12/1
Y1 - 2007/12/1
N2 - This paper proposes a novel data-grid-based k-nearest-neighbor query method for large Chinese calligraphic character databases that significantly speeds up retrieval. The method proceeds in three steps. First, when a user submits a query request to a query node, the character set is reduced using the iDistance index at the different data nodes, and the candidate characters are sent to the executing nodes through a package-based transfer technique. Second, the candidate characters are refined in parallel at the executing nodes to obtain the answer set. Finally, the answer set is transferred to the query node. The proposed method incorporates a uniform-start-distance-based character data allocation policy and a character reduction algorithm. Analysis and experimental results show that the algorithm effectively minimizes response time by decreasing network transfer cost and increasing the parallelism of I/O and CPU.
AB - This paper proposes a novel data-grid-based k-nearest-neighbor query method for large Chinese calligraphic character databases that significantly speeds up retrieval. The method proceeds in three steps. First, when a user submits a query request to a query node, the character set is reduced using the iDistance index at the different data nodes, and the candidate characters are sent to the executing nodes through a package-based transfer technique. Second, the candidate characters are refined in parallel at the executing nodes to obtain the answer set. Finally, the answer set is transferred to the query node. The proposed method incorporates a uniform-start-distance-based character data allocation policy and a character reduction algorithm. Analysis and experimental results show that the algorithm effectively minimizes response time by decreasing network transfer cost and increasing the parallelism of I/O and CPU.
UR - http://www.scopus.com/inward/record.url?scp=46449133510&partnerID=8YFLogxK
U2 - 10.1109/GCC.2007.118
DO - 10.1109/GCC.2007.118
M3 - Conference article published in proceeding or book
AN - SCOPUS:46449133510
SN - 0769528716
SN - 9780769528717
T3 - Proceedings of the 6th International Conference on Grid and Cooperative Computing, GCC 2007
SP - 499
EP - 506
BT - Proceedings of the 6th International Conference on Grid and Cooperative Computing, GCC 2007
T2 - 6th International Conference on Grid and Cooperative Computing, GCC 2007
Y2 - 16 August 2007 through 18 August 2007
ER -