TY - GEN
T1 - Fast similarity search on keyword-induced point groups
AU - Li, Zhe
AU - Li, Yu
AU - Yiu, Man Lung
PY - 2018/11/6
Y1 - 2018/11/6
N2 - Location-based social media (e.g., Twitter, Foursquare) have been generating massive amounts of geo-textual data. In this paper, we represent the spatial distribution of a keyword by the group of locations tagged with that keyword. Given a query keyword, our problem is to find the k keywords with the most similar distribution of locations. Such a query finds applications in targeted marketing and recommendation. The performance of existing solutions degrades when different point groups overlap significantly, which happens rather frequently in real data. We propose efficient techniques to process similarity search on point groups. Experimental results on Twitter data demonstrate that our solution is up to 6 times faster than the state of the art.
AB - Location-based social media (e.g., Twitter, Foursquare) have been generating massive amounts of geo-textual data. In this paper, we represent the spatial distribution of a keyword by the group of locations tagged with that keyword. Given a query keyword, our problem is to find the k keywords with the most similar distribution of locations. Such a query finds applications in targeted marketing and recommendation. The performance of existing solutions degrades when different point groups overlap significantly, which happens rather frequently in real data. We propose efficient techniques to process similarity search on point groups. Experimental results on Twitter data demonstrate that our solution is up to 6 times faster than the state of the art.
KW - Hausdorff distance
KW - Similarity Searching
KW - Spatio-Textual Searching
UR - http://www.scopus.com/inward/record.url?scp=85058646036&partnerID=8YFLogxK
U2 - 10.1145/3274895.3274920
DO - 10.1145/3274895.3274920
M3 - Conference article published in proceeding or book
AN - SCOPUS:85058646036
T3 - GIS: Proceedings of the ACM International Symposium on Advances in Geographic Information Systems
SP - 109
EP - 118
BT - 26th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, ACM SIGSPATIAL GIS 2018
A2 - Xiong, Li
A2 - Tamassia, Roberto
A2 - Banaei-Kashani, Farnoush
A2 - Güting, Ralf Hartmut
A2 - Hoel, Erik
PB - Association for Computing Machinery
T2 - 26th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems, ACM SIGSPATIAL GIS 2018
Y2 - 6 November 2018 through 9 November 2018
ER -