TY - JOUR
T1 - Supporting K nearest neighbors query on high-dimensional data in P2P systems
AU - Li, Mei
AU - Lee, Wang Chien
AU - Sivasubramaniam, Anand
AU - Zhao, Jizhong
N1 - Funding Information:
Acknowledgements This research was supported in part by National Science Foundation of US (Grant Nos. IIS-0328881, IIS-0534343 and CNS-0626709).
Copyright:
Copyright 2008 Elsevier B.V., All rights reserved.
PY - 2008/9
Y1 - 2008/9
N2 - Peer-to-peer systems have been widely used for sharing and exchanging data and resources among numerous computer nodes. Various data objects identifiable with high dimensional feature vectors, such as text, images, genome sequences, are starting to leverage P2P technology. Most of the existing works have been focusing on queries on data objects with one or few attributes and thus are not applicable on high dimensional data objects. In this study, we investigate K nearest neighbors query (KNN) on high dimensional data objects in P2P systems. Efficient query algorithm and solutions that address various technical challenges raised by high dimensionality, such as search space resolution and incremental search space refinement, are proposed. An extensive simulation using both synthetic and real data sets demonstrates that our proposal efficiently supports KNN query on high dimensional data in P2P systems.
AB - Peer-to-peer systems have been widely used for sharing and exchanging data and resources among numerous computer nodes. Various data objects identifiable with high dimensional feature vectors, such as text, images, genome sequences, are starting to leverage P2P technology. Most of the existing works have been focusing on queries on data objects with one or few attributes and thus are not applicable on high dimensional data objects. In this study, we investigate K nearest neighbors query (KNN) on high dimensional data objects in P2P systems. Efficient query algorithm and solutions that address various technical challenges raised by high dimensionality, such as search space resolution and incremental search space refinement, are proposed. An extensive simulation using both synthetic and real data sets demonstrates that our proposal efficiently supports KNN query on high dimensional data in P2P systems.
UR - http://www.scopus.com/inward/record.url?scp=49549093429&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=49549093429&partnerID=8YFLogxK
U2 - 10.1007/s11704-008-0026-7
DO - 10.1007/s11704-008-0026-7
M3 - Article
AN - SCOPUS:49549093429
SN - 1673-7350
VL - 2
SP - 234
EP - 247
JO - Frontiers of Computer Science in China
JF - Frontiers of Computer Science in China
IS - 3
ER -