TY - JOUR
T1 - IR-tree
T2 - An efficient index for geographic document search
AU - Li, Zhisheng
AU - Lee, Ken C.K.
AU - Zheng, Baihua
AU - Lee, Wang Chien
AU - Lee, Dik
AU - Wang, Xufa
N1 - Funding Information:
Zhisheng Li’s work was conducted during his stay as a visiting student with the PDA research Group of the Pennsylvania State University. Ken C.K. Lee’s work was conducted during his PhD study at the PDA research Group of the Pennsylvania State University. Wang-Chien Lee and Ken C.K. Lee were supported in part by the National Science Foundation under Grant no. IIS-0534343 and CNS-0626709. Dik Lun Lee was supported in part by the Research Grants Council, Hong Kong SAR, China, under GRF 615806 and 615707.
PY - 2011
Y1 - 2011
N2 - Given a geographic query that is composed of query keywords and a location, a geographic search engine retrieves documents that are the most textually and spatially relevant to the query keywords and the location, respectively, and ranks the retrieved documents according to their joint textual and spatial relevances to the query. The lack of an efficient index that can simultaneously handle both the textual and spatial aspects of the documents makes existing geographic search engines inefficient in answering geographic queries. In this paper, we propose an efficient index, called IR-tree, that together with a top-k document search algorithm facilitates four major tasks in document searches, namely, 1) spatial filtering, 2) textual filtering, 3) relevance computation, and 4) document ranking in a fully integrated manner. In addition, IR-tree allows searches to adopt different weights on textual and spatial relevance of documents at the runtime and thus caters for a wide variety of applications. A set of comprehensive experiments over a wide range of scenarios has been conducted and the experiment results demonstrate that IR-tree outperforms the state-of-the-art approaches for geographic document searches.
AB - Given a geographic query that is composed of query keywords and a location, a geographic search engine retrieves documents that are the most textually and spatially relevant to the query keywords and the location, respectively, and ranks the retrieved documents according to their joint textual and spatial relevances to the query. The lack of an efficient index that can simultaneously handle both the textual and spatial aspects of the documents makes existing geographic search engines inefficient in answering geographic queries. In this paper, we propose an efficient index, called IR-tree, that together with a top-k document search algorithm facilitates four major tasks in document searches, namely, 1) spatial filtering, 2) textual filtering, 3) relevance computation, and 4) document ranking in a fully integrated manner. In addition, IR-tree allows searches to adopt different weights on textual and spatial relevance of documents at the runtime and thus caters for a wide variety of applications. A set of comprehensive experiments over a wide range of scenarios has been conducted and the experiment results demonstrate that IR-tree outperforms the state-of-the-art approaches for geographic document searches.
UR - http://www.scopus.com/inward/record.url?scp=79951934075&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=79951934075&partnerID=8YFLogxK
U2 - 10.1109/TKDE.2010.149
DO - 10.1109/TKDE.2010.149
M3 - Article
AN - SCOPUS:79951934075
SN - 1041-4347
VL - 23
SP - 585
EP - 599
JO - IEEE Transactions on Knowledge and Data Engineering
JF - IEEE Transactions on Knowledge and Data Engineering
IS - 4
M1 - 5560653
ER -