TY - JOUR
T1 - Improving search ranking of geospatial data based on deep learning using user behavior data
AU - Li, Yun
AU - Jiang, Yongyao
AU - Yang, Chaowei
AU - Yu, Manzhu
AU - Kamal, Lara
AU - Armstrong, Edward M.
AU - Huang, Thomas
AU - Moroni, David
AU - McGibbney, Lewis J.
N1 - Publisher Copyright:
© 2020
PY - 2020/9
Y1 - 2020/9
N2 - Finding geospatial data has been a big challenge regarding the data size and heterogeneity across various domains. Previous work has explored using machine learning to improve geospatial data search ranking, but it usually relies on training data labelled by subject matter experts, which makes it laborious and costly to apply to scenarios in which data relevancy to a query can change over time. When a user interacts with a search engine, plenteous information is recorded in the log file, which is essentially free, sustainable and up-to-the-minute. In this research, we propose a deep learning-based search ranking framework that can expeditiously update the ranking model through capturing real-time user clickstream data. The contributions of the proposed framework consist of 1) a log parser that can ingest and parse Web logs that record users’ behavior in a real-time manner; 2) a set of hypotheses of modelling the relative relevance of data; and 3) a deep learning based ranking model which can be updated dynamically with the increment of user behavior data. Quantitative comparison with a few other machine learning algorithms suggests substantial improvement.
AB - Finding geospatial data has been a big challenge regarding the data size and heterogeneity across various domains. Previous work has explored using machine learning to improve geospatial data search ranking, but it usually relies on training data labelled by subject matter experts, which makes it laborious and costly to apply to scenarios in which data relevancy to a query can change over time. When a user interacts with a search engine, plenteous information is recorded in the log file, which is essentially free, sustainable and up-to-the-minute. In this research, we propose a deep learning-based search ranking framework that can expeditiously update the ranking model through capturing real-time user clickstream data. The contributions of the proposed framework consist of 1) a log parser that can ingest and parse Web logs that record users’ behavior in a real-time manner; 2) a set of hypotheses of modelling the relative relevance of data; and 3) a deep learning based ranking model which can be updated dynamically with the increment of user behavior data. Quantitative comparison with a few other machine learning algorithms suggests substantial improvement.
UR - http://www.scopus.com/inward/record.url?scp=85086013373&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85086013373&partnerID=8YFLogxK
U2 - 10.1016/j.cageo.2020.104520
DO - 10.1016/j.cageo.2020.104520
M3 - Article
AN - SCOPUS:85086013373
SN - 0098-3004
VL - 142
JO - Computers and Geosciences
JF - Computers and Geosciences
M1 - 104520
ER -