TY - GEN
T1 - Updating graph indices with a one-pass algorithm
AU - Yuan, Dayu
AU - Mitra, Prasenjit
AU - Yu, Huiwen
AU - Giles, C. Lee
N1 - Publisher Copyright:
Copyright © 2015 ACM.
PY - 2015/5/27
Y1 - 2015/5/27
N2 - Indices are commonly built into graph databases in order to support fast searches. Any given graph database and the distribution of queries will change over time. Therefore, the cost of processing queries using a static graph index increases because the index is built to optimize old snapshots of the database. There is growing research interest in determining how to update a graph index with the purpose of adapting to database and query changes. Updating features in a graph index is typically an NP-hard problem. In addition, because the features are chosen from a large number of frequent subgraphs, a multi-pass algorithm is not scalable to big datasets. In order to address this issue, we propose a time-efficient one-pass algorithm that is designed to update a graph index by scanning each frequent subgraph at most once. The algorithm replaces a feature with a new subgraph if the latter is "better" than the former one. We use the branch and bound technique to skip subgraphs that cannot outperform any of the features in the graph index. We further use a decomposed index and reduce the space complexity from O(|G||Q|) to O(|G|+|Q|), where G is database graphs and Q is a query workload. Through the empirical study, we show that the one-pass algorithm is 5-100 times faster than all previous algorithms for updating graph indices. In addition, the one-pass algorithm guarantees the return of a close to optimum solution. Our experiments show that when the one-pass algorithm is used to update an index, the query-processing speed is 1-2 times faster than that of other cutting-edge indices, i.e., the FGindex and the gIndex.
AB - Indices are commonly built into graph databases in order to support fast searches. Any given graph database and the distribution of queries will change over time. Therefore, the cost of processing queries using a static graph index increases because the index is built to optimize old snapshots of the database. There is growing research interest in determining how to update a graph index with the purpose of adapting to database and query changes. Updating features in a graph index is typically an NP-hard problem. In addition, because the features are chosen from a large number of frequent subgraphs, a multi-pass algorithm is not scalable to big datasets. In order to address this issue, we propose a time-efficient one-pass algorithm that is designed to update a graph index by scanning each frequent subgraph at most once. The algorithm replaces a feature with a new subgraph if the latter is "better" than the former one. We use the branch and bound technique to skip subgraphs that cannot outperform any of the features in the graph index. We further use a decomposed index and reduce the space complexity from O(|G||Q|) to O(|G|+|Q|), where G is database graphs and Q is a query workload. Through the empirical study, we show that the one-pass algorithm is 5-100 times faster than all previous algorithms for updating graph indices. In addition, the one-pass algorithm guarantees the return of a close to optimum solution. Our experiments show that when the one-pass algorithm is used to update an index, the query-processing speed is 1-2 times faster than that of other cutting-edge indices, i.e., the FGindex and the gIndex.
UR - http://www.scopus.com/inward/record.url?scp=84957601437&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84957601437&partnerID=8YFLogxK
U2 - 10.1145/2723372.2746482
DO - 10.1145/2723372.2746482
M3 - Conference contribution
AN - SCOPUS:84957601437
T3 - Proceedings of the ACM SIGMOD International Conference on Management of Data
SP - 1903
EP - 1916
BT - SIGMOD 2015 - Proceedings of the 2015 ACM SIGMOD International Conference on Management of Data
PB - Association for Computing Machinery
T2 - ACM SIGMOD International Conference on Management of Data, SIGMOD 2015
Y2 - 31 May 2015 through 4 June 2015
ER -