TY - GEN
T1 - Clustering remote RDF data using SPARQL update queries
AU - Qi, Letao
AU - Lin, Harris T.
AU - Honavar, Vasant
PY - 2013
Y1 - 2013
N2 - The emergence of large and distributed RDF data in the Linked Open Data cloud calls for approaches to extract useful knowledge using machine learning techniques such as clustering. However, the massive size and remote nature of RDF data hinder traditional approaches that gather the datasets onto a centralized location for analysis. In this work, we show how to implement two representative clustering algorithms using update queries against the SPARQL endpoint of the RDF store. We compare the time complexity and the communication complexity of our algorithms with those of algorithms that require direct centralized access to the data and hence have to retrieve the entire RDF dataset from the remote location. We conduct experiments on a real social network dataset and report our preliminary findings.
AB - The emergence of large and distributed RDF data in the Linked Open Data cloud calls for approaches to extract useful knowledge using machine learning techniques such as clustering. However, the massive size and remote nature of RDF data hinder traditional approaches that gather the datasets onto a centralized location for analysis. In this work, we show how to implement two representative clustering algorithms using update queries against the SPARQL endpoint of the RDF store. We compare the time complexity and the communication complexity of our algorithms with those of algorithms that require direct centralized access to the data and hence have to retrieve the entire RDF dataset from the remote location. We conduct experiments on a real social network dataset and report our preliminary findings.
UR - http://www.scopus.com/inward/record.url?scp=84881411773&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84881411773&partnerID=8YFLogxK
U2 - 10.1109/ICDEW.2013.6547456
DO - 10.1109/ICDEW.2013.6547456
M3 - Conference contribution
AN - SCOPUS:84881411773
SN - 9781467353021
T3 - Proceedings - International Conference on Data Engineering
SP - 236
EP - 242
BT - 2013 IEEE 29th International Conference on Data Engineering Workshops, ICDEW 2013
T2 - 2013 IEEE 29th International Conference on Data Engineering Workshops, ICDEW 2013
Y2 - 8 April 2013 through 11 April 2013
ER -