TY - GEN
T1 - A general strategy for knowledge acquisition from semantically heterogeneous data sources
AU - Caragea, Doina
AU - Bao, Jie
AU - Honavar, Vasant
N1 - Copyright: Copyright 2011 Elsevier B.V., All rights reserved.
PY - 2006
Y1 - 2006
AB - With the advent of the Semantic Web, there is increased availability of metadata (ontologies) that make explicit the semantic commitments associated with the data sources. Together with tools for specifying mappings between ontologies, this has opened up, for the first time, the possibility of acquiring knowledge from such ontology-extended, semantically disparate data sources. Hence, there is an urgent need for machine learning algorithms for building predictive models (e.g., classifiers) in a setting where there is no unique global interpretation of data from semantically disparate sources and it is neither feasible nor desirable to collect data from such sources in a centralized data warehouse. We formulate the problem of learning classifiers from a set of related, semantically heterogeneous data sources, under the assumption that ontologies and mappings from a user ontology to the data source ontologies are given. We design a general strategy for learning classifiers from such data sources by reducing the problem of learning to the problem of answering queries from semantically heterogeneous data, and we show how to answer such queries.
UR - http://www.scopus.com/inward/record.url?scp=33947230178&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=33947230178&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:33947230178
SN - 1577353048
SN - 9781577353041
T3 - AAAI Fall Symposium - Technical Report
SP - 1
EP - 8
BT - Semantic Web for Collaborative Knowledge Acquisition - Papers from the AAAI Fall Symposium, Technical Report
T2 - 2006 AAAI Fall Symposium
Y2 - 13 October 2006 through 15 October 2006
ER -