A general strategy for knowledge acquisition from semantically heterogeneous data sources

Doina Caragea, Jie Bao, Vasant Honavar

Research output: Chapter in Book/Report/Conference proceedingConference contribution

1 Scopus citations

Abstract

With the advent of the Semantic Web, there is increased availability of meta data (ontologies) that make explicit the semantic commitments associated with the data sources. Together with tools for specifying mappings between ontologies, this has opened up for the first time, the possibility of acquiring knowledge from such ontology extended, semantically disparate data sources. Hence, there is an urgent need for machine learning algorithms for building predictive models (e.g., classifiers) in a setting where there is no unique global interpretation of data from semantically disparate sources and it is neither feasible nor desirable to collect data from such sources in a centralized data warehouse. We formulate the problem of learning classifiers from a set of related, semantically heterogeneous data sources, under the assumption that ontologies and mappings from a user ontology to the data source ontologies are given. We design a general strategy for learning classifiers from such data sources by reducing the problem of learning to the problem of answering queries from semantically heterogeneous data and we show how to answer such queries.

Original languageEnglish (US)
Title of host publicationSemantic Web for Collaborative Knowledge Acquisition - Papers from the AAAI Fall Symposium, Technical Report
Pages1-8
Number of pages8
StatePublished - 2006
Event2006 AAAI Fall Symposium - Arlington, VA, United States
Duration: Oct 13 2006Oct 15 2006

Publication series

NameAAAI Fall Symposium - Technical Report
VolumeFS-06-06

Other

Other2006 AAAI Fall Symposium
Country/TerritoryUnited States
CityArlington, VA
Period10/13/0610/15/06

All Science Journal Classification (ASJC) codes

  • General Engineering

Fingerprint

Dive into the research topics of 'A general strategy for knowledge acquisition from semantically heterogeneous data sources'. Together they form a unique fingerprint.

Cite this