Towards knowledge discovery in big data

Richard K. Lomotey, Ralph Deters

Research output: Chapter in Book/Report/Conference proceedingConference contribution

41 Scopus citations

Abstract

Analytics-as - A-Service (AaaS) has become indispensable because it affords stakeholders to discover knowledge in Big Data. Previously, data stored in data warehouses follow some schema and standardization which leads to efficient data mining. However, the Big Data epoch has witnessed the rise of structured, semi-structured, and unstructured data, a trend that motivated enterprises to employ the NoSQL data storages to accommodate the high-dimensional data. Unfortunately, the existing data mining techniques which are designed for schema-oriented storages are non-applicable to the unstructured data style. Thus, the AaaS though still in its infancy, is gaining widespread attention for its ability to provide novel ways and opportunities to mine the heterogeneous data. In this paper, we discuss our AaaS tool that performs terms and topics extraction and organization from unstructured data sources such as NoSQL databases, textual contents (e.g., websites), and structured sources (e.g. SQL). The tool is built on methodologies such as tagging, filtering, association maps, and adaptable dictionary. The evaluation of the tool shows high accuracy in the mining process.

Original languageEnglish (US)
Title of host publicationProceedings - IEEE 8th International Symposium on Service Oriented System Engineering, SOSE 2014
PublisherIEEE Computer Society
Pages181-191
Number of pages11
ISBN (Print)9781479925049
DOIs
StatePublished - 2014
Event8th IEEE International Symposium on Service Oriented System Engineering, SOSE 2014 - Oxford, United Kingdom
Duration: Apr 7 2014Apr 11 2014

Publication series

NameProceedings - IEEE 8th International Symposium on Service Oriented System Engineering, SOSE 2014

Other

Other8th IEEE International Symposium on Service Oriented System Engineering, SOSE 2014
Country/TerritoryUnited Kingdom
CityOxford
Period4/7/144/11/14

All Science Journal Classification (ASJC) codes

  • Software
  • Control and Systems Engineering

Fingerprint

Dive into the research topics of 'Towards knowledge discovery in big data'. Together they form a unique fingerprint.

Cite this