Leveraging Ontologies to Improve Model Generalization Automatically with Online Data Sources

abstract

This paper describes an end-to-end learning framework that allows a novice to create a model from data easily by helping structure the model building process and capturing extended aspects of domain knowledge. By treating the whole modeling process interactively and exploiting high-level knowledge in the form of an ontology, the framework is able to aid the user in a number of ways, including in helping to avoid pitfalls such as data dredging. Prudence must be exercised to avoid these hazards: certain conclusions may be supported by extra knowledge if, for example, there are reasons to trust a particular narrower set of hypotheses. This paper adopts the solution of using higher-level knowledge in order to allow this sort of domain knowledge to be inferred automatically, thereby selecting only relevant input attributes and thence constraining the hypothesis space. We describe how the framework automatically exploits structured knowledge in an ontology to identify relevant concepts, and how a data extraction component can make use of online data sources to find measurements of those concepts so that their relevance can be evaluated. To validate our approach, models of four different problem domains were built using our implementation of the framework. Prediction error on unseen examples of these models show that our framework, making use of the ontology, helps to improve model generalization.

authors

Shell, Dylan

published proceedings

PROCEEDINGS OF THE TWENTY-NINTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE

author list (cited authors)

Janpuangtong, S., & Shell, D. A.

citation count

0

complete list of authors

Janpuangtong, Sasin||Shell, Dylan A

publication date

June 2015

publisher

Association for the Advancement of Artificial Intelligence (AAAI) Publisher

published in

Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence Journal

keywords

46 Information And Computing Sciences
4602 Artificial Intelligence

Digital Object Identifier (DOI)

10.1609/aaai.v29i2.19058

International Standard Book Number (ISBN) 13

9781577357032

start page

3981

end page

3986

volume

29

issue

2

URL

http://dx.doi.org/10.1609/aaai.v29i2.19058

Leveraging Ontologies to Improve Model Generalization Automatically with Online Data Sources Conference Paper

Overview

abstract

authors

published proceedings

author list (cited authors)

citation count

complete list of authors

publication date

publisher

published in

Research

keywords

Identity

Digital Object Identifier (DOI)

International Standard Book Number (ISBN) 13

Additional Document Info

start page

end page

volume

issue

Other

URL