An Interpretable Classification Framework for Information Extraction from Online Healthcare Forums.

abstract

Online healthcare forums (OHFs) have become increasingly popular for patients to share their health-related experiences. The healthcare-related texts posted in OHFs could help doctors and patients better understand specific diseases and the situations of other patients. To extract the meaning of a post, a commonly used way is to classify the sentences into several predefined categories of different semantics. However, the unstructured form of online posts brings challenges to existing classification algorithms. In addition, though many sophisticated classification models such as deep neural networks may have good predictive power, it is hard to interpret the models and the prediction results, which is, however, critical in healthcare applications. To tackle the challenges above, we propose an effective and interpretable OHF post classification framework. Specifically, we classify sentences into three classes: medication, symptom, and background. Each sentence is projected into an interpretable feature space consisting of labeled sequential patterns, UMLS semantic types, and other heuristic features. A forest-based model is developed for categorizing OHF posts. An interpretation method is also developed, where the decision rules can be explicitly extracted to gain an insight of useful information in texts. Experimental results on real-world OHF data demonstrate the effectiveness of our proposed computational framework.

authors

Lawley, Mark

published proceedings

J Healthc Eng

author list (cited authors)

Gao, J., Liu, N., Lawley, M., & Hu, X.

citation count

20

complete list of authors

Gao, Jun||Liu, Ninghao||Lawley, Mark||Hu, Xia

publication date

January 2017

publisher

Hindawi Publisher

keywords

Consumer Health Information
Humans
Information Storage And Retrieval
Online Social Networking
Semantics

Digital Object Identifier (DOI)

10.1155/2017/2460174

URI

https://hdl.handle.net/1969.1/169562

start page

2460174

end page

12

volume

2017

URL

http://dx.doi.org/10.1155/2017/2460174

An Interpretable Classification Framework for Information Extraction from Online Healthcare Forums. Academic Article

Overview

abstract

authors

published proceedings

author list (cited authors)

citation count

complete list of authors

publication date

publisher

Research

keywords

Identity

Digital Object Identifier (DOI)

URI

Additional Document Info

start page

end page

volume

Other

URL