Benchmarking the effectiveness of sequential pattern mining methods

abstract

Recently, there is an increasing interest in new intelligent mining methods to find more meaningful and compact results. In intelligent data mining research, accessing the quality and usefulness of the results from different mining methods is essential. However, there is no general benchmarking criteria to evaluate whether these new methods are indeed more effective compared to the traditional methods. Here we propose a novel benchmarking criteria that can systematically evaluate the effectiveness of any sequential pattern mining method under a variety of situations. The benchmark evaluates how well a mining method finds known common patterns in synthetic data. Such an evaluation provides a comprehensive understanding of the resulting patterns generated from any mining method empirically. In this paper, the criteria are applied to conduct a detailed comparison study of the support-based sequential pattern model with an approximate pattern model based on sequence alignment. The study suggests that the alignment model will give a good summary of the sequential data in the form of a set of common patterns in the data. In contrast, the support model generates massive amounts of frequent patterns with much redundancy. This suggests that the results of the support model require more post processing before it can be of actual use in real applications. 2006 Elsevier B.V. All rights reserved.

authors

Kum, Hye Chung

published proceedings

DATA & KNOWLEDGE ENGINEERING

author list (cited authors)

Kum, H., Chang, J. H., & Wang, W.

citation count

15

complete list of authors

Kum, Hye-Chung||Chang, Joong Hyuk||Wang, Wei

publication date

January 2007

publisher

Elsevier Publisher

published in

Data and Knowledge Engineering Journal

keywords

Benchmarking Effectiveness
Evaluating Quality Of Results
Sequential Pattern Mining

Digital Object Identifier (DOI)

10.1016/j.datak.2006.01.004

start page

30

end page

50

volume

60

issue

1

URL

http://dx.doi.org/10.1016/j.datak.2006.01.004

Benchmarking the effectiveness of sequential pattern mining methods Academic Article

Overview

abstract

authors

published proceedings

author list (cited authors)

citation count

complete list of authors

publication date

publisher

published in

Research

keywords

Identity

Digital Object Identifier (DOI)

Additional Document Info

start page

end page

volume

issue

Other

URL