Quantitative Association Analysis Using Tree Hierarchies Conference Paper uri icon

abstract

  • Association analysis arises in many important applications such as bioinformatics and business intelligence. Given a large collection of measurements over a set of samples, association analysis aims to find dependencies of target variables to subsets of measurements. Most previous algorithms adopt a two-stage approach; they first group samples based on the similarity in the subset of measurements, and then they examine the association between these groups and the specified target variables without considering the inter-group similarities or alternative groupings. This can lead to cases where the strength of association depends significantly on arbitrary clustering choices. In this paper, we propose a tree-based method for quantitative association analysis. Tree hierarchies derived from sample similarities represent many possible sample groupings. They also provide a natural way to incorporate domain knowledge such as ontologies and to identify and remove outliers. Given a tree hierarchy, our association analysis evaluates all possible groupings and selects the one with strongest association to the target variable. We introduce an efficient algorithm, TreeQA, to systematically explore the search-space of all possible groupings in a set of input trees, with integrated permutation tests. Experimental results showthat TreeQA is able to handle large-scale association analysis very efficiently and is more effective and robust in association analysis than previous methods. 2008 IEEE.

name of conference

  • 2008 Eighth IEEE International Conference on Data Mining

published proceedings

  • ICDM 2008: EIGHTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS

author list (cited authors)

  • Pan, F., Yang, L., McMillan, L., de Villena, F., Threadgill, D., & Wang, W.

citation count

  • 2

complete list of authors

  • Pan, Feng||Yang, Lynda||McMillan, Leonard||de Villena, Fernando Pardo Manuel||Threadgill, David||Wang, Wei

publication date

  • December 2008