Quality-Based Distance Measures and Applications to Clustering

abstract

When analyzing biological data sets, a common approach is to partition the data into clusters. Examples of this include finding a subset of genes with co-regulated expression among experiments, grouping similar disease phenotypes, or implicating regions of genetic variation in disease. The ability to separate the data into subsets depends upon the structure of the distribution of points and the choice of clustering algorithm. Furthermore, the biological relevance of the clustering results is biased by the variation among the data points themselves. We introduce a mathematical quality-based distance metric which will allow all data, regardless of its error, to be included in analysis without the need to introduce a cutoff. This removes the need to exclude points or to change the dimensionality. The advantage of this approach is shown by clustering simulated data with added noise. 2006 IEEE.

name of conference

2006 IEEE/NLM Life Science Systems and Applications Workshop

authors

Dougherty, Edward

published proceedings

2006 IEEE/NLM Life Science Systems and Applications Workshop

author list (cited authors)

Taverna, D. M., Brun, M., Dougherty, E. R., & Chen, Y.

citation count

0

complete list of authors

Taverna, Darin M||Brun, Marcel||Dougherty, Edward R||Chen, Yidong

publication date

July 2006

publisher

Institute of Electrical and Electronics Engineers (IEEE) Publisher

keywords

Genetics

Digital Object Identifier (DOI)

10.1109/lssa.2006.250390

International Standard Book Number (ISBN) 10

1424402786

International Standard Book Number (ISBN) 13

9781424402786

start page

1

end page

2

URL

http://dx.doi.org/10.1109/lssa.2006.250390

Quality-Based Distance Measures and Applications to Clustering Conference Paper

Overview

abstract

name of conference

authors

published proceedings

author list (cited authors)

citation count

complete list of authors

publication date

publisher

Research

keywords

Identity

Digital Object Identifier (DOI)

International Standard Book Number (ISBN) 10

International Standard Book Number (ISBN) 13

Additional Document Info

start page

end page

Other

URL