Bayesian Variable Selection in Clustering High-Dimensional Data With Substructure

abstract

In this article we focus on clustering techniques recently proposed for high dimensional data that incorporate variable selection and extend them to the modeling of data with a known substructure, such as the structure imposed by an experimental design. Our method essentially approximates the within-group covariance by facilitating clustering without disrupting the groups defined by the experimenter. The method we adopt simultaneously determines which expression patterns are important, and which genes contribute to such patterns. We evaluate performance on simulated data and on microarray data from a colon carcinogenesis study. Selected genes are biologically consistent with current research and provide strong biological validation of the cluster configuration identified by the method. 2008 American Statistical Association and the International Biometric Society.

authors

Turner, Nancy

published proceedings

JOURNAL OF AGRICULTURAL BIOLOGICAL AND ENVIRONMENTAL STATISTICS

author list (cited authors)

Swartz, M. D., Mo, Q., Murphy, M. E., Lupton, J. R., Turner, N. D., Hong, M. Y., & Vannucci, M.

citation count

9

complete list of authors

Swartz, Michael D||Mo, Qianxing||Murphy, Mary E||Lupton, Joanne R||Turner, Nancy D||Hong, Mee Young||Vannucci, Marina

publication date

December 2008

publisher

Springer Nature Publisher

published in

Journal of Agricultural, Biological, and Environmental Statistics Journal

keywords

Bayesian Inference
Designed Experiments
Microarray Analysis

Digital Object Identifier (DOI)

10.1198/108571108X378317

start page

407

end page

423

volume

13

issue

4

URL

http%3A%2F%2Fdx.doi.org%2F10.1198%2F108571108x378317

Bayesian Variable Selection in Clustering High-Dimensional Data With Substructure Academic Article

Overview

abstract

authors

published proceedings

author list (cited authors)

citation count

complete list of authors

publication date

publisher

published in

Research

keywords

Identity

Digital Object Identifier (DOI)

Additional Document Info

start page

end page

volume

issue

Other

URL