Gene selection: a Bayesian variable selection approach. - Texas A&M University (TAMU) Scholar

abstract

UNLABELLED: Selection of significant genes via expression patterns is an important problem in microarray experiments. Owing to small sample size and the large number of variables (genes), the selection process can be unstable. This paper proposes a hierarchical Bayesian model for gene (variable) selection. We employ latent variables to specialize the model to a regression setting and uses a Bayesian mixture prior to perform the variable selection. We control the size of the model by assigning a prior distribution over the dimension (number of significant genes) of the model. The posterior distributions of the parameters are not in explicit form and we need to use a combination of truncated sampling and Markov Chain Monte Carlo (MCMC) based computation techniques to simulate the parameters from the posteriors. The Bayesian model is flexible enough to identify significant genes as well as to perform future predictions. The method is applied to cancer classification via cDNA microarrays where the genes BRCA1 and BRCA2 are associated with a hereditary disposition to breast cancer, and the method is used to identify a set of significant genes. The method is also applied successfully to the leukemia data. SUPPLEMENTARY INFORMATION: http://stat.tamu.edu/people/faculty/bmallick.html.

authors

published proceedings

Bioinformatics

author list (cited authors)

Lee, K. E., Sha, N., Dougherty, E. R., Vannucci, M., & Mallick, B. K.

citation count

279

complete list of authors

Lee, Kyeong Eun||Sha, Naijun||Dougherty, Edward R||Vannucci, Marina||Mallick, Bani K

publication date

January 2003

publisher

Oxford University Press (OUP) Publisher

published in

Bioinformatics Journal

keywords

Algorithms
Bayes Theorem
Breast Neoplasms
Gene Expression Profiling
Gene Expression Regulation, Neoplastic
Genes
Genes, Brca1
Genes, Brca2
Genetic Markers
Genetic Predisposition To Disease
Humans
Leukemia, Myeloid
Models, Genetic
Models, Statistical
Oligonucleotide Array Sequence Analysis
Precursor Cell Lymphoblastic Leukemia-lymphoma
Sample Size

Digital Object Identifier (DOI)

10.1093/bioinformatics/19.1.90

start page

90

end page

97

volume

19

issue

1

URL

http%3A%2F%2Fdx.doi.org%2F10.1093%2Fbioinformatics%2F19.1.90

Gene selection: a Bayesian variable selection approach. Academic Article

Overview

abstract

authors

published proceedings

author list (cited authors)

citation count

complete list of authors

publication date

publisher

published in

Research

keywords

Identity

Digital Object Identifier (DOI)

Additional Document Info

start page

end page

volume

issue

Other

URL