A Statistical Test for Intrinsically Multivariate Predictive Genes
Conference Paper
Overview
Research
Identity
Additional Document Info
Other
View All
Overview
abstract
Canalizing genes possess broad regulatory power over biological processes. In a previous publication, it was hypothesized that canalizing genes may be identified as those that possess a large number of intrinsically multivariate predictive (IMP) gene sets. An IMP set is a predictor set that predicts well the target, which cannot however be predicted by any of the proper subsets in the IMP set. The IMP property is defined mathematically in terms of a score based on the binary Coefficient of Determination (CoD), but there was until now no rigorous statistical procedure to obtain a test for a nonzero IMP score. In this paper, we address this by assuming a stochastic regulation model and providing an Intersection-Union Test (IUT) based on likelihood-ratio tests for the individual model parameters. We derive exact analytical formulas for the rejection region and p-value for this test. We address the issue of multiplicity of tests due to a large number of candidate predictor sets and predictive logics by means of FWER- and FDR-controlling approaches. The methodology is demonstrated by application to a real melanoma gene-expression data set, which shows that DUSP1, a canalizing gene over important biological processes in melanoma, displayed a large number of significant IMP sets, after correction by either of the multiple testing procedures. 2012 IEEE.
name of conference
Proceedings 2012 IEEE International Workshop on Genomic Signal Processing and Statistics (GENSIPS)