Rank discriminants for predicting phenotypes from RNA expression

abstract

Institute of Mathematical Statistics, 2014. Statistical methods for analyzing large-scale biomolecular data are commonplace in computational biology. A notable example is phenotype prediction from gene expression data, for instance, detecting human cancers, differentiating subtypes and predicting clinical outcomes. Still, clinical applications remain scarce. One reason is that the complexity of the decision rules that emerge from standard statistical learning impedes biological understanding, in particular, any mechanistic interpretation. Here we explore decision rules for binary classification utilizing only the ordering of expression among several genes; the basic building blocks are then two-gene expression comparisons. The simplest example, just one comparison, is the TSP classifier, which has appeared in a variety of cancer-related discovery studies. Decision rules based on multiple comparisons can better accommodate class heterogeneity, and thereby increase accuracy, and might provide a link with biological mechanism. We consider a general framework (rank-in-context) for designing discriminant functions, including a data-driven selection of the number and identity of the genes in the support (context). We then specialize to two examples: voting among several pairs and comparing the median expression in two groups of genes. Comprehensive experiments assess accuracy relative to other, more complex, methods, and reinforce earlier observations that simple classifiers are competitive.

authors

Braga Neto, Ulisses

published proceedings

The Annals of Applied Statistics

altmetric score

5

author list (cited authors)

Afsari, B., Braga-Neto, U. M., & Geman, D.

citation count

21

complete list of authors

Afsari, Bahman||Braga-Neto, Ulisses M||Geman, Donald

publication date

September 2014

publisher

Institute of Mathematical Statistics Publisher

published in

Annals of Applied Statistics Journal

keywords

Cancer
Generic Health Relevance
Genetics
Human Genome

Digital Object Identifier (DOI)

10.1214/14-aoas738

URI

https://hdl.handle.net/1969.1/184776

start page

1469

end page

1491

volume

8

issue

3

URL

http://dx.doi.org/10.1214/14-AOAS738

Rank discriminants for predicting phenotypes from RNA expression

Overview

abstract

authors

published proceedings

altmetric score

author list (cited authors)

citation count

complete list of authors

publication date

publisher

published in

Research

keywords

Identity

Digital Object Identifier (DOI)

URI

Additional Document Info

start page

end page

volume

issue

Other

URL