Optimal variable selection in multi-group sparse discriminant analysis (Academic Article)

abstract

  • 2015, Institute of Mathematical Statistics. All rights reserved. This article considers the problem of multi-group classification in the setting where the number of variables p is larger than the number of observations n. Several methods have been proposed in the literature to address this problem; however, their variable selection performance is either unknown or suboptimal relative to the results known for the two-group case. In this work, we provide sharp conditions for the consistent recovery of relevant variables in the multi-group case using the discriminant analysis proposal of Gaynanova et al. [7]. We achieve rates of convergence that attain the optimal scaling of the sample size n, the number of variables p, and the sparsity level s. These rates are significantly faster than the best known results in the multi-group case. Moreover, they coincide with the minimax optimal rates for the two-group case. We validate our theoretical results with numerical analysis.

published proceedings

  • ELECTRONIC JOURNAL OF STATISTICS

altmetric score

  • 1

author list (cited authors)

  • Gaynanova, I., & Kolar, M.

citation count

  • 9

complete list of authors

  • Gaynanova, Irina; Kolar, Mladen

publication date

  • January 2015