A two-sample test for the equality of univariate marginal distributions for high-dimensional data

abstract

2019 Elsevier Inc. A recurring theme in modern statistics is dealing with high-dimensional data whose main feature is a large number, p, of variables but a small sample size. In this context our aim is to address the problem of testing the null hypothesis that the marginal distributions of p variables are the same for two groups. We propose a test statistic motivated by the simple idea of comparing, for each of the p variables, the empirical characteristic functions computed from the two samples. The asymptotic normality of the test statistic is derived under mixing conditions. In our asymptotic analysis the number of variables tends to infinity, while the size of individual samples remains fixed. In order to obtain a practical test several estimators of the variance are proposed, leading to three somewhat different versions of the test. An alternative global test based on the P-values derived from permutation tests is also proposed. A simulation study to investigate the finite sample properties of the proposed tests is carried out, and a practical illustration involving microarray data is provided.

authors

Hart, Jeffrey

published proceedings

JOURNAL OF MULTIVARIATE ANALYSIS

author list (cited authors)

Cousido-Rocha, M., de Una-Alvarez, J., & Hart, J. D.

citation count

3

complete list of authors

Cousido-Rocha, Marta||de Una-Alvarez, Jacobo||Hart, Jeffrey D

publication date

January 2019

publisher

Elsevier Publisher

published in

Journal of Multivariate Analysis Journal

keywords

Characteristic Functions
Goodness-of-fit Tests
Mixing Conditions
Permutation Tests

Digital Object Identifier (DOI)

10.1016/j.jmva.2019.104537

start page

104537

end page

104537

volume

174

URL

http://dx.doi.org/10.1016/j.jmva.2019.104537

A two-sample test for the equality of univariate marginal distributions for high-dimensional data Academic Article

Overview

abstract

authors

published proceedings

author list (cited authors)

citation count

complete list of authors

publication date

publisher

published in

Research

keywords

Identity

Digital Object Identifier (DOI)

Additional Document Info

start page

end page

volume

Other

URL