Feature Selection Based on Structured Sparsity: A Comprehensive Study.

abstract

Feature selection (FS) is an important component of many pattern recognition tasks. In these tasks, one is often confronted with very high-dimensional data. FS algorithms are designed to identify the relevant feature subset from the original features, which can facilitate subsequent analysis, such as clustering and classification. Structured sparsity-inducing feature selection (SSFS) methods have been widely studied in the last few years, and a number of algorithms have been proposed. However, there is no comprehensive study concerning the connections between different SSFS methods, and how they have evolved. In this paper, we attempt to provide a survey on various SSFS methods, including their motivations and mathematical representations. We then explore the relationship among different formulations and propose a taxonomy to elucidate their evolution. We group the existing SSFS methods into two categories, i.e., vector-based feature selection (feature selection based on lasso) and matrix-based feature selection (feature selection based on lr,p-norm). Furthermore, FS has been combined with other machine learning algorithms for specific applications, such as multitask learning, multilabel learning, multiview learning, classification, and clustering. This paper not only compares the differences and commonalities of these methods based on regression and regularization strategies, but also provides useful guidelines to practitioners working in related fields to guide them how to do feature selection.

authors

Ji, Shuiwang

published proceedings

IEEE Trans Neural Netw Learn Syst

altmetric score

0.25

author list (cited authors)

Jie Gui, .., Zhenan Sun, .., Shuiwang Ji, .., Dacheng Tao, .., & Tieniu Tan.

citation count

52

publication date

July 2017

publisher

Institute of Electrical and Electronics Engineers (IEEE) Publisher

published in

n2162-237XISSN Journal

keywords

Dimensionality Reduction
Feature Selection
Sparse
Structured Sparsity

Digital Object Identifier (DOI)

10.1109/TNNLS.2016.2551724

start page

1490

end page

1507

volume

28

issue

7

URL

http://dx.doi.org/10.1109/tnnls.2016.2551724

Feature Selection Based on Structured Sparsity: A Comprehensive Study. Academic Article

Overview

abstract

authors

published proceedings

altmetric score

author list (cited authors)

citation count

publication date

publisher

published in

Research

keywords

Identity

Digital Object Identifier (DOI)

Additional Document Info

start page

end page

volume

issue

Other

URL