Neural Architecture Search for Portrait Parsing. - Texas A&M University (TAMU) Scholar

abstract

This work proposes a neural architecture search (NAS) method for portrait parsing, which is a novel up-level task based on portrait segmentation and face labeling. Recently, NAS has become an effective method in terms of automatic machine learning. However, remarkable achievements have been made only in image classification and natural language processing (NLP) areas. Meanwhile, state-of-the-art portrait segmentation and face labeling approaches are all manually designed, but few models reach a tradeoff between efficiency and performance. Thus, we are extremely interested in improving existing NAS methods for dense-per-pixel prediction tasks on portrait datasets. To achieve that, we resort to a cell-based encoder-decoder architecture with an elaborate design of connectivity structure and searching space. As a result, we achieve state-of-the-art performance on three portrait tasks, including 96.8% MIOU on EG1800 (portrait segmentation), 91.2% overall F1 -score on HELEN (face labeling), and 95.1% overall F1 -score on CelebAMask-HQ (portrait parsing) with only 2.29M model parameters. That is, our approach compares favorably with all previous works on portrait datasets. More crucially, we empirically prove that even a fundamental encoder-decoder architecture may reach an outstanding result on the aforementioned tasks with the help of the innovative approach of NAS. To the best of our knowledge, our work is also the first to report the success of applying NAS on these portrait tasks.

authors

Huang, Tingwen

published proceedings

IEEE Trans Neural Netw Learn Syst

altmetric score

0.25

author list (cited authors)

Lyu, B. o., Yang, Y., Wen, S., Huang, T., & Li, K. e.

citation count

13

complete list of authors

Lyu, Bo||Yang, Yin||Wen, Shiping||Huang, Tingwen||Li, Ke

publication date

March 2023

publisher

Institute of Electrical and Electronics Engineers (IEEE) Publisher

published in

IEEE Transactions on Neural Networks and Learning Systems Journal

keywords

Computer Architecture
Face Labeling
Face Recognition
Faces
Image Segmentation
Labeling
Neural Architecture Search (nas)
Portrait Parsing
Portrait Segmentation
Reinforcement Learning
Semantics
Task Analysis

PubMed Central ID

34410930

Digital Object Identifier (DOI)

10.1109/TNNLS.2021.3104872

start page

1112

end page

1121

volume

34

issue

3

URL

http://dx.doi.org/10.1109/tnnls.2021.3104872

Neural Architecture Search for Portrait Parsing. Academic Article

Overview

abstract

authors

published proceedings

altmetric score

author list (cited authors)

citation count

complete list of authors

publication date

publisher

published in

Research

keywords

Identity

PubMed Central ID

Digital Object Identifier (DOI)

Additional Document Info

start page

end page

volume

issue

Other

URL