Multiobjective Reinforcement Learning-Based Neural Architecture Search for Efficient Portrait Parsing.

abstract

This article dedicates to automatically explore efficient portrait parsing models that are easily deployed in edge computing or terminal devices. In the interest of the tradeoff between the resource cost and performance, we design the multiobjective reinforcement learning (RL)-based neural architecture search (NAS) scheme, which comprehensively balances the accuracy, parameters, FLOPs, and inference latency. Finally, under varying hyperparameter configurations, the search procedure emits a bunch of excellent objective-oriented architectures. The combination of two-stage training with precomputing and memory-resident feature maps effectively reduces the time consumption of the RL-based NAS method, so that we complete approximately 1000 search iterations in two GPU days. To accelerate the convergence of the lightweight candidate architecture, we incorporate knowledge distillation into the training of the search process. This also provides a reasonable evaluation signal to the RL controller that enables it to converge well. In the end, we conduct full training with outstanding Pareto-optimal architectures, so that a series of excellent portrait parsing models (with only approximately 0.3M parameters) is received. Furthermore, we directly transfer the architectures searched on CelebAMask-HQ (Portrait Parsing) to other portrait and face segmentation tasks. Finally, we achieve the state-of-the-art performance of 96.5% MIOU on EG1800 (portrait segmentation) and 91.6% overall F1 -score on HELEN (face labeling). That is, our models significantly surpass the artificial network on the accuracy, but with lower resource consumption and higher real-time performance.

authors

Huang, Tingwen

published proceedings

IEEE Trans Cybern

author list (cited authors)

Lyu, B. o., Wen, S., Shi, K., & Huang, T.

citation count

27

complete list of authors

Lyu, Bo||Wen, Shiping||Shi, Kaibo||Huang, Tingwen

publication date

February 2023

publisher

Institute of Electrical and Electronics Engineers (IEEE) Publisher

published in

IEEE TRANSACTIONS ON CYBERNETICS Journal

keywords

Computational Modeling
Computer Architecture
Face Labeling
Faces
Image Segmentation
Labeling
Multiobjective
Neural Architecture Search (nas)
Portrait Parsing
Portrait Segmentation
Reinforcement Learning (rl)
Task Analysis
Training

PubMed Central ID

34460412

Digital Object Identifier (DOI)

10.1109/TCYB.2021.3104866

start page

1158

end page

1169

volume

53

issue

2

URL

http://dx.doi.org/10.1109/tcyb.2021.3104866

Multiobjective Reinforcement Learning-Based Neural Architecture Search for Efficient Portrait Parsing. Academic Article

Overview

abstract

authors

published proceedings

author list (cited authors)

citation count

complete list of authors

publication date

publisher

published in

Research

keywords

Identity

PubMed Central ID

Digital Object Identifier (DOI)

Additional Document Info

start page

end page

volume

issue

Other

URL