Pixel Transposed Convolutional Networks. Academic Article

abstract

Transposed convolutional layers have been widely used in a variety of deep models for up-sampling, including encoder-decoder networks for semantic segmentation and deep generative models for unsupervised learning. One of the key limitations of transposed convolutional operations is that they result in the so-called checkerboard problem. This is caused by the fact that no direct relationship exists among adjacent pixels on the output feature map. To address this problem, we propose the pixel transposed convolutional layer (PixelTCL) to establish direct relationships among adjacent pixels on the up-sampled feature map. Our method is based on a fresh interpretation of the regular transposed convolutional operation. The resulting PixelTCL can be used to replace any transposed convolutional layer in a plug-and-play manner without compromising the fully trainable capabilities of original models. The proposed PixelTCL may result in a slight decrease in efficiency, but this can be overcome by an implementation trick. Experimental results on semantic segmentation demonstrate that PixelTCL can consider spatial features such as edges and shapes and yields more accurate segmentation outputs than transposed convolutional layers. When used in image generation tasks, our PixelTCL can largely overcome the checkerboard problem suffered by regular transposed convolutional operations.
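The abstract's remark that adjacent output pixels have no direct relationship can be illustrated with a small 1D sketch. It is not the authors' code and does not implement PixelTCL; the function names `transposed_conv1d` and `interleaved_view` are hypothetical, and the 1D, stride-2, unpadded setting is an assumption chosen for brevity. The sketch checks that a stride-2 transposed convolution gives the same output as interleaving two ordinary convolutions whose kernels share no weights, so neighboring output positions are produced by independent kernels.

```python
import numpy as np


def transposed_conv1d(x, w, stride):
    """Plain 1D transposed convolution (no padding), written as scatter-add."""
    x = np.asarray(x, dtype=float)
    w = np.asarray(w, dtype=float)
    n, k = len(x), len(w)
    y = np.zeros((n - 1) * stride + k)
    for i in range(n):
        # Each input element stamps a scaled copy of the kernel into the output.
        y[i * stride : i * stride + k] += x[i] * w
    return y


def interleaved_view(x, w):
    """Same result for stride 2 and an even-length kernel, built from two
    independent convolutions whose outputs are interleaved."""
    x = np.asarray(x, dtype=float)
    w = np.asarray(w, dtype=float)
    even = np.convolve(x, w[0::2])  # produces output positions 0, 2, 4, ...
    odd = np.convolve(x, w[1::2])   # produces output positions 1, 3, 5, ...
    y = np.empty(even.size + odd.size)
    y[0::2] = even
    y[1::2] = odd
    return y


if __name__ == "__main__":
    x = np.array([1.0, 2.0, 3.0])
    w = np.array([1.0, 10.0, 100.0, 1000.0])
    print(np.allclose(transposed_conv1d(x, w, 2), interleaved_view(x, w)))
```

Because the even and odd output positions depend on disjoint sets of kernel weights, nothing in the operation ties a pixel to its immediate neighbor, which is the gap the abstract says PixelTCL is designed to close.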

authors

Ji, Shuiwang

published proceedings

IEEE Trans Pattern Anal Mach Intell

author list (cited authors)

Gao, H., Yuan, H., Wang, Z., & Ji, S.

citation count

48

complete list of authors

Gao, Hongyang; Yuan, Hao; Wang, Zhengyang; Ji, Shuiwang

publication date

May 2020

publisher

Institute of Electrical and Electronics Engineers (IEEE)

published in

IEEE Transactions on Pattern Analysis and Machine Intelligence

keywords

Analytical Models
Convolution
Deep Learning
Image Generation
Image Segmentation
Kernel
Pixel Transposed Convolution
Pixel-wise Prediction
Semantics
Task Analysis
Transposed Convolution
Up-sampling

Digital Object Identifier (DOI)

10.1109/TPAMI.2019.2893965

start page

1218

end page

1227

volume

42

issue

5

URL

http://dx.doi.org/10.1109/tpami.2019.2893965
