Efficient and Invariant Convolutional Neural Networks for Dense Prediction Conference Paper uri icon


  • 2017 IEEE. Convolutional neural networks have shown great success on feature extraction from raw input data such as images. Although convolutional neural networks are invariant to translations on the inputs, they are not invariant to other transformations, including rotation and flip. Recent attempts have been made to incorporate more invariance in image recognition applications, but they are not applicable to dense prediction tasks, such as image segmentation. In this paper, we propose a set of methods based on kernel rotation and flip to enable rotation and flip invariance in convolutional neural networks. The kernel rotation can be achieved on kernels of 3 3, while kernel flip can be applied on kernels of any size. By rotating in eight or four angles, the convolutional layers could produce the corresponding number of feature maps based on eight or four different kernels. By using flip, the convolution layer can produce three feature maps. By combining produced feature maps using maxout, the resource requirement could be significantly reduced while still retain the invariance properties. Experimental results demonstrate that the proposed methods can achieve various invariance at reasonable resource requirements in terms of both memory and time.

name of conference

  • 2017 IEEE International Conference on Data Mining (ICDM)

published proceedings


altmetric score

  • 3

author list (cited authors)

  • Gao, H., & Ji, S.

citation count

  • 9

complete list of authors

  • Gao, Hongyang||Ji, Shuiwang

publication date

  • November 2017