Graph Representation Learning via Hard and Channel-Wise Attention Networks

abstract

2019 Association for Computing Machinery. Attention operators have been widely applied in various fields, including computer vision, natural language processing, and network embedding learning. Attention operators on graph data enables learnable weights when aggregating information from neighboring nodes. However, graph attention operators (GAOs) consume excessive computational resources, preventing their applications on large graphs. In addition, GAOs belong to the family of soft attention, instead of hard attention, which has been shown to yield better performance. In this work, we propose novel hard graph attention operator (hGAO) and channel-wise graph attention operator (cGAO). hGAO uses the hard attention mechanism by attending to only important nodes. Compared to GAO, hGAO improves performance and saves computational cost by only attending to important nodes. To further reduce the requirements on computational resources, we propose the cGAO that performs attention operations along channels. cGAO avoids the dependency on the adjacency matrix, leading to dramatic reductions in computational resource requirements. Experimental results demonstrate that our proposed deep models with the new operators achieve consistently better performance. Comparison results also indicates that hGAO achieves significantly better performance than GAO on both node and graph embedding tasks. Efficiency comparison shows that our cGAO leads to dramatic savings in computational resources, making them applicable to large graphs.

name of conference

Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining

authors

Ji, Shuiwang

published proceedings

KDD'19: PROCEEDINGS OF THE 25TH ACM SIGKDD INTERNATIONAL CONFERENCCE ON KNOWLEDGE DISCOVERY AND DATA MINING

author list (cited authors)

Gao, H., & Ji, S.

citation count

18

complete list of authors

Gao, Hongyang||Ji, Shuiwang

publication date

July 2019

publisher

Association for Computing Machinery (ACM) Publisher

keywords

Channel-wise Attention
Graph Neural Networks
Hard Attention

Digital Object Identifier (DOI)

10.1145/3292500.3330897

International Standard Book Number (ISBN) 13

9781450362016

start page

741

end page

749

URL

http://dx.doi.org/10.1145/3292500.3330897

Graph Representation Learning via Hard and Channel-Wise Attention Networks Conference Paper

Overview

abstract

name of conference

authors

published proceedings

author list (cited authors)

citation count

complete list of authors

publication date

publisher

Research

keywords

Identity

Digital Object Identifier (DOI)

International Standard Book Number (ISBN) 13

Additional Document Info

start page

end page

Other

URL