Bridging the Gap of AutoGraph Between Academia and Industry: Analyzing AutoGraph Challenge at KDD Cup 2020

Graph structured data is ubiquitous in daily life and scientific areas and has attracted increasing attention. Graph Neural Networks (GNNs) have been proved to be effective in modeling graph structured data and many variants of GNN architectures have been proposed. However, much human effort is ofte...

Full description

Bibliographic Details
Main Authors: Guyon, I. (Author), Tu, W.-W (Author), Wei, L. (Author), Xu, Z. (Author), Yao, Q. (Author), Ying, R. (Author), Zhao, H. (Author)
Format: Article
Language:English
Published: Frontiers Media S.A. 2022
Subjects:
Online Access:View Fulltext in Publisher
LEADER 02505nam a2200265Ia 4500
001 10.3389-frai.2022.905104
008 220718s2022 CNT 000 0 und d
020 |a 26248212 (ISSN) 
245 1 0 |a Bridging the Gap of AutoGraph Between Academia and Industry: Analyzing AutoGraph Challenge at KDD Cup 2020 
260 0 |b Frontiers Media S.A.  |c 2022 
856 |z View Fulltext in Publisher  |u https://doi.org/10.3389/frai.2022.905104 
520 3 |a Graph structured data is ubiquitous in daily life and scientific areas and has attracted increasing attention. Graph Neural Networks (GNNs) have been proved to be effective in modeling graph structured data and many variants of GNN architectures have been proposed. However, much human effort is often needed to tune the architecture depending on different datasets. Researchers naturally adopt Automated Machine Learning on Graph Learning, aiming to reduce human effort and achieve generally top-performing GNNs, but their methods focus more on the architecture search. To understand GNN practitioners' automated solutions, we organized AutoGraph Challenge at KDD Cup 2020, emphasizing automated graph neural networks for node classification. We received top solutions, especially from industrial technology companies like Meituan, Alibaba, and Twitter, which are already open sourced on GitHub. After detailed comparisons with solutions from academia, we quantify the gaps between academia and industry on modeling scope, effectiveness, and efficiency, and show that (1) academic AutoML for Graph solutions focus on GNN architecture search while industrial solutions, especially the winning ones in the KDD Cup, tend to obtain an overall solution (2) with only neural architecture search, academic solutions achieve on average 97.3% accuracy of industrial solutions (3) academic solutions are cheap to obtain with several GPU hours while industrial solutions take a few months' labors. Academic solutions also contain much fewer parameters. Copyright © 2022 Xu, Wei, Zhao, Ying, Yao, Tu and Guyon. 
650 0 4 |a Automated Machine Learning 
650 0 4 |a data challenge 
650 0 4 |a graph machine learning 
650 0 4 |a Graph Neural Networks 
650 0 4 |a node classification 
700 1 |a Guyon, I.  |e author 
700 1 |a Tu, W.-W.  |e author 
700 1 |a Wei, L.  |e author 
700 1 |a Xu, Z.  |e author 
700 1 |a Yao, Q.  |e author 
700 1 |a Ying, R.  |e author 
700 1 |a Zhao, H.  |e author 
773 |t Frontiers in Artificial Intelligence  |x 26248212 (ISSN)  |g 5