바로가기메뉴

본문 바로가기 주메뉴 바로가기

logo

Sketch Recognition Using LSTM with Attention Mechanism and Minimum Cost Flow Algorithm

INTERNATIONAL JOURNAL OF CONTENTS / INTERNATIONAL JOURNAL OF CONTENTS, (P)1738-6764; (E)2093-7504
2019, v.15 no.4, pp.8-15
https://doi.org/10.5392/IJoC.2019.15.4.008
Bac Nguyen-Xuan

Abstract

This paper presents a solution of the ‘Quick, Draw! Doodle Recognition Challenge’ hosted by Google. Doodles are drawings comprised of concrete representational meaning or abstract lines creatively expressed by individuals. In this challenge, a doodle is presented as a sequence of sketches. From the view of at the sketch level, to learn the pattern of strokes representing a doodle, we propose a sequential model stacked with multiple convolution layers and Long Short-Term Memory (LSTM) cells following the attention mechanism [15]. From the view at the image level, we use multiple models pre-trained on ImageNet to recognize the doodle. Finally, an ensemble and a post-processing method using the minimum cost flow algorithm are introduced to combine multiple models in achieving better results. In this challenge, our solutions garnered 11th place among 1,316 teams. Our performance was 0.95037 MAP@3, only 0.4% lower than the winner. It demonstrates that our method is very competitive. The source code for this competition is published at: https://github.com/ngxbac/Kaggle-QuickDraw.

keywords
Sketch Recognition, Deep Learning, Attention Mechanism, Long-Short Term Memory, Minimum Cost Flow.

INTERNATIONAL JOURNAL OF CONTENTS