바로가기메뉴

본문 바로가기 주메뉴 바로가기

ACOMS+ 및 학술지 리포지터리 설명회

  • 한국과학기술정보연구원(KISTI) 서울분원 대회의실(별관 3층)
  • 2024년 07월 03일(수) 13:30
 

logo

Sketch Recognition Using LSTM with Attention Mechanism and Minimum Cost Flow Algorithm

INTERNATIONAL JOURNAL OF CONTENTS / INTERNATIONAL JOURNAL OF CONTENTS, (P)1738-6764; (E)2093-7504
2019, v.15 no.4, pp.8-15
https://doi.org/10.5392/IJoC.2019.15.4.008
Bac Nguyen-Xuan (전남대학교)
이귀상 (전남대학교)

Abstract

This paper presents a solution of the ‘Quick, Draw! Doodle Recognition Challenge’ hosted by Google. Doodles are drawings comprised of concrete representational meaning or abstract lines creatively expressed by individuals. In this challenge, a doodle is presented as a sequence of sketches. From the view of at the sketch level, to learn the pattern of strokes representing a doodle, we propose a sequential model stacked with multiple convolution layers and Long Short-Term Memory (LSTM) cells following the attention mechanism [15]. From the view at the image level, we use multiple models pre-trained on ImageNet to recognize the doodle. Finally, an ensemble and a post-processing method using the minimum cost flow algorithm are introduced to combine multiple models in achieving better results. In this challenge, our solutions garnered 11th place among 1,316 teams. Our performance was 0.95037 MAP@3, only 0.4% lower than the winner. It demonstrates that our method is very competitive. The source code for this competition is published at: https://github.com/ngxbac/Kaggle-QuickDraw.

keywords
Sketch Recognition, Deep Learning, Attention Mechanism, Long-Short Term Memory, Minimum Cost Flow.

INTERNATIONAL JOURNAL OF CONTENTS