Con-Text

Text Detection for Fine-Grained Object Classification

Journal Article (2017)
Author(s)

Sezer Karaoğlu (Universiteit van Amsterdam)

Ran Tao (Universiteit van Amsterdam)

J.C. Gemert (TU Delft - Pattern Recognition and Bioinformatics)

Theo Gevers (Universitat Autònoma de Barcelona, Universiteit van Amsterdam)

Research Group
Pattern Recognition and Bioinformatics
DOI related publication
https://doi.org/10.1109/TIP.2017.2707805
More Info
expand_more
Publication Year
2017
Language
English
Research Group
Pattern Recognition and Bioinformatics
Issue number
8
Volume number
26
Pages (from-to)
3965-3980

Abstract

This paper focuses on fine-grained object classification using recognized scene text in natural images. While the state-of-the-art relies on visual cues only, this paper is the first work which proposes to combine textual and visual cues. Another novelty is the textual cue extraction. Unlike the state-of-the-art text detection methods, we focus more on the background instead of text regions. Once text regions are detected, they are further processed by two methods to perform text recognition, i.e., ABBYY commercial OCR engine and a state-of-the-art character recognition algorithm. Then, to perform textual cue encoding, bi- and trigrams are formed between the recognized characters by considering the proposed spatial pairwise constraints. Finally, extracted visual and textual cues are combined for fine-grained classification. The proposed method is validated on four publicly available data sets: ICDAR03, ICDAR13, Con-Text, and Flickr-logo. We improve the state-of-the-art end-to-end character recognition by a large margin of 15% on ICDAR03. We show that textual cues are useful in addition to visual cues for fine-grained classification. We show that textual cues are also useful for logo retrieval. Adding textual cues outperforms visual- and textual-only in fine-grained classification (70.7% to 60.3%) and logo retrieval (57.4% to 54.8%).

No files available

Metadata only record. There are no files for this record.