
dc.contributor.author: Cordova, Manuel
dc.contributor.author: Pinto, Allan
dc.contributor.author: Pedrini, Helio
dc.contributor.author: Torres, Ricardo Da Silva
dc.date.accessioned: 2021-01-21T08:35:55Z
dc.date.available: 2021-01-21T08:35:55Z
dc.date.created: 2020-12-28T18:10:52Z
dc.date.issued: 2020
dc.identifier.citation: IEEE Access. 2020, 8 223172-223188. [en_US]
dc.identifier.issn: 2169-3536
dc.identifier.uri: https://hdl.handle.net/11250/2724044
dc.description.abstract: Scene text detection has become an important field in the computer vision area due to the increasing number of applications. This is a very challenging problem as textual elements are commonly found in “noisy” and complex natural scenes. Another issue refers to the presence of texts encoded into different languages within the same image. State-of-the-art solutions rely on the use of deep neural network approaches or even ensembles of them. However, such solutions are associated with “heavy” models, which are computationally expensive in terms of memory and storage footprints, which hampers their use in real-time mobile applications. In this work, we introduce Pelee-Text++, a lightweight neural network architecture for multi-lingual multi-oriented scene text detection, especially tailored to running on devices with computational restrictions. Additionally, to the best of our knowledge, this is the first work to evaluate the performance of text detection methods in commercial smartphones. Over this scenario, Pelee-Text++ processes 2.94 frames per second and it is the only evaluated approach that did not cause memory issues on smartphones, even using an input image of 1024 × 1024 pixels. Our proposal achieves a promising trade-off between efficiency and effectiveness, with a model size of 27 Megabytes and F-measure of 91.20%, 85.78%, 81.72%, 80.30%, 82.53% and 66.51% on ICDAR 2011, ICDAR 2013, ICDAR 2015, MSRA-TD500, ReCTS 2019 and Multi-lingual 2019 datasets, respectively. [en_US]
dc.language.iso: eng [en_US]
dc.publisher: Institute of Electrical and Electronics Engineers - IEEE [en_US]
dc.rights.uri: https://creativecommons.org/licenses/by/4.0/
dc.title: Pelee-Text++: A Tiny Neural Network for Scene Text Detection [en_US]
dc.type: Journal article [en_US]
dc.type: Peer reviewed [en_US]
dc.description.version: publishedVersion [en_US]
dc.source.pagenumber: 223172-223188 [en_US]
dc.source.volume: 8 [en_US]
dc.source.journal: IEEE Access [en_US]
dc.identifier.doi: 10.1109/ACCESS.2020.3043813
dc.identifier.cristin: 1863616
dc.rights.license: https://creativecommons.org/licenses/by/4.0/
cristin.ispublished: true
cristin.fulltext: original
cristin.qualitycode: 1



Except where otherwise noted, this item is licensed under https://creativecommons.org/licenses/by/4.0/