Journals & Magazines >IEEE Transactions on Pattern ... >Volume: 41 Issue: 9

ASTER: An Attentional Scene Text Recognizer with Flexible Rectification

Download PDF
Download References
Request Permissions
Save to
Alerts

Abstract:

A challenging aspect of scene text recognition is to handle text with distortions or irregular layout. In particular, perspective text and curved text are common in natur...Show More

Metadata

Abstract:

A challenging aspect of scene text recognition is to handle text with distortions or irregular layout. In particular, perspective text and curved text are common in natural scenes and are difficult to recognize. In this work, we introduce ASTER, an end-to-end neural network model that comprises a rectification network and a recognition network. The rectification network adaptively transforms an input image into a new one, rectifying the text in it. It is powered by a flexible Thin-Plate Spline transformation which handles a variety of text irregularities and is trained without human annotations. The recognition network is an attentional sequence-to-sequence model that predicts a character sequence directly from the rectified image. The whole model is trained end to end, requiring only images and their groundtruth text. Through extensive experiments, we verify the effectiveness of the rectification and demonstrate the state-of-the-art recognition performance of ASTER. Furthermore, we demonstrate that ASTER is a powerful component in end-to-end recognition systems, for its ability to enhance the detector.

Published in: IEEE Transactions on Pattern Analysis and Machine Intelligence ( Volume: 41, Issue: 9, 01 September 2019)

Page(s): 2035 - 2048

Date of Publication: 25 June 2018

ISSN Information:

PubMed ID: 29994467

DOI: 10.1109/TPAMI.2018.2848939

Funding Agency:

Keywords assist with retrieval of results and provide a means to discovering other relevant content. Learn more.

Contents

Keywords assist with retrieval of results and provide a means to discovering other relevant content. Learn more.

References is not available for this document.

ASTER: An Attentional Scene Text Recognizer with Flexible Rectification

Abstract:

Metadata

Abstract:

ISSN Information:

Funding Agency:

References

IEEE Account

Purchase Details

Profile Information

Need Help?

ASTER: An Attentional Scene Text Recognizer with Flexible Rectification

Alerts

Abstract:

Metadata

Abstract:

ISSN Information:

Funding Agency:

Authors

Figures

References

Citations

Keywords

Metrics

Footnotes

References

IEEE Account

Purchase Details

Profile Information

Need Help?