Abstract:
Scene text detection and recognition have attracted much attention in recent years because of their potential applications. Detecting and recognizing text in images can suffer from scene complexity and text variations. Some of these problematic cases are included in popular benchmark datasets, but only to a limited extent. In this work, we investigate scene text detection and recognition in a domain with extreme challenges. We focus on in-the-wild signboard images, in which text commonly appears in different fonts, sizes, artistic styles, or languages against cluttered backgrounds. We first contribute an in-the-wild signboard dataset with 79K text instances at both line level and word level across 2,104 scene images. We then comprehensively evaluate recent state-of-the-art (SOTA) approaches for text detection and recognition on the dataset. By doing so, we aim to identify the limitations of current SOTA approaches on these extremely challenging cases, as well as their applicability in this domain. Code and dataset are available at https://github.com/aiclub-uit/SignboardText/ and IEEE DataPort.
Published in: IEEE Access (Volume: 12)

University of Information Technology, Ho Chi Minh City, Vietnam
Vietnam National University, Ho Chi Minh City, Vietnam
Tien Do
received the master’s degree in computer science in 2016. He is currently pursuing the Ph.D. degree. He works as a Lecturer with the Faculty of Computer Science, University of Information Technology (UIT), which is part of Vietnam National University Ho Chi Minh City (VNU-HCM). He specializes in computer vision, namely in the areas of scene text recognition and object detection.

University of Information Technology, Ho Chi Minh City, Vietnam
Vietnam National University, Ho Chi Minh City, Vietnam
Thuyen Tran
received the bachelor’s degree in computer science from the University of Information Technology in 2022. He is currently a Lecturer with the University of Information Technology. His primary research interest is computer vision, with a specific focus on optical character recognition (OCR).

University of Information Technology, Ho Chi Minh City, Vietnam
Vietnam National University, Ho Chi Minh City, Vietnam
Thua Nguyen
received the bachelor’s degree in computer science from the University of Information Technology, VNU-HCM, in 2019. Currently, he works as a Research Scientist with the Multimedia Communications Laboratory (MMLab), University of Information Technology, VNU-HCM. His research interests include computer vision, with a primary focus on facial recognition and optical character recognition.

University of Information Technology, Ho Chi Minh City, Vietnam
Vietnam National University, Ho Chi Minh City, Vietnam
Duy-Dinh Le
(Member, IEEE) received the bachelor’s and master’s degrees from the University of Science, Ho Chi Minh City, Vietnam, in 1995 and 2001, respectively, and the Ph.D. degree from The Graduate University for Advanced Studies (SOKENDAI), Japan, in 2006. He was an Associate Professor at the National Institute of Informatics (NII), Japan, from 2013 to 2016. He is currently a Scientist and a Lecturer with the University of Information Technology, Vietnam. His research interests include semantic concept detection, video analysis and indexing, pattern recognition, machine learning, and data mining.

University of Information Technology, Ho Chi Minh City, Vietnam
Vietnam National University, Ho Chi Minh City, Vietnam
Thanh Duc Ngo
received the Ph.D. degree from The Graduate University for Advanced Studies (SOKENDAI) in 2013. He has been a Lecturer with the Faculty of Computer Science, University of Information Technology (UIT), Vietnam National University Ho Chi Minh City (VNU-HCM), since 2014. His research interests include computer vision and multimedia content analysis.