Sequential Transformer for End-to-End Video Text Detection | IEEE Conference Publication | IEEE Xplore