Sequence Feature Representation Based on k-mer Tokenization: A Comparative Study of Methods and the Impact of Data Scale on Model Performance | IEEE Conference Publication