I. Introduction
News categorization (NC) is an application field for natural language processing (NLP). The task of NC is to extract the characteristics from raw texts and then predict their categories based on these features. However, with the exponential growth of information, it becomes harder to classify these news data by individual. Thus, NC technology has attracted increasing attention in recent years.