Skip to Main Content
This paper proposes a method for extracting useful expressions from English research papers. The method extracts sequences of words from research papers and refine them into phrasal expressions (PEs). We use base-phrases for acquiring such the expressions. The method extracts PEs from the set of sequences of base-phrases by using three kinds of statistical information: frequency, length, and the number of kinds of the succeeding base-phrases. In our experiment using 1,232 research papers, the precision of acquisition at the top-200 was 62.0%. The precision was higher than all of the baselines, and therefore, we confirmed the feasibility of our method.