I. Introduction
Multiple choice question (MCQ) is the most popular form of assessment for reading comprehension [1], which is widely used in many well-known tests, e.g., GRE, TOEFL, and SAT. As shown in Fig. 1, a typical MCQ contains three elements: (i) question; (ii) correct answer, which is a single gold answer, i.e., the option of (A); (iii) distractors, which are alternative answers used to interfere with the students to select the correct answer, e.g., the options of (B1) and (B2) [1]. Besides, a complete test of MCQ usually contains an article, which provides reading comprehension materials for candidates to answer a question. The quality of an MCQ depends heavily on the quality of its multiple distractors. When distractors cannot interfere with the student selecting the correct answer, an MCQ will become simple and the test will lose the ability of the assessment. However, it needs a lot of time to manually construct high-quality distractors. Therefore, automatically generating high-quality multiple distractors is an important and challenging task [2].