Skip to Main Content
Accelerated growth of the Internet has enabled users worldwide to share their feelings and experiences. User-generated content (UGC) websites are the most abundant sources of user reviews. Accurately identifying sentiment phrases is essential to understand the expressed opinions in user reviews. To achieve this, part-of-speech (POS) patterns of phrases are useful. However, previous studies for Chinese opinion extraction only translate English POS patterns directly into Chinese for this task without considering the feasibility. Therefore, this work proposes a Chinese opinion extraction method that exploits the observed Sinica Treebank POS patterns for sentiment phrase identification. Sinica Treebank is a widely representative POS corpus for Chinese. The results of preliminary experiments indicate that the proposed method is highly effective in extracting opinions from Chinese UGC reviews.
Date of Conference: 5-8 July 2010