Skip to Main Content
This work presents the design and development of a web-based system that supports cross-language similarity analysis and plagiarism detection. A suspicious document dq in a language Lq is to be submitted to the system via a PHP web-based interface. The system will accept the text through either uploading or pasting it directly to a text-area. In order to lighten large texts and provide an ideal set of queries, we introduce the idea of query document reduction via summarisation. Our proposed system utilised a fuzzy swarm-based summarisation tool originally built in Java. Then, the summary is used as a query to find similar web resources in languages Lx other than Lq via a dictionary-based translation. Thereafter, a detailed similarity analysis across the languages Lq and Lx is performed and friendly report of results is produced. Such report has global similarity score on the whole document, which assures high flexibility of utilisation.
Date of Conference: Nov. 29 2010-Dec. 1 2010