Language identification is an important task for Web information retrieval services. This paper presents the implementation of a platform for language identification in multilingual documents on the Web. The platform consists of five modules that carry out the tasks automatically. Artificial neural networks (ANNs) are used to identify the languages in multilingual documents. Results are presented for six languages, including Turkish, French, Italian, Danish and German. The major benefit of the approach is that the ANN-based language identification system, with the help of the developed platform, can meet the accuracy expectations of real-time language identification. Experiments show that the system achieves high accuracy in discriminating between different languages and in converting them into other languages on Web pages.
Date of Conference: 13-15 Dec. 2007