WorldCIST'14 - The 2014 World Conference on Information Systems and Technologies

Full Program »

Assessing the quality of Thai Wikipedia articles using concept and Statistical Features

The quality evaluation of Thai Wikipedia articles relies on user con-sideration. There are increasing numbers of articles every day therefore the automatic evaluation method is needed for user. Components of Wikipedia articles such as headers, pictures, references, and links are useful to indicate the quality of articles. However readers need complete content to cover all of concepts in that article. The concept features are investigated in this work. The aim of this research is to classify Thai Wikipedia articles into two classes namely high-quality and low-quality class. Three article domains (Biography, Animal, and Place) are testes with decision tree and Naïve Bayes. We found that Naïve Bayes gets high TP Rate compared to decision tree in every domain. Moreover, we found that the concept feature is important for assessing the quality of Thai Wikipedia articles.

Author(s):

Kanchana Saengthongpattana    
Department of Computer Science, Kasetsart University
Thailand

Nuanwan Soonthornphisaj    
Department of Computer Science, Kasetsart University
Thailand

 

Powered by OpenConf®
Copyright ©2002-2013 Zakon Group LLC