Journal of Software, Vol 5, No 12 (2010), 1334-1341, Dec 2010
doi:10.4304/jsw.5.12.1334-1341

An Automated Error Detection for News Webpages of Chinese Portal

Deng-Yiv Chiu, Chi-Chung Lee, Ya-Chen Pan

Abstract


Some news items on Chinese web portals are obviously classified into incorrect categories. This is mainly due to the difficulty of automatically classifying Chinese news and to the fact that news appearing on web portals is retrieved from numerous media sources. This study integrates genetic algorithms and multi-class support vector machine (SVM) classifiers to construct an automated approach for detecting classification errors in Chinese news. A genetic algorithm is used to select four feature thresholds, which are used to obtain representative features (words) for each class. A multi-class SVM classifier is then trained to aid automated classification error detection. The experiment applies the proposed method to the Chinese news on Taiwan Yahoo!
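The two steps named in the abstract can be illustrated with a minimal sketch, which is not the authors' implementation. The abstract does not say what the four thresholds measure or how candidate thresholds are scored, so the fitness function below (`toy_fitness`), the GA settings, and all function names are assumptions made for illustration only. The SVM itself is not implemented: the detection step simply takes the labels a trained classifier would predict and flags items whose portal category disagrees.

```python
import random
from typing import Callable, List, Sequence, Tuple


def evolve_thresholds(
    fitness: Callable[[Sequence[float]], float],
    n_thresholds: int = 4,
    pop_size: int = 20,
    generations: int = 30,
    mutation_rate: float = 0.2,
    seed: int = 0,
) -> Tuple[List[float], float]:
    """Search for a threshold vector in [0, 1]^n that maximizes `fitness`.

    All settings here are hypothetical defaults, not values from the paper.
    """
    rng = random.Random(seed)
    population = [
        [rng.random() for _ in range(n_thresholds)] for _ in range(pop_size)
    ]
    for _ in range(generations):
        # Keep the better half unchanged (elitism), so the best score never drops.
        ranked = sorted(population, key=fitness, reverse=True)
        elite = ranked[: pop_size // 2]
        children = []
        while len(elite) + len(children) < pop_size:
            parent_a, parent_b = rng.sample(elite, 2)
            cut = rng.randrange(1, n_thresholds)  # one-point crossover
            child = parent_a[:cut] + parent_b[cut:]
            # Gaussian mutation on some genes, clamped back into [0, 1].
            child = [
                min(1.0, max(0.0, g + rng.gauss(0.0, 0.1)))
                if rng.random() < mutation_rate
                else g
                for g in child
            ]
            children.append(child)
        population = elite + children
    best = max(population, key=fitness)
    return best, fitness(best)


def toy_fitness(thresholds: Sequence[float]) -> float:
    """Stand-in fitness (assumption): closeness to an arbitrary target vector.

    In a real pipeline this would be replaced by a measure of how well a
    classifier performs with the features these thresholds select.
    """
    target = (0.1, 0.3, 0.5, 0.7)
    return -sum((t - g) ** 2 for t, g in zip(thresholds, target))


def flag_misclassified(
    portal_labels: Sequence[str], predicted_labels: Sequence[str]
) -> List[int]:
    """Return indices of items whose portal category differs from the prediction."""
    return [
        i
        for i, (portal, predicted) in enumerate(zip(portal_labels, predicted_labels))
        if portal != predicted
    ]
```

Calling `evolve_thresholds(toy_fitness)` returns a four-element threshold list and its score. `flag_misclassified` returns the positions of the news items that would be reported as possible classification errors.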



Keywords


multi-class support vector machine, genetic algorithm, news classification error detection


Journal of Software (JSW, ISSN 1796-217X)

Copyright © 2006-2012 by ACADEMY PUBLISHER – All rights reserved.