A Novel Approach to Feature Selection Using PageRank algorithm for Web Page Classification
Author(s)
Rezvani, Farhad
Soleimanian Gharehchopogh, Farhad
Document type
Text
Original Manuscript
Document language
English
Abstract
This paper proposes a novel filter-based approach that uses the PageRank algorithm to select an optimal subset of features and to compute their weights for web page classification. To evaluate the approach, multiple experiments are performed on four datasets (WebKB, Reuters-R8, Reuters-R52, and 20NewsGroups), with accuracy as the main criterion. The results show that the classifier's accuracy on the WebKB, Reuters-R8, and Reuters-R52 datasets improved significantly, from 91% up to 96%, compared to the best result achieved by other feature selection methods such as Information Gain (IG) and Chi-2. On the 20NewsGroups dataset, by contrast, accuracy showed no noticeable improvement and remained close to that of most of the compared methods. Overall, the evaluation shows that the proposed approach achieves higher accuracy scores than the feature sets selected by the other methods.
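The abstract does not say how the feature graph is built or how PageRank scores become feature weights. As a rough illustration of the general idea only (rank candidate features with PageRank, keep the top ones, and reuse their scores as weights), the following minimal Python sketch assumes a term co-occurrence graph. The graph construction, the function names cooccurrence_graph, pagerank, and select_features, and the damping factor of 0.85 are all assumptions made for this example and are not taken from the paper.

```python
from collections import defaultdict
from itertools import combinations


def cooccurrence_graph(docs):
    """Build an undirected weighted term graph (an assumed construction):
    the weight of edge (a, b) is the number of documents containing both."""
    graph = defaultdict(lambda: defaultdict(float))
    for doc in docs:
        terms = sorted(set(doc))
        for term in terms:
            graph[term]  # register the term even if it has no neighbours
        for a, b in combinations(terms, 2):
            graph[a][b] += 1.0
            graph[b][a] += 1.0
    return graph


def pagerank(graph, damping=0.85, iterations=100, tol=1e-10):
    """Weighted PageRank by power iteration; the scores sum to 1."""
    nodes = list(graph)
    if not nodes:
        return {}
    n = len(nodes)
    rank = {v: 1.0 / n for v in nodes}
    out_weight = {v: sum(graph[v].values()) for v in nodes}
    for _ in range(iterations):
        # Rank held by nodes without edges is spread uniformly.
        dangling = sum(rank[v] for v in nodes if out_weight[v] == 0.0)
        base = (1.0 - damping) / n + damping * dangling / n
        new_rank = {v: base for v in nodes}
        for v in nodes:
            if out_weight[v] == 0.0:
                continue
            for u, w in graph[v].items():
                new_rank[u] += damping * rank[v] * w / out_weight[v]
        delta = sum(abs(new_rank[v] - rank[v]) for v in nodes)
        rank = new_rank
        if delta < tol:
            break
    return rank


def select_features(docs, k):
    """Return the k highest-ranked terms with their scores as weights."""
    scores = pagerank(cooccurrence_graph(docs))
    ranked = sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))
    return ranked[:k]


# Toy tokenized corpus, invented for this example.
docs = [["web", "page", "rank"], ["web", "page"], ["web", "link"], ["solo"]]
selected = select_features(docs, k=2)
```

Here selected is a list of (term, weight) pairs. In a filter-style pipeline, only these terms would be kept as features before a classifier is trained, and the weights could scale the corresponding feature values.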
Keywords
Feature Selection
PageRank algorithm
HITS
Web Page Classification
H.3. Artificial Intelligence
Issue number
4
Publication date
2019-11-01 (1398-08-10)
Publisher
Sari Branch, Islamic Azad University
Author affiliation
Department of Computer Engineering, Urmia Branch, Islamic Azad University, Urmia, Iran
ISSN
2345-606X
2345-6078



