TY - JOUR
T1 - A step towards the improvement in the performance of text classification
AU - Hussain, Shahid
AU - Mufti, Muhammad Rafiq
AU - Sohail, Muhammad Khalid
AU - Afzal, Humaira
AU - Ahmad, Ghufran
AU - Khan, Arif Ali
N1 - Publisher Copyright:
© 2019, Korean Society for Internet Information. All rights reserved.
PY - 2019/4/30
Y1 - 2019/4/30
N2 - The performance of text classification is highly related to the feature selection methods. Usually, two tasks are performed when a feature selection method is applied to construct a feature set; 1) assign score to each feature and 2) select the top-N features. The selection of top-N features in the existing filter-based feature selection methods is biased by their discriminative power and the empirical process which is followed to determine the value of N. In order to improve the text classification performance by presenting a more illustrative feature set, we present an approach via a potent representation learning technique, namely DBN (Deep Belief Network). This algorithm learns via the semantic illustration of documents and uses feature vectors for their formulation. The nodes, iteration, and a number of hidden layers are the main parameters of DBN, which can tune to improve the classifier’s performance. The results of experiments indicate the effectiveness of the proposed method to increase the classification performance and aid developers to make effective decisions in certain domains.
AB - The performance of text classification is highly related to the feature selection methods. Usually, two tasks are performed when a feature selection method is applied to construct a feature set; 1) assign score to each feature and 2) select the top-N features. The selection of top-N features in the existing filter-based feature selection methods is biased by their discriminative power and the empirical process which is followed to determine the value of N. In order to improve the text classification performance by presenting a more illustrative feature set, we present an approach via a potent representation learning technique, namely DBN (Deep Belief Network). This algorithm learns via the semantic illustration of documents and uses feature vectors for their formulation. The nodes, iteration, and a number of hidden layers are the main parameters of DBN, which can tune to improve the classifier’s performance. The results of experiments indicate the effectiveness of the proposed method to increase the classification performance and aid developers to make effective decisions in certain domains.
UR - http://www.scopus.com/inward/record.url?scp=85068480990&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85068480990&partnerID=8YFLogxK
U2 - 10.3837/tiis.2019.04.024
DO - 10.3837/tiis.2019.04.024
M3 - Article
AN - SCOPUS:85068480990
SN - 1976-7277
VL - 13
SP - 2162
EP - 2179
JO - KSII Transactions on Internet and Information Systems
JF - KSII Transactions on Internet and Information Systems
IS - 4
ER -