An Automated Text Classification Method: Using Improved Fuzzy Set Approach for Feature Selection

Bushra Zaheer Abbasi, Shahid Hussain, Muhammad Imran Faisal

Research output: Chapter in Book/Report/Conference proceedingConference contribution

3 Scopus citations

Abstract

A well representing feature set that has enough differentiated power plays an important role in the classification. The existing techniques for feature set selection are mostly statistical. They are not flexible to incorporate the human reasoning and the changing requirements and preferences of the real-life systems. They only make a decision between a feature inclusion or exclusion. The fuzziness of human reasoning and thinking are not considered at all that may improve the feature selection and hence the accuracy of the classifier. Also, the selection of overlapping features in case of Local Feature Selection (LFS) methods is an important issue that negatively impacts classification accuracy. For example, in case of Odd Ratio (OR), the selection may contain overlapping features of multiple classes. In this paper, a Fuzzy Set Theory (FST) based feature selection method has been proposed. The approach aims to tackle both above mentioned issues efficiently. The selected final feature set is used to train the well-known classification algorithms and the results are compared with Global Feature Selection (GFS) and LFS methods. The comparison shows that the proposed method has improved the accuracy of the classifiers and also extract comparatively small feature set that ultimately reduces the time complexity of the system.

Original languageEnglish (US)
Title of host publicationProceedings of 2019 16th International Bhurban Conference on Applied Sciences and Technology, IBCAST 2019
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages666-670
Number of pages5
ISBN (Electronic)9781538677292
DOIs
StatePublished - Mar 13 2019
Event16th International Bhurban Conference on Applied Sciences and Technology, IBCAST 2019 - Islamabad, Pakistan
Duration: Jan 8 2019Jan 12 2019

Publication series

NameProceedings of 2019 16th International Bhurban Conference on Applied Sciences and Technology, IBCAST 2019

Conference

Conference16th International Bhurban Conference on Applied Sciences and Technology, IBCAST 2019
Country/TerritoryPakistan
CityIslamabad
Period1/8/191/12/19

All Science Journal Classification (ASJC) codes

  • Computer Science (miscellaneous)

Fingerprint

Dive into the research topics of 'An Automated Text Classification Method: Using Improved Fuzzy Set Approach for Feature Selection'. Together they form a unique fingerprint.

Cite this