Wrapper and Hybrid Feature Selection Methods Using Metaheuristic Algorithms for English Text Classification: A Systematic Review

Osamah Mohammed Alyasiri; Yu-N Cheah; Ammar Kamal Abasi; Omar Mustafa Al-Janabi

doi:10.1109/ACCESS.2022.3165814

Details

Publication Date

Sat Jan 01 2022

Journal Name

Ieee Access

Volume

10

DOI

10.1109/ACCESS.2022.3165814

Choose Citation Style

Statistics

View publication

49

View original publication

2

Click abstract more

2

View pdf

5

Statistics

(72)

(58)

Wrapper and Hybrid Feature Selection Methods Using Metaheuristic Algorithms for English Text Classification: A Systematic Review

Metaheuristics

Feature extraction

Text categorization

Classification algorithms

Systematics

Search problems

Business

Osamah Mohammed Alyasiri

Yu-N Cheah

Ammar Kamal Abasi

Omar Mustafa Al-Janabi

...Show More Authors

Feature selection (FS) constitutes a series of processes used to decide which relevant features/attributes to include and which irrelevant features to exclude for predictive modeling. It is a crucial task that aids machine learning classifiers in reducing error rates, computation time, overfitting, and improving classification accuracy. It has demonstrated its efficacy in myriads of domains, ranging from its use for text classification (TC), text mining, and image recognition. While there are many traditional FS methods, recent research efforts have been devoted to applying metaheuristic algorithms as FS techniques for the TC task. However, there are few literature reviews concerning TC. Therefore, a comprehensive overview was systematically studied by exploring available studies of different metaheuristic algorithms used for FS to improve TC. This paper will contribute to the body of existing knowledge by answering four research questions (RQs): 1) What are the different approaches of FS that apply metaheuristic algorithms to improve TC? 2) Does applying metaheuristic algorithms for TC lead to better accuracy than the typical FS methods? 3) How effective are the modified, hybridized metaheuristic algorithms for text FS problems?, and 4) What are the gaps in the current studies and their future directions? These RQs led to a study of recent works on metaheuristic-based FS methods, their contributions, and limitations. Hence, a final list of thirty-seven (37) related articles was extracted and investigated to align with our RQs to generate new knowledge in the domain of study. Most of the conducted papers focused on addressing the TC in tandem with metaheuristic algorithms based on the wrapper and hybrid FS approaches. Future research should focus on using a hybrid-based FS approach as it intuitively handles complex optimization problems and potentiality provide new research opportunities in this rapidly developing field.

View Publication Preview PDF

Quick Preview PDF

Publication Date

Sat Jul 06 2024

Journal Name

Multimedia Tools And Applications

Text classification based on optimization feature selection methods: a review and future directions

Text mining Text classification Text categorization Feature selection Optimization algorithms Machine learning classifiers

Osamah Mohammed

Yu-N

Hao

Omar Mustafa

Ammar Kamal

...Show More Authors

A substantial portion of today’s multimedia data exists in the form of unstructured text. However, the unstructured nature of text poses a significant task in meeting users’ information requirements. Text classification (TC) has been extensively employed in text mining to facilitate multimedia data processing. However, accurately categorizing texts becomes challenging due to the increasing presence of non-informative features within the corpus. Several reviews on TC, encompassing various feature selection (FS) approaches to eliminate non-informative features, have been previously published. However, these reviews do not adequately cover the recently explored approaches to TC problem-solving utilizing FS, such as optimization techniques.

View Publication Preview PDF

(15)

(12)

Publication Date

Wed Sep 23 2020

Journal Name

Artificial Intelligence Research

Hybrid approaches to feature subset selection for data classification in high-dimensional feature space

Maysa

John Q

...Show More Authors

This paper proposes two hybrid feature subset selection approaches based on the combination (union or intersection) of both supervised and unsupervised filter approaches before using a wrapper, aiming to obtain low-dimensional features with high accuracy and interpretability and low time consumption. Experiments with the proposed hybrid approaches have been conducted on seven high-dimensional feature datasets. The classifiers adopted are support vector machine (SVM), linear discriminant analysis (LDA), and K-nearest neighbour (KNN). Experimental results have demonstrated the advantages and usefulness of the proposed methods in feature subset selection in high-dimensional space in terms of the number of selected features and time spe

View Publication

Publication Date

Mon Jul 01 2024

Journal Name

Journal Of Engineering

Efficient Intrusion Detection Through the Fusion of AI Algorithms and Feature Selection Methods

Intrusion Detection System (IDS)

Machine learning

Naïve bayes

K-Nearest Neighbor (KNN)

Decision tree

Feature selection

Muna Hadi

...Show More Authors

With the proliferation of both Internet access and data traffic, recent breaches have brought into sharp focus the need for Network Intrusion Detection Systems (NIDS) to protect networks from more complex cyberattacks. To differentiate between normal network processes and possible attacks, Intrusion Detection Systems (IDS) often employ pattern recognition and data mining techniques. Network and host system intrusions, assaults, and policy violations can be automatically detected and classified by an Intrusion Detection System (IDS). Using Python Scikit-Learn the results of this study show that Machine Learning (ML) techniques like Decision Tree (DT), Naïve Bayes (NB), and K-Nearest Neighbor (KNN) can enhance the effectiveness of an Intrusi

View Publication Preview PDF

(5)

(1)

Publication Date

Fri Jan 01 2021

Journal Name

Computers, Materials & Continua

A New Hybrid Feature Selection Method Using T-test and Fitness Function

Husam

...Show More Authors

View Publication

(11)

(8)

Publication Date

Sun Jan 30 2022

Journal Name

Iraqi Journal Of Science

A Survey on Arabic Text Classification Using Deep and Machine Learning Algorithms

Farah A.

Nada A.Z.

...Show More Authors

Text categorization refers to the process of grouping text or documents into classes or categories according to their content. Text categorization process consists of three phases which are: preprocessing, feature extraction and classification. In comparison to the English language, just few studies have been done to categorize and classify the Arabic language. For a variety of applications, such as text classification and clustering, Arabic text representation is a difficult task because Arabic language is noted for its richness, diversity, and complicated morphology. This paper presents a comprehensive analysis and a comparison for researchers in the last five years based on the dataset, year, algorithms and the accuracy th

(18)

(8)

Publication Date

Wed Mar 16 2022

Journal Name

2022 Muthanna International Conference On Engineering Science And Technology (micest)

A hybrid feature selection technique using chi-square with genetic algorithm

Kadhim A.I.

...Show More Authors

View Publication

(2)

Publication Date

Tue May 06 2025

Journal Name

Algorithms

Speech Enhancement Algorithms: A Systematic Literature Review

Sally Taha

Basheera M.

...Show More Authors

A growing and pressing need for Speech Enhancement Algorithms (SEAs) has emerged with the proliferation of hearing devices and mobile devices that aim to improve speech intelligibility without sacrificing speech quality. Recently, a tremendous number of studies have been conducted in the field of speech enhancement. This study aims to map the field of speech enhancement by conducting a systematic literature review to provide comprehensive details of recently proposed SEAs. This systematic review aims to highlight research trends in SEAs and direct researchers to the most important topics published between 2015 and 2024. It attempts to address seven key research questions related to this topic. Moreover, it covers articles available

View Publication

(15)

(14)

Publication Date

Wed Jan 01 2025

Journal Name

Aip Conference Proceedings

A review on hybrid methods using playfair and RSA techniques

Saja Mohammed

Zaynab Anwer

...Show More Authors

View Publication

(2)

(1)

Publication Date

Tue Jul 09 2024

Journal Name

Diagnostics

A Novel Hybrid Machine Learning-Based System Using Deep Learning Techniques and Meta-Heuristic Algorithms for Various Medical Datatypes Classification

deep learning

autoencoder

classification

medical dataset

COVID-19

brain tumor

meta-heuristic algorithm

Yezi Ali

Mehmet Serdar

Alok

...Show More Authors

Medicine is one of the fields where the advancement of computer science is making significant progress. Some diseases require an immediate diagnosis in order to improve patient outcomes. The usage of computers in medicine improves precision and accelerates data processing and diagnosis. In order to categorize biological images, hybrid machine learning, a combination of various deep learning approaches, was utilized, and a meta-heuristic algorithm was provided in this research. In addition, two different medical datasets were introduced, one covering the magnetic resonance imaging (MRI) of brain tumors and the other dealing with chest X-rays (CXRs) of COVID-19. These datasets were introduced to the combination network that contained deep lea

View Publication

(8)

(10)

Publication Date

Sat Aug 01 2026

Journal Name

International Journal Of Data And Network Science

Multi-objective of wind-driven optimization as feature selection and clustering to enhance text clustering

Text Clustering

Multi-Objectives

Wind Driven Optimization

K-Means

Unsupervised Feature Selection

Meta-heuristics optimization

MEHDI G. DUAIMI

Bsoul,Q.

AL-Gburi, A.

...Show More Authors

Text Clustering consists of grouping objects of similar categories. The initial centroids influence operation of the system with the potential to become trapped in local optima. The second issue pertains to the impact of a huge number of features on the determination of optimal initial centroids. The problem of dimensionality may be reduced by feature selection. Therefore, Wind Driven Optimization (WDO) was employed as Feature Selection to reduce the unimportant words from the text. In addition, the current study has integrated a novel clustering optimization technique called the WDO (Wasp Swarm Optimization) to effectively determine the most suitable initial centroids. The result showed the new meta-heuristic which is WDO was employed as t

View Publication Preview PDF

(1)

1 2 3 4 ... 2351 2352 2353 2354