Wrapper and Hybrid Feature Selection Methods Using Metaheuristic Algorithms for English Text Classification: A Systematic Review

Osamah Mohammed Alyasiri; Yu-N Cheah; Ammar Kamal Abasi; Omar Mustafa Al-Janabi

doi:10.1109/ACCESS.2022.3165814

Details

Publication Date

Sat Jan 01 2022

Journal Name

Ieee Access

Volume

10

DOI

10.1109/ACCESS.2022.3165814

Choose Citation Style

Statistics

View publication

49

View original publication

2

Click abstract more

2

View pdf

5

Statistics

(72)

(58)

Wrapper and Hybrid Feature Selection Methods Using Metaheuristic Algorithms for English Text Classification: A Systematic Review

Metaheuristics

Feature extraction

Text categorization

Classification algorithms

Systematics

Search problems

Business

Osamah Mohammed Alyasiri

Yu-N Cheah

Ammar Kamal Abasi

Omar Mustafa Al-Janabi

...Show More Authors

Feature selection (FS) constitutes a series of processes used to decide which relevant features/attributes to include and which irrelevant features to exclude for predictive modeling. It is a crucial task that aids machine learning classifiers in reducing error rates, computation time, overfitting, and improving classification accuracy. It has demonstrated its efficacy in myriads of domains, ranging from its use for text classification (TC), text mining, and image recognition. While there are many traditional FS methods, recent research efforts have been devoted to applying metaheuristic algorithms as FS techniques for the TC task. However, there are few literature reviews concerning TC. Therefore, a comprehensive overview was systematically studied by exploring available studies of different metaheuristic algorithms used for FS to improve TC. This paper will contribute to the body of existing knowledge by answering four research questions (RQs): 1) What are the different approaches of FS that apply metaheuristic algorithms to improve TC? 2) Does applying metaheuristic algorithms for TC lead to better accuracy than the typical FS methods? 3) How effective are the modified, hybridized metaheuristic algorithms for text FS problems?, and 4) What are the gaps in the current studies and their future directions? These RQs led to a study of recent works on metaheuristic-based FS methods, their contributions, and limitations. Hence, a final list of thirty-seven (37) related articles was extracted and investigated to align with our RQs to generate new knowledge in the domain of study. Most of the conducted papers focused on addressing the TC in tandem with metaheuristic algorithms based on the wrapper and hybrid FS approaches. Future research should focus on using a hybrid-based FS approach as it intuitively handles complex optimization problems and potentiality provide new research opportunities in this rapidly developing field.

View Publication Preview PDF

Quick Preview PDF

Publication Date

Tue Jan 01 2013

Journal Name

Al-mustansiriyah Journal Of Science

Encrypting a Text by Using Affine Cipher and Hiding it in the Colored Image by Using the Quantization stage

Salah Taha

Nada Abdul Aziz

...Show More Authors

ST Alawi, NA Mustafa, Al-Mustansiriyah Journal of Science, 2013

View Publication

Publication Date

Mon Jun 01 2026

Journal Name

Iraqi Journal For Computers And Informatics

Explainable Federated Learning for Brain Tumor Classification Using Multi-Source MRI Data

Brain Tumor Classification

Magnetic Resonance Imaging (MRI)

Federated Learning FL

Non-IID

Suhad

Belal

...Show More Authors

Early diagnosis and clinical decision-making depend on accurate brain tumor classification using magnetic resonance imaging (MRI). However, traditional deep learning methods usually rely on centralized medical data, which raises privacy concerns and limits the use of distributed clinical data. This research proposes a privacy-preserving federated learning framework for MRI image-based binary brain tumor classification using a decentralized ResNet-18 architecture that enables collaborative training without sharing raw patient data. To reflect realistic clinical conditions, the framework integrates heterogeneous multi-source datasets in different image formats (PNG and JPG) and evaluates performance under both IID and non-IID settings

View Publication Preview PDF

Publication Date

Tue Nov 19 2019

Journal Name

Iranian Journal Of Science And Technology, Transactions A: Science

Systematic Study of the Nuclear Structure for Some Exotic Nuclei Using Skyrme–Hartree–Fock Method

Ahmed N.

...Show More Authors

The ground state charge, neutron, proton and matter densities, the associated nuclear radii and the binding energy per nucleon of 8B, 17Ne, 23Al and 27P halo nuclei have been investigated using the Skyrme–Hartree–Fock (SHF) model with the new SKxs25 parameters. According to the calculated results, it is found that the SHF model with these Skyrme parameters provides a good description on the nuclear structure of above proton-rich halo nuclei. The elastic charge form factors of 8B and 17Ne halo nuclei and those of their stable isotopes 10B and 20Ne are calculated using plane-wave Born approximation with the charge density distributions obtained by SHF model to investigate the effect of the extended charge distributions of proton-rich nucl

View Publication

(9)

(8)

Publication Date

Mon May 15 2017

Journal Name

Journal Of Theoretical And Applied Information Technology

Anomaly detection in text data that represented as a graph using dbscan algorithm

Anomaly Detection

Enhanced DBSCAN algorithm

Unsupervised anomaly detection and Concept Frame Graph (CFG)

Asma Khazaal Abdulsahib

...Show More Authors

Anomaly detection is still a difficult task. To address this problem, we propose to strengthen DBSCAN algorithm for the data by converting all data to the graph concept frame (CFG). As is well known that the work DBSCAN method used to compile the data set belong to the same species in a while it will be considered in the external behavior of the cluster as a noise or anomalies. It can detect anomalies by DBSCAN algorithm can detect abnormal points that are far from certain set threshold (extremism). However, the abnormalities are not those cases, abnormal and unusual or far from a specific group, There is a type of data that is do not happen repeatedly, but are considered abnormal for the group of known. The analysis showed DBSCAN using the

Preview PDF

(4)

Publication Date

Sat Jul 15 2023

Journal Name

2023 6th International Conference On Engineering Technology And Its Applications (iiceta)

Methodology for the Design and Programming Methods for a Smart Home

Al-Araji Z.H.

...Show More Authors

View Publication

Publication Date

Sun Dec 01 2024

Journal Name

Chilean Journal Of Statistics

A method of multi-dimensional variable selection for additive partial linear models.

Adaptive least absolute shrinkage and selection operator · Dimension reduction · LASSO · Mean squared error · Minimum average variance estimation · Smoothly clipped absolute deviation

Munaf Y.

Hayder

...Show More Authors

In high-dimensional semiparametric regression, balancing accuracy and interpretability often requires combining dimension reduction with variable selection. This study intro- duces two novel methods for dimension reduction in additive partial linear models: (i) minimum average variance estimation (MAVE) combined with the adaptive least abso- lute shrinkage and selection operator (MAVE-ALASSO) and (ii) MAVE with smoothly clipped absolute deviation (MAVE-SCAD). These methods leverage the flexibility of MAVE for sufficient dimension reduction while incorporating adaptive penalties to en- sure sparse and interpretable models. The performance of both methods is evaluated through simulations using the mean squared error and variable selection cri

View Publication Preview PDF

Publication Date

Fri Dec 01 2023

Journal Name

Al-khwarizmi Engineering Journal

Development of an ANN Model for RGB Color Classification using the Dataset Extracted from a Fabricated Colorimeter

Shahad A.

Furat I.

Ahmed

...Show More Authors

Codes of red, green, and blue data (RGB) extracted from a lab-fabricated colorimeter device were used to build a proposed classifier with the objective of classifying colors of objects based on defined categories of fundamental colors. Primary, secondary, and tertiary colors namely red, green, orange, yellow, pink, purple, blue, brown, grey, white, and black, were employed in machine learning (ML) by applying an artificial neural network (ANN) algorithm using Python. The classifier, which was based on the ANN algorithm, required a definition of the mentioned eleven colors in the form of RGB codes in order to acquire the capability of classification. The software's capacity to forecast the color of the code that belongs to an ob

View Publication Preview PDF

(1)

Publication Date

Fri Dec 01 2023

Journal Name

Al-khwarizmi Engineering Journal

Development of an ANN Model for RGB Color Classification using the Dataset Extracted from a Fabricated Colorimeter

Colorimeter

RGB classifier

ANN

TensorFlow

ML.

Shahad A.

Furat I.

Ahmed

...Show More Authors

Codes of red, green, and blue data (RGB) extracted from a lab-fabricated colorimeter device were used to build a proposed classifier with the objective of classifying colors of objects based on defined categories of fundamental colors. Primary, secondary, and tertiary colors namely red, green, orange, yellow, pink, purple, blue, brown, grey, white, and black, were employed in machine learning (ML) by applying an artificial neural network (ANN) algorithm using Python. The classifier, which was based on the ANN algorithm, required a definition of the mentioned eleven colors in the form of RGB codes in order to acquire the capability of classification. The software's capacity to forecast the color of the code that belongs to an object under de

Preview PDF

(1)

Publication Date

Mon Jun 22 2020

Journal Name

Baghdad Science Journal

Using Evolving Algorithms to Cryptanalysis Nonlinear Cryptosystems

Ant Colony Optimization (ACO)

Cryptanalysis

Genetic Algorithm (GA)

Shrinking Generator

Stream Cipher.

Riyam Noori

Faez Hassan

...Show More Authors

In this paper, new method have been investigated using evolving algorithms (EA's) to cryptanalysis one of the nonlinear stream cipher cryptosystems which depends on the Linear Feedback Shift Register (LFSR) unit by using cipher text-only attack. Genetic Algorithm (GA) and Ant Colony Optimization (ACO) which are used for attacking one of the nonlinear cryptosystems called "shrinking generator" using different lengths of cipher text and different lengths of combined LFSRs. GA and ACO proved their good performance in finding the initial values of the combined LFSRs. This work can be considered as a warning for a stream cipher designer to avoid the weak points, which may be f

View Publication Preview PDF

(10)

(1)

Publication Date

Mon Jun 19 2023

Journal Name

Journal Of Engineering

Data Classification using Quantum Neural Network

Signal classification

artificial neural network

quantum computing

data analysis and fuzziness.

Ghassan H.

Zainab T.

Hassan Saadallah

...Show More Authors

In this paper, integrated quantum neural network (QNN), which is a class of feedforward

neural networks (FFNN’s), is performed through emerging quantum computing (QC) with artificial neural network(ANN) classifier. It is used in data classification technique, and here iris flower data is used as a classification signals. For this purpose independent component analysis (ICA) is used as a feature extraction technique after normalization of these signals, the architecture of (QNN’s) has inherently built in fuzzy, hidden units of these networks (QNN’s) to develop quantized representations of sample information provided by the training data set in various graded levels of certainty. Experimental results presented here show that

View Publication Preview PDF

1 2 ... 17 18 19 20 ... 2338 2339