Wrapper and Hybrid Feature Selection Methods Using Metaheuristic Algorithms for English Text Classification: A Systematic Review

Osamah Mohammed Alyasiri; Yu-N Cheah; Ammar Kamal Abasi; Omar Mustafa Al-Janabi

doi:10.1109/ACCESS.2022.3165814

Details

Publication Date

Sat Jan 01 2022

Journal Name

IEEE Access

Volume

10

DOI

10.1109/ACCESS.2022.3165814

Metaheuristics

Feature extraction

Text categorization

Classification algorithms

Systematics

Search problems

Business

Feature selection (FS) is the process of deciding which relevant features/attributes to include and which irrelevant features to exclude for predictive modeling. It is a crucial task that helps machine learning classifiers reduce error rates, computation time, and overfitting while improving classification accuracy. It has demonstrated its efficacy in many domains, including text classification (TC), text mining, and image recognition. While there are many traditional FS methods, recent research efforts have been devoted to applying metaheuristic algorithms as FS techniques for the TC task. However, there are few literature reviews concerning TC. Therefore, this paper presents a systematic, comprehensive overview of available studies on the different metaheuristic algorithms used for FS to improve TC. It contributes to the body of existing knowledge by answering four research questions (RQs): 1) What are the different approaches of FS that apply metaheuristic algorithms to improve TC? 2) Does applying metaheuristic algorithms for TC lead to better accuracy than the typical FS methods? 3) How effective are the modified, hybridized metaheuristic algorithms for text FS problems? and 4) What are the gaps in the current studies and their future directions? These RQs led to a study of recent works on metaheuristic-based FS methods, their contributions, and their limitations. A final list of thirty-seven (37) related articles was extracted and investigated against the RQs to generate new knowledge in the domain of study. Most of the reviewed papers addressed TC with metaheuristic algorithms based on the wrapper and hybrid FS approaches. Future research should focus on hybrid-based FS approaches, as they intuitively handle complex optimization problems and potentially provide new research opportunities in this rapidly developing field.
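To make the wrapper idea in the abstract concrete, here is a minimal sketch, not taken from the reviewed paper, of wrapper-style FS driven by a metaheuristic. It assumes a tiny hypothetical corpus (DOCS), a bag-of-words vocabulary, a 1-nearest-neighbour classifier scored by leave-one-out accuracy, and a simple genetic algorithm as the search method. All names and parameter values are illustrative assumptions.

```python
import random

# Hypothetical toy corpus: label 1 = sports, label 0 = finance.
DOCS = [
    ("the team won the match", 1),
    ("the player scored a goal", 1),
    ("a goal in the final match", 1),
    ("the coach praised the team", 1),
    ("the bank raised the rate", 0),
    ("a stock fell in the market", 0),
    ("the market and the bank rate", 0),
    ("investors sold the stock", 0),
]
VOCAB = sorted({w for text, _ in DOCS for w in text.split()})

def vectorize(text, mask):
    """Bag-of-words counts restricted to the features switched on in mask."""
    words = text.split()
    return [words.count(w) for w, keep in zip(VOCAB, mask) if keep]

def loo_accuracy(mask):
    """Leave-one-out accuracy of a 1-NN classifier on the selected features."""
    if not any(mask):
        return 0.0
    vecs = [(vectorize(t, mask), y) for t, y in DOCS]
    correct = 0
    for i, (x, y) in enumerate(vecs):
        nearest = min(
            (j for j in range(len(vecs)) if j != i),
            key=lambda j: sum((a - b) ** 2 for a, b in zip(x, vecs[j][0])),
        )
        correct += vecs[nearest][1] == y
    return correct / len(vecs)

def fitness(mask, penalty=0.01):
    """Wrapper objective: classifier accuracy minus a small cost per feature."""
    return loo_accuracy(mask) - penalty * sum(mask)

def genetic_search(generations=30, pop_size=20, mutation_rate=0.05, seed=0):
    """Simple elitist genetic algorithm over binary feature masks."""
    rng = random.Random(seed)
    n = len(VOCAB)
    pop = [[rng.randint(0, 1) for _ in range(n)] for _ in range(pop_size)]
    for _ in range(generations):
        pop.sort(key=fitness, reverse=True)
        parents = pop[: pop_size // 2]          # keep the better half
        children = []
        while len(parents) + len(children) < pop_size:
            a, b = rng.sample(parents, 2)
            cut = rng.randrange(1, n)            # one-point crossover
            child = a[:cut] + b[cut:]
            child = [1 - g if rng.random() < mutation_rate else g for g in child]
            children.append(child)
        pop = parents + children
    return max(pop, key=fitness)

best_mask = genetic_search()
selected = [w for w, keep in zip(VOCAB, best_mask) if keep]
```

The classifier sits inside the fitness function, which is what makes this a wrapper method rather than a filter method. A hybrid approach would typically pre-rank or prune features with a filter score before running the search.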

Publication Date

Tue Feb 18 2025

Journal Name

International Journal Of Scientific Research In Science, Engineering And Technology

A Comprehensive Review on Cryptography Algorithms: Methods and Comparative Analysis

Rusul

The evolution of cryptography has been crucial to preserving sensitive information in the digital age. From the early ciphers used in the earliest societies to recent cryptographic methods, cryptography has developed alongside advances in computing. The growth in cyber threats and the increase in digital communications have highlighted the importance of selecting effective and robust cryptographic techniques. This article reviews various cryptographic algorithms, including symmetric-key and asymmetric-key cryptography, evaluating them according to security, complexity, and execution speed. The main outcomes demonstrate the growing reliance on elliptic curve cryptography owing to its capabi

Publication Date

Thu Nov 17 2022

Journal Name

Journal Of Information And Optimization Sciences

Hybrid deep learning model for Arabic text classification based on mutual information

Farah A.

Nada A. Z.

Publication Date

Sun Dec 06 2009

Journal Name

Baghdad Science Journal

Automatic Block Selection for Synthesizing Texture Images using Genetic Algorithms

Texture synthesis

Patch-based

genetic algorithms.

Noor Adnan

Mokhtar Mohammed

Shaima

Texture synthesis using genetic algorithms is one way, proposed in previous research, to synthesize texture quickly and easily. In genetic texture synthesis algorithms, the chromosome consists of random blocks selected manually by the user. However, this method of selection is highly dependent on the experience of the user; hence, a wrong selection of blocks will greatly affect the synthesized texture. In this paper, a new method is suggested for selecting the blocks automatically without the participation of the user. The results show that this method of selection eliminates some of the blending caused by the previous manual method of selection.

Publication Date

Tue Oct 16 2018

Journal Name

Springer Science and Business Media LLC

MOGSABAT: a metaheuristic hybrid algorithm for solving multi-objective optimisation problems

Iraq

Publication Date

Tue Dec 05 2023

Journal Name

Baghdad Science Journal

AlexNet-Based Feature Extraction for Cassava Classification: A Machine Learning Approach

Color

Feature extraction

KNN

Naïve Bayes

Shape

SVM

Texture

Miftahus

Mohd Farhan Md

Mohd Norasri

Cassava, a significant crop in Africa, Asia, and South America, is a staple food for millions. However, classifying cassava species using conventional color, texture, and shape features is inefficient, as cassava leaves exhibit similarities across different types, including toxic and non-toxic varieties. This research aims to overcome the limitations of traditional classification methods by employing deep learning techniques with pre-trained AlexNet as the feature extractor to accurately classify four types of cassava: Gajah, Manggu, Kapok, and Beracun. The dataset was collected from local farms in Lamongan, Indonesia, together with agricultural research experts. It consists of 1,400 images, and each type of cassava has

Publication Date

Sat Jan 01 2022

Journal Name

Turkish Journal Of Physiotherapy And Rehabilitation

classification coco dataset using machine learning algorithms

Machine learning

classification

MCOCO dataset

K Nearest Neighbor (KNN)

Stochastic Gradient Descent learning (SGD)

Logistic Regression Algorithm (LR)

Multi-Layer Perceptron (MLP)

Rasool

Bushra

Raaid

In this paper, we used four classification methods to classify objects and compared them: K Nearest Neighbor (KNN), Stochastic Gradient Descent learning (SGD), Logistic Regression Algorithm (LR), and Multi-Layer Perceptron (MLP). We used the MCOCO dataset for object classification and detection; the dataset images were randomly divided into training and testing datasets at a ratio of 7:3, respectively. The randomly selected training and testing images were converted from color to gray level, enhanced using the histogram equalization method, and resized to 20 x 20. Principal component analysis (PCA) was used for feature extraction, and finally apply four classification metho

Publication Date

Sat May 01 2021

Journal Name

Journal Of Physics: Conference Series

The Prediction of COVID 19 Disease Using Feature Selection Techniques

Feature selection

COVID 19

Recursive Feature Elimination

Extra Tree Classifier

Restricted Boltzmann Machine

Naïve Bayesian

Rasha H.

Wisal Hashim

COVID 19 has spread rapidly around the world due to the lack of a suitable vaccine; therefore, early prediction of those infected with this virus is extremely important in attempting to control it by quarantining infected people and giving them possible medical attention to limit its spread. This work suggests a model for predicting the COVID 19 virus using feature selection techniques. The proposed model consists of three stages: the preprocessing stage, the feature selection stage, and the classification stage. This work uses a data set consisting of 8571 records, with forty features for patients from different countries. Two feature selection techniques are used in

Publication Date

Thu Feb 01 2024

Journal Name

Bulletin Of Electrical Engineering And Informatics

A systematic literature review for smart hydroponic system

Ali Yahya Gheni

Hydroponics is the cultivation of plants using water instead of soil, with an emphasis on fulfilling the nutritional needs of plants. Research has introduced the smart hydroponic system, which enables regular monitoring of every aspect to maintain the pH values, water, temperature, and soil. Nevertheless, there is a lack of knowledge that can systematically represent the current research. The proposed study suggests a systematic literature review of smart hydroponic systems to overcome this limitation. This systematic literature review will help practitioners draw on existing literature and propose new solutions based on available knowledge in the smart hydroponic system. The outcomes of this paper can assist future r

Publication Date

Mon Jan 01 2024

Journal Name

Fusion: Practice And Applications

Optimizing Task Scheduling and Resource Allocation in Computing Environments using Metaheuristic Methods

Fadhil H.M.

Optimizing system performance in dynamic and heterogeneous environments and the efficient management of computational tasks are crucial. This paper therefore looks at task scheduling and resource allocation algorithms in some depth. The work evaluates five algorithms: Genetic Algorithms (GA), Particle Swarm Optimization (PSO), Ant Colony Optimization (ACO), Firefly Algorithm (FA) and Simulated Annealing (SA) across various workloads achieved by varying the task-to-node ratio. The paper identifies Finish Time and Deadline as two key performance metrics for gauging the efficacy of an algorithm, and a comprehensive investigation of the behaviors of these algorithms across different workloads was carried out. Results from the experiment

Publication Date

Wed Nov 06 2024

Journal Name

2024 17th International Conference on Developments in eSystems Engineering (DeSE)

Improving Cardiovascular Prediction Performance Using Machine Learning Based Feature Selection

Marwah Abdulrazzaq

Muntadher

Sadiq H.

Basheera M

Abir

Dhiya
