Text classification based on optimization feature selection methods: a review and future directions

Osamah Mohammed Alyasiri; Yu-N Cheah; Hao Zhang; Omar Mustafa Al-Janabi; Ammar Kamal Abasi

doi:10.1007/s11042-024-19769-6

Details

Publication Date

Sat Jul 06 2024

Journal Name

Multimedia Tools And Applications

DOI

10.1007/s11042-024-19769-6

Choose Citation Style

Statistics

View publication

23

Statistics

(15)

(12)

Text classification based on optimization feature selection methods: a review and future directions

Text mining Text classification Text categorization Feature selection Optimization algorithms Machine learning classifiers

Osamah Mohammed Alyasiri

Yu-N Cheah

Hao Zhang

Omar Mustafa Al-Janabi

Ammar Kamal Abasi

...Show More Authors

A substantial portion of today’s multimedia data exists in the form of unstructured text. However, the unstructured nature of text poses a significant task in meeting users’ information requirements. Text classification (TC) has been extensively employed in text mining to facilitate multimedia data processing. However, accurately categorizing texts becomes challenging due to the increasing presence of non-informative features within the corpus. Several reviews on TC, encompassing various feature selection (FS) approaches to eliminate non-informative features, have been previously published. However, these reviews do not adequately cover the recently explored approaches to TC problem-solving utilizing FS, such as optimization techniques. This study comprehensively analyzes different FS approaches based on optimization algorithms for TC. We begin by introducing the primary phases involved in implementing TC. Subsequently, we explore a wide range of FS approaches for categorizing text documents and attempt to organize the existing works into four fundamental approaches: filter, wrapper, hybrid, and embedded. Furthermore, we review four optimization algorithms utilized in solving text FS problems: swarm intelligence-based, evolutionary-based, physics-based, and human behavior-related algorithms. We discuss the advantages and disadvantages of state-of-the-art studies that employ optimization algorithms for text FS methods. Additionally, we consider several aspects of each proposed method and thoroughly discuss the challenges associated with datasets, FS approaches, optimization algorithms, machine learning classifiers, and evaluation criteria employed to assess new and existing techniques. Finally, by identifying research gaps and proposing future directions, our review provides valuable guidance to researchers in developing and situating further studies within the current body of literature.

View Publication Preview PDF

Quick Preview PDF

Publication Date

Fri Sep 15 2017

Journal Name

Research Journal Of Applied Sciences, Engineering And Technology

Graph-Based Text Representation: A Survey of Current Approaches

Geehan Sabah

Asma Khazaal

Siti Sakira

...Show More Authors

View Publication

(3)

Publication Date

Fri Mar 23 2018

Journal Name

Entropy

Methods and Challenges in Shot Boundary Detection: A Review

Sadiq

Abd

M.

Basheera

Syed

Wissam

...Show More Authors

View Publication

(68)

(61)

Publication Date

Mon Jul 01 2019

Journal Name

International Journal Of Swarm Intelligence Research

A New Strategy Based on GSABAT to Solve Single Objective Optimization Problem

Single Objective

Optimization Problem

GSA

BAT

Iraq

...Show More Authors

This article proposes a new strategy based on a hybrid method that combines the gravitational search algorithm (GSA) with the bat algorithm (BAT) to solve a single-objective optimization problem. It first runs GSA, followed by BAT as the second step. The proposed approach relies on a parameter between 0 and 1 to address the problem of falling into local research because the lack of a local search mechanism increases intensity search, whereas diversity remains high and easily falls into the local optimum. The improvement is equivalent to the speed of the original BAT. Access speed is increased for the best solution. All solutions in the population are updated before the end of the operation of the proposed algorithm. The diversification f

View Publication Preview PDF

(6)

Publication Date

Sun Jun 01 2014

Journal Name

Baghdad Science Journal

Classification of fetal abnormalities based on CTG signal

fetal heart rate monitoring

heart rate analysis by neural network

fuzzy classification

FHR wavelet transform.

Safa'a S.

Israa R.

...Show More Authors

The fetal heart rate (FHR) signal processing based on Artificial Neural Networks (ANN),Fuzzy Logic (FL) and frequency domain Discrete Wavelet Transform(DWT) were analysis in order to perform automatic analysis using personal computers. Cardiotocography (CTG) is a primary biophysical method of fetal monitoring. The assessment of the printed CTG traces was based on the visual analysis of patterns that describing the variability of fetal heart rate signal. Fetal heart rate data of pregnant women with pregnancy between 38 and 40 weeks of gestation were studied. The first stage in the system was to convert the cardiotocograghy (CTG) tracing in to digital series so that the system can be analyzed ,while the second stage ,the FHR time series was t

View Publication Preview PDF

Publication Date

Sat Oct 01 2016

Journal Name

2016 6th International Conference On Information Communication And Management (icicm)

Enhancing case-based reasoning retrieval using classification based on associations

Ahmed

...Show More Authors

View Publication

(4)

(2)

Publication Date

Mon Jan 19 2026

Journal Name

American Journal Of Alzheimer's Disease & Other Dementias®

Comparison Study of Different Feature Selection Techniques for the Diagnosis of Alzheimer’s Disease

Farah

...Show More Authors

Objective : Alzheimer’s disease (AD) continues to be a major challenge because handling high-dimensional data is time-consuming and expensive due to its complexity. A large feature space often increases computational costs and reduces model interpretability. This study addresses this problem by evaluating and comparing multiple feature selection techniques to identify the most informative biomarkers for AD diagnosis.

Methods : Our study used data from the Alzheimer's Disease Neuroimaging Initiative (ADNI) to implement and test three feature selection a

View Publication

Publication Date

Sun Jun 20 2021

Journal Name

Baghdad Science Journal

Arabic Speech Classification Method Based on Padding and Deep Learning Neural Network

Arabic alphabet

deep learning

speech classification

COVID-19

spectrogram

Asroni

Ku Ruhana

Cahya

Hasan Basri

...Show More Authors

Deep learning convolution neural network has been widely used to recognize or classify voice. Various techniques have been used together with convolution neural network to prepare voice data before the training process in developing the classification model. However, not all model can produce good classification accuracy as there are many types of voice or speech. Classification of Arabic alphabet pronunciation is a one of the types of voice and accurate pronunciation is required in the learning of the Qur’an reading. Thus, the technique to process the pronunciation and training of the processed data requires specific approach. To overcome this issue, a method based on padding and deep learning convolution neural network is proposed to

View Publication Preview PDF

(25)

(7)

Publication Date

Sat Oct 22 2022

Journal Name

Aro-the Scientific Journal Of Koya University

Classification of Different Shoulder Girdle Motions for Prosthesis Control Using a Time-Domain Feature Extraction Technique

Bio-signal analysis

Dimensionality reduction

LDA classifier

Time domain

Huda M.

Alia K.

Ali H.

...Show More Authors

Abstract—The upper limb amputation exerts a significant burden on the amputee, limiting their ability to perform everyday activities, and degrading their quality of life. Amputee patients’ quality of life can be improved if they have natural control over their prosthetic hands. Among the biological signals, most commonly used to predict upper limb motor intentions, surface electromyography (sEMG), and axial acceleration sensor signals are essential components of shoulder-level upper limb prosthetic hand control systems. In this work, a pattern recognition system is proposed to create a plan for categorizing high-level upper limb prostheses in seven various types of shoulder girdle motions. Thus, combining seven feature groups, w

View Publication Preview PDF

(6)

(3)

Publication Date

Sun Dec 01 2024

Journal Name

Baghdad Science Journal

Densenet Model for Binary Glaucoma Classification Performance Assessment with Texture Feature

Wildan

Amal

Sahar

Mays

Mina

...Show More Authors

تعتبر شبكية العين جزءًا مهمًا من العين لأن الأطباء يستخدمون صورها لتشخيص العديد من أمراض العيون مثل الجلوكوما واعتلال الشبكية السكري وإعتام عدسة العين. في الواقع، يعد تصوير الشبكية المجزأ أداة قوية للكشف عن النمو غير العادي في منطقة العين بالإضافة إلى تحديد حجم وبنية القرص البصري. يمكن أن يؤدي الجلوكوما إلى إتلاف القرص البصري، مما يغير مظهر القرص البصري للعين. تعمل تقنيتنا على الكشف عن الجلوكوما وتصنيفه

View Publication

(4)

(3)

Publication Date

Tue May 01 2018

Journal Name

Journal Of Physics: Conference Series

Performance of Case-Based Reasoning Retrieval Using Classification Based on Associations versus Jcolibri and FreeCBR: A Further Validation Study

Ahmed

...Show More Authors

View Publication Preview PDF

(6)

(1)

1 2 ... 7 8 9 10 ... 2220 2221