Text classification based on optimization feature selection methods: a review and future directions

Osamah Mohammed Alyasiri; Yu-N Cheah; Hao Zhang; Omar Mustafa Al-Janabi; Ammar Kamal Abasi

doi:10.1007/s11042-024-19769-6

Details

Publication Date

Sat Jul 06 2024

Journal Name

Multimedia Tools And Applications

Text mining; Text classification; Text categorization; Feature selection; Optimization algorithms; Machine learning classifiers

A substantial portion of today’s multimedia data exists in the form of unstructured text. However, the unstructured nature of text poses a significant challenge in meeting users’ information requirements. Text classification (TC) has been extensively employed in text mining to facilitate multimedia data processing. Accurately categorizing texts nevertheless becomes challenging due to the increasing presence of non-informative features within the corpus. Several reviews on TC, encompassing various feature selection (FS) approaches to eliminate non-informative features, have been previously published. These reviews, however, do not adequately cover recently explored approaches to TC that apply optimization techniques to FS. This study comprehensively analyzes different FS approaches based on optimization algorithms for TC. We begin by introducing the primary phases involved in implementing TC. Subsequently, we explore a wide range of FS approaches for categorizing text documents and attempt to organize the existing works into four fundamental approaches: filter, wrapper, hybrid, and embedded. Furthermore, we review four categories of optimization algorithms utilized in solving text FS problems: swarm intelligence-based, evolutionary-based, physics-based, and human behavior-related algorithms. We discuss the advantages and disadvantages of state-of-the-art studies that employ optimization algorithms for text FS methods. Additionally, we consider several aspects of each proposed method and thoroughly discuss the challenges associated with datasets, FS approaches, optimization algorithms, machine learning classifiers, and evaluation criteria employed to assess new and existing techniques. Finally, by identifying research gaps and proposing future directions, our review provides valuable guidance to researchers in developing and situating further studies within the current body of literature.
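To make the wrapper idea concrete, the following is a minimal sketch, not a method taken from the review. It assumes a toy six-document corpus, a simple word-overlap classifier, a weighted fitness that trades accuracy against the fraction of features removed (the weight 0.9 is an arbitrary assumption), and a (1+1) evolutionary search with bit-flip mutation standing in for the evolutionary-based family. All names (`DOCS`, `accuracy`, `fitness`, `select_features`) are hypothetical.

```python
import random

# Toy labelled corpus (hypothetical data, for illustration only).
DOCS = [
    ("cheap pills buy now", "spam"),
    ("buy cheap watches now", "spam"),
    ("limited offer buy now", "spam"),
    ("meeting agenda for monday", "ham"),
    ("project meeting notes attached", "ham"),
    ("monday project status report", "ham"),
]

VOCAB = sorted({word for text, _ in DOCS for word in text.split()})
LABELS = sorted({label for _, label in DOCS})


def accuracy(mask):
    """Leave-one-out accuracy of a word-overlap classifier on the selected terms."""
    selected = {word for word, keep in zip(VOCAB, mask) if keep}
    correct = 0
    for i, (text, label) in enumerate(DOCS):
        words = set(text.split()) & selected
        overlap = {name: 0 for name in LABELS}
        for j, (other_text, other_label) in enumerate(DOCS):
            if j != i:
                overlap[other_label] += len(words & set(other_text.split()))
        top = max(overlap.values())
        winners = [name for name, score in overlap.items() if score == top]
        # A prediction counts only if exactly one class wins; ties are errors.
        if winners == [label]:
            correct += 1
    return correct / len(DOCS)


def fitness(mask, alpha=0.9):
    """Weighted sum of accuracy and feature reduction (alpha is an assumed weight)."""
    reduction = 1 - sum(mask) / len(mask)
    return alpha * accuracy(mask) + (1 - alpha) * reduction


def select_features(iterations=200, seed=0):
    """(1+1) evolutionary search over binary feature masks with bit-flip mutation."""
    rng = random.Random(seed)
    best = [True] * len(VOCAB)  # start from the full feature set
    best_fit = fitness(best)
    rate = 1 / len(best)  # expected one flipped bit per mutation
    for _ in range(iterations):
        candidate = [(not bit) if rng.random() < rate else bit for bit in best]
        candidate_fit = fitness(candidate)
        # Keep the candidate if it is at least as good as the current best.
        if candidate_fit >= best_fit:
            best, best_fit = candidate, candidate_fit
    return best, best_fit


if __name__ == "__main__":
    mask, score = select_features()
    kept = [word for word, keep in zip(VOCAB, mask) if keep]
    print("selected terms:", kept)
    print("fitness:", round(score, 3))
```

Because the search only accepts candidates that are at least as fit as the current mask, the returned subset never scores below the full vocabulary under this fitness. A filter approach would instead rank terms by a statistic computed once, without calling the classifier inside the loop.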

Publication Date

Tue Jun 01 2021

Journal Name

Swarm And Evolutionary Computation

A review of heuristics and metaheuristics for community detection in complex networks: Current usage, emerging development and future directions

Bara'a Ali

Amenah D.

Ammar A.

Clara

Mayyadah

Suat

Rawaa Dawoud

Sensibly highlighting the hidden structures of many real-world networks has attracted growing interest and triggered a vast array of techniques for what is nowadays called the community detection (CD) problem. Non-deterministic metaheuristics have proved able to competitively transcend the limits of their deterministic heuristic counterparts in solving the community detection problem. Despite the increasing interest, most of the existing metaheuristic-based community detection (MCD) algorithms reflect one traditional language. Generally, they tend to explicitly project some features of real communities into different definitions of single or multi-objective optimization functions. The design of other operators, however, remains canonical lacking any inte

Publication Date

Sat Dec 30 2023

Journal Name

Traitement Du Signal

Optimizing Acoustic Feature Selection for Estimating Speaker Traits: A Novel Threshold-Based Approach

Umniah

Publication Date

Wed Feb 01 2023

Journal Name

Baghdad Science Journal

Breast Cancer MRI Classification Based on Fractional Entropy Image Enhancement and Deep Feature Extraction

Breast MRI scans

Classification

CNN

Deep features

LSTM

علي

Asaad F.

Hamid A.

Rabha W.

Disease diagnosis with computer-aided methods has been extensively studied and applied in diagnosing and monitoring several chronic diseases. Early detection and risk assessment of breast diseases based on clinical data helps doctors make an early diagnosis and monitor disease progression. The purpose of this study is to exploit the Convolutional Neural Network (CNN) in classifying breast MRI scans as pathological or healthy. This study presents a fully automated and efficient deep feature extraction algorithm that exploits the spatial information obtained from both T2W-TSE and STIR MRI sequences to discriminate between pathological and healthy breast MRI scans. The breast MRI scans are preprocessed prior to the feature

Publication Date

Tue Sep 01 2015

Journal Name

2015 7th Computer Science and Electronic Engineering Conference (CEEC)

An experimental investigation on PCA based on cosine similarity and correlation for text feature dimensionality reduction

Maysa

John Q

Publication Date

Thu Jan 30 2020

Journal Name

Telecommunication Systems

Nature-inspired optimization algorithms for community detection in complex networks: a review and future trends

Dhuha Abdulhadi

Siti Zaiton Mohd

Roselina

Publication Date

Thu Nov 17 2022

Journal Name

Journal Of Information And Optimization Sciences

Hybrid deep learning model for Arabic text classification based on mutual information

Farah A.

Nada A. Z.

Publication Date

Fri Feb 15 2013

Journal Name

American Journal of Health-System Pharmacy

Pharmacy in Iraq: History, current status, and future directions

Ali Azeez

Saad Abdulrahman

Bernard

Publication Date

Thu Dec 01 2022

Journal Name

Journal Of Engineering

Deep Learning-Based Segmentation and Classification Techniques for Brain Tumor MRI: A Review

Brain Tumor

Magnetic Resonance Imaging (MRI)

Convolutional Neural Network (CNN)

Classification

Segmentation

Feature Extraction.

Noor Mohammed

Nassir H.

Early detection of brain tumors is critical for enhancing treatment options and extending patient survival. Magnetic resonance imaging (MRI) scanning gives more detailed information, such as greater contrast and clarity, than any other scanning method. Manually segmenting brain tumors from the many MRI images collected in clinical practice for cancer diagnosis is a tough and time-consuming task. Tumors in brain MRI scans can be detected using algorithms and machine learning technologies, making the process easier for doctors, because MRI images can appear healthy even when the person has a tumor, which may be malignant. Recently, deep learning techniques based on deep convolutional neural networks have been used to analyze med

Publication Date

Mon Dec 01 2014

Journal Name

2014 IEEE Student Conference on Research and Development

Feature extraction for co-occurrence-based cosine similarity score of text documents

Kadhim A.I.

Publication Date

Sun Jan 30 2022

Journal Name

Iraqi Journal Of Science

A Survey on Arabic Text Classification Using Deep and Machine Learning Algorithms

Farah A.

Nada A.Z.

Text categorization refers to the process of grouping text or documents into classes or categories according to their content. The text categorization process consists of three phases: preprocessing, feature extraction, and classification. In comparison to the English language, only a few studies have addressed categorizing and classifying Arabic text. For a variety of applications, such as text classification and clustering, Arabic text representation is a difficult task because the Arabic language is noted for its richness, diversity, and complicated morphology. This paper presents a comprehensive analysis and comparison of research from the last five years based on the dataset, year, algorithms and the accuracy th
