Text classification based on optimization feature selection methods: a review and future directions

Osamah Mohammed Alyasiri; Yu-N Cheah; Hao Zhang; Omar Mustafa Al-Janabi; Ammar Kamal Abasi

doi:10.1007/s11042-024-19769-6

Publication Date

Sat Jul 06 2024

Journal Name

Multimedia Tools And Applications

Keywords: Text mining; Text classification; Text categorization; Feature selection; Optimization algorithms; Machine learning classifiers

A substantial portion of today’s multimedia data exists as unstructured text, and this lack of structure makes it difficult to meet users’ information requirements. Text classification (TC) has been extensively employed in text mining to facilitate multimedia data processing, but accurately categorizing texts becomes challenging as the number of non-informative features in the corpus grows. Several reviews on TC, encompassing various feature selection (FS) approaches for eliminating non-informative features, have been published previously. However, these reviews do not adequately cover recently explored FS-based approaches to TC, such as optimization techniques. This study comprehensively analyzes FS approaches for TC that are based on optimization algorithms. We begin by introducing the primary phases involved in implementing TC. We then explore a wide range of FS approaches for categorizing text documents and organize the existing works into four fundamental approaches: filter, wrapper, hybrid, and embedded. Furthermore, we review four categories of optimization algorithms used to solve text FS problems: swarm intelligence-based, evolutionary-based, physics-based, and human behavior-related algorithms. We discuss the advantages and disadvantages of state-of-the-art studies that employ optimization algorithms for text FS. Additionally, we consider several aspects of each proposed method and thoroughly discuss the challenges associated with the datasets, FS approaches, optimization algorithms, machine learning classifiers, and evaluation criteria used to assess new and existing techniques. Finally, by identifying research gaps and proposing future directions, our review offers guidance to researchers in developing and situating further studies within the current body of literature.
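To make the wrapper idea from the abstract concrete, the following is a minimal Python sketch, not a method taken from the review. It treats FS as a search over binary masks on the vocabulary and scores each mask by how well a classifier performs with only the selected features. The toy corpus, the 1-nearest-neighbour classifier, the `alpha` weighting, and the single-solution bit-flip search (a simple stand-in for the swarm or evolutionary optimizers the review surveys) are all assumptions chosen for illustration.

```python
import random

# Toy labelled corpus (hypothetical data, for illustration only).
CORPUS = [
    ("buy cheap pills now", "spam"),
    ("cheap watches buy now", "spam"),
    ("win money now", "spam"),
    ("project meeting on monday", "ham"),
    ("meeting notes for the project", "ham"),
    ("lunch on monday", "ham"),
]

# One candidate feature per distinct word.
VOCAB = sorted({word for text, _ in CORPUS for word in text.split()})


def vectorize(text, mask):
    """Binary bag-of-words, keeping only features switched on in mask."""
    words = set(text.split())
    return [1 if (keep and w in words) else 0 for w, keep in zip(VOCAB, mask)]


def loo_accuracy(mask):
    """Leave-one-out accuracy of a 1-nearest-neighbour classifier."""
    vectors = [vectorize(text, mask) for text, _ in CORPUS]
    correct = 0
    for i, (_, label) in enumerate(CORPUS):
        best_sim, best_label = -1, None
        for j, (_, other_label) in enumerate(CORPUS):
            if i == j:
                continue
            # Similarity = number of selected words the two documents share.
            sim = sum(a & b for a, b in zip(vectors[i], vectors[j]))
            if sim > best_sim:
                best_sim, best_label = sim, other_label
        correct += best_label == label
    return correct / len(CORPUS)


def fitness(mask, alpha=0.9):
    """Wrapper fitness: reward accuracy, lightly penalise kept features."""
    kept_ratio = sum(mask) / len(mask)
    return alpha * loo_accuracy(mask) + (1 - alpha) * (1 - kept_ratio)


def select_features(iterations=200, seed=0):
    """Bit-flip search: flip one feature at a time, keep non-worsening moves."""
    rng = random.Random(seed)
    best = [1] * len(VOCAB)  # start with every feature selected
    best_fit = fitness(best)
    for _ in range(iterations):
        candidate = best[:]
        candidate[rng.randrange(len(candidate))] ^= 1  # toggle one feature
        candidate_fit = fitness(candidate)
        if candidate_fit >= best_fit:
            best, best_fit = candidate, candidate_fit
    return best, best_fit


if __name__ == "__main__":
    mask, score = select_features()
    print([w for w, keep in zip(VOCAB, mask) if keep], round(score, 3))
```

A filter approach would instead rank each word with a classifier-independent score and keep the top-ranked words, while population-based optimizers would evolve many masks at once using the same kind of fitness function.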

Publication Date

Thu Jan 30 2020

Journal Name

Telecommunication Systems

Nature-inspired optimization algorithms for community detection in complex networks: a review and future trends

Dhuha Abdulhadi

Siti Zaiton Mohd

Roselina

Publication Date

Fri Feb 15 2013

Journal Name

American Journal Of Health-system Pharmacy

Pharmacy in Iraq: History, current status, and future directions

Ali Azeez

Saad Abdulrahman

Bernard

Publication Date

Thu Dec 01 2022

Journal Name

Journal Of Engineering

Deep Learning-Based Segmentation and Classification Techniques for Brain Tumor MRI: A Review

Brain Tumor

Magnetic Resonance Imaging (MRI)

Convolutional Neural Network (CNN)

Classification

Segmentation

Feature Extraction.

Noor Mohammed

Nassir H.

Early detection of brain tumors is critical for enhancing treatment options and extending patient survival. Magnetic resonance imaging (MRI) gives more detailed information, with greater contrast and clarity, than any other scanning method. Manually segmenting brain tumors from the many MRI images collected in clinical practice for cancer diagnosis is a tough and time-consuming task. Algorithms and machine learning technologies can detect tumors in brain MRI scans, making the process easier for doctors, because MRI images can appear healthy even when the person has a tumor, possibly a malignant one. Recently, deep learning techniques based on deep convolutional neural networks have been used to analyze med

Publication Date

Sun Jan 30 2022

Journal Name

Iraqi Journal Of Science

A Survey on Arabic Text Classification Using Deep and Machine Learning Algorithms

Farah A.

Nada A.Z.

Text categorization refers to the process of grouping text or documents into classes or categories according to their content. The text categorization process consists of three phases: preprocessing, feature extraction, and classification. Compared with English, only a few studies have addressed the categorization and classification of Arabic text. Arabic text representation is a difficult task for a variety of applications, such as text classification and clustering, because the Arabic language is noted for its richness, diversity, and complicated morphology. This paper presents a comprehensive analysis and comparison of research from the last five years, based on the dataset, year, algorithms and the accuracy th

Publication Date

Mon Dec 01 2014

Journal Name

2014 IEEE Student Conference on Research and Development

Feature extraction for co-occurrence-based cosine similarity score of text documents

Kadhim A.I.

Publication Date

Sat Oct 03 2009

Journal Name

Proceeding Of 3rd Scientific Conference Of The College Of Science

Research Address: New Multispectral Image Classification Methods Based on Scatterplot Technique

Taghreed

Publication Date

Thu Feb 01 2024

Journal Name

Baghdad Science Journal

A Novel Gravity Optimization Algorithm for Extractive Arabic Text Summarization

Abstractive Summarization

Extractive Summarization

Arabic Text Summarization

Similarity Graph

Gravitational Optimization Algorithm

Mustafa

Ayad R.

Osamah Y.

An automatic text summarization system mimics how humans summarize by picking the most significant sentences in a source text. However, the complexities of the Arabic language make it challenging to obtain information quickly and effectively. The main disadvantage of the traditional approaches is that they are strictly constrained (especially for the Arabic language) by the accuracy of sentence feature functions, weighting schemes, and similarity calculations. On the other hand, the meta-heuristic search approaches have a feature tha

Publication Date

Fri Jan 01 2021

Journal Name

Computers, Materials & Continua

A New Hybrid Feature Selection Method Using T-test and Fitness Function

Husam

Publication Date

Thu Oct 01 2015

Journal Name

Engineering And Technology Journal

Genetic Based Optimization Models for Enhancing Multi-Document Text Summarization

Hilal

Nasreen J.

Publication Date

Fri Nov 11 2022

Journal Name

Al-mansour Journal

Text Cryptography Based on Three Different Keys

Text Cryptography

Cryptography

Plaintext

Ciphertext

Omar Fitian

Mohammed Jasim

Mustafa

Secure information transmission over the internet is becoming an important requirement in data communication. These days, authenticity, secrecy, and confidentiality are the most important concerns in securing data communication. For that reason, information hiding methods such as cryptography, steganography, and watermarking are used to secure data transmission: cryptography encrypts the information into an unreadable form, steganography conceals the information within images, audio, or video, and watermarking protects information from intruders. This paper proposed a new cryptography method by using thre