A substantial portion of today’s multimedia data exists in the form of unstructured text. However, the unstructured nature of text poses a significant challenge to meeting users’ information requirements. Text classification (TC) has been extensively employed in text mining to facilitate multimedia data processing. However, accurately categorizing texts becomes challenging due to the increasing presence of non-informative features within the corpus. Several reviews on TC, encompassing various feature selection (FS) approaches to eliminate non-informative features, have been published previously. However, these reviews do not adequately cover recently explored approaches to TC problem-solving that utilize FS, such as optimization techniques. This study comprehensively analyzes different FS approaches based on optimization algorithms for TC. We begin by introducing the primary phases involved in implementing TC. Subsequently, we explore a wide range of FS approaches for categorizing text documents and organize the existing works into four fundamental approaches: filter, wrapper, hybrid, and embedded. Furthermore, we review four families of optimization algorithms utilized in solving text FS problems: swarm intelligence-based, evolutionary-based, physics-based, and human behavior-related algorithms. We discuss the advantages and disadvantages of state-of-the-art studies that employ optimization algorithms for text FS methods. Additionally, we consider several aspects of each proposed method and thoroughly discuss the challenges associated with datasets, FS approaches, optimization algorithms, machine learning classifiers, and the evaluation criteria employed to assess new and existing techniques. Finally, by identifying research gaps and proposing future directions, our review provides valuable guidance to researchers in developing and situating further studies within the current body of literature.
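As a minimal illustration of the filter approach named above (not any specific method from the review), the following sketch scores text features with a chi-square test and keeps only the top-ranked terms before classification; the toy corpus, labels, and k value are hypothetical placeholders.

```python
# Filter-style feature selection for text classification: score terms with
# chi-square against the labels, keep the top k, then train a classifier.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.feature_selection import SelectKBest, chi2
from sklearn.naive_bayes import MultinomialNB
from sklearn.pipeline import Pipeline

docs = ["spam offer now", "meeting agenda attached",
        "win a free prize", "project status update"]   # toy corpus
labels = [1, 0, 1, 0]                                   # 1 = spam, 0 = not spam

pipeline = Pipeline([
    ("tfidf", TfidfVectorizer()),
    ("select", SelectKBest(chi2, k=5)),   # filter step: keep 5 highest-scoring terms
    ("clf", MultinomialNB()),             # downstream classifier
])
pipeline.fit(docs, labels)
print(pipeline.predict(["free prize offer"]))
```

Wrapper and hybrid approaches differ in that the feature subset is searched (for example, by a swarm or evolutionary algorithm) and evaluated by the classifier itself rather than by a statistic computed once.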
Diabetes is one of the deadliest diseases in the world and can lead to stroke, blindness, organ failure, and amputation of the lower limbs. Research indicates that diabetes can be controlled if it is detected at an early stage. Scientists are becoming increasingly interested in classification algorithms for diagnosing diseases. In this study, we analyze the performance of five classification algorithms, namely naïve Bayes, support vector machine, multilayer perceptron artificial neural network, decision tree, and random forest, using a diabetes dataset that contains the information of 2000 female patients. Various metrics were applied to evaluate the performance of the classifiers, such as precision and the area under the curve (AUC).
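A compact sketch of the comparison described above, using scikit-learn versions of the five classifiers; since the 2000-patient diabetes dataset is not bundled here, a synthetic stand-in is used and all parameter choices are illustrative.

```python
# Compare five classifiers on precision and AUC, mirroring the study's setup.
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.naive_bayes import GaussianNB
from sklearn.svm import SVC
from sklearn.neural_network import MLPClassifier
from sklearn.tree import DecisionTreeClassifier
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import precision_score, roc_auc_score

X, y = make_classification(n_samples=2000, n_features=8, random_state=0)  # placeholder data
X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.2, random_state=0)

models = {
    "Naive Bayes": GaussianNB(),
    "SVM": SVC(probability=True),
    "MLP": MLPClassifier(max_iter=1000),
    "Decision Tree": DecisionTreeClassifier(),
    "Random Forest": RandomForestClassifier(),
}
for name, model in models.items():
    model.fit(X_tr, y_tr)
    pred = model.predict(X_te)
    proba = model.predict_proba(X_te)[:, 1]
    print(f"{name}: precision={precision_score(y_te, pred):.3f}, "
          f"AUC={roc_auc_score(y_te, proba):.3f}")
```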
Linear discriminant analysis and logistic regression are among the most widely used multivariate statistical methods for analyzing data with categorical outcome variables. Both are appropriate for developing linear classification models. A key assumption of linear discriminant analysis is that the explanatory variables follow a multivariate normal distribution, while logistic regression makes no assumptions about the distribution of the explanatory data. Hence, logistic regression is considered the more flexible and more robust method when these assumptions are violated. In this paper, we focus on the comparison between three forms for classifying data belonging to …
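To make the comparison concrete, a minimal sketch fitting both models on the same data and reporting cross-validated accuracy; the synthetic data and fold count are assumptions, not the paper's actual design.

```python
# Fit LDA and logistic regression side by side on identical data.
from sklearn.datasets import make_classification
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score

X, y = make_classification(n_samples=500, n_features=5, random_state=1)  # placeholder data

for name, model in [("LDA", LinearDiscriminantAnalysis()),
                    ("Logistic regression", LogisticRegression(max_iter=1000))]:
    acc = cross_val_score(model, X, y, cv=5).mean()
    print(f"{name}: mean CV accuracy = {acc:.3f}")
```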
This research deals with estimating the reliability function of the two-parameter exponential distribution using different estimation methods: maximum likelihood, median-first order statistics, ridge regression, modified Thompson-type shrinkage, and single-stage shrinkage. Comparisons among the estimators were made using Monte Carlo simulation based on the mean squared error (MSE) criterion, and the results indicate that the shrinkage methods perform better than the other methods.
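The Monte Carlo MSE framework referred to above can be sketched as follows for the maximum-likelihood estimator only (the shrinkage estimators from the study are not reproduced here); the parameter values, sample size, and replication count are illustrative assumptions.

```python
# Monte Carlo MSE of the MLE of the reliability function R(t) = exp(-(t - mu)/theta)
# for the two-parameter exponential distribution (t >= mu).
import numpy as np

rng = np.random.default_rng(0)
mu, theta, t0 = 1.0, 2.0, 3.0            # location, scale, evaluation point (assumed)
R_true = np.exp(-(t0 - mu) / theta)      # true reliability at t0

def mle_reliability(sample, t):
    loc_hat = sample.min()               # MLE of the location parameter
    scale_hat = sample.mean() - loc_hat  # MLE of the scale parameter
    return np.exp(-(t - loc_hat) / scale_hat)

n, reps = 30, 5000
estimates = np.array([
    mle_reliability(mu + rng.exponential(theta, n), t0) for _ in range(reps)
])
print("MSE of the MLE-based estimate of R(t0):", np.mean((estimates - R_true) ** 2))
```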
As an important resource, entangled light sources have been used in developing quantum information technologies such as quantum key distribution (QKD). Few experiments have implemented entanglement-based deterministic QKD protocols, since the security of existing protocols may be compromised in lossy channels. In this work, we report on a loss-tolerant deterministic QKD experiment that follows a modified “Ping-Pong” (PP) protocol. The experimental results demonstrate for the first time that a secure deterministic QKD session can be fulfilled in a channel with an optical loss of 9 dB, based on a telecom-band entangled photon source. This exhibits a conceivable prospect of utilizing entangled light sources in real-life fiber-based QKD.
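For context on the 9 dB figure quoted above, a quick conversion from optical loss in decibels to linear channel transmittance (standard dB arithmetic, not part of the reported experiment):

```python
# Convert an optical loss quoted in dB to the fraction of photons transmitted.
loss_db = 9.0
transmittance = 10 ** (-loss_db / 10)   # standard dB-to-linear conversion
print(f"{loss_db} dB loss -> transmittance ≈ {transmittance:.3f}")  # ≈ 0.126
```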
In this article, a Convolutional Neural Network (CNN) is used to detect damage and no-damage images from satellite imagery using different classifiers. These classifiers are well-known models that are used with the CNN to detect and classify images from a specific dataset. The dataset used belongs to the Houston hurricane that caused several damages in nearby areas. In addition, transfer learning is used to store the knowledge (weights) and reuse it in the next task. Moreover, each applied classifier is used to detect the images from the dataset after it is split into training, testing, and validation sets. The Keras library is used to apply the CNN algorithm with each selected classifier to detect the images. Furthermore, the performance …
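A minimal Keras sketch of the transfer-learning setup described above: a pretrained backbone whose stored weights are reused, with a new binary head for the damage / no-damage decision. The backbone choice, input size, and data pipeline are assumptions for illustration, not the article's exact configuration.

```python
# Transfer learning in Keras: freeze a pretrained backbone, add a binary head.
from tensorflow import keras

base = keras.applications.VGG16(include_top=False, weights="imagenet",
                                input_shape=(128, 128, 3), pooling="avg")
base.trainable = False                      # reuse the stored ImageNet weights

model = keras.Sequential([
    base,
    keras.layers.Dense(64, activation="relu"),
    keras.layers.Dense(1, activation="sigmoid"),   # damage vs. no damage
])
model.compile(optimizer="adam", loss="binary_crossentropy", metrics=["accuracy"])

# train_ds / val_ds would come from e.g. keras.utils.image_dataset_from_directory
# pointed at the training and validation splits of the satellite-image dataset:
# model.fit(train_ds, validation_data=val_ds, epochs=5)
```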
Administrative procedures in various organizations produce numerous crucial records and data. These records and data are also used in other processes, like customer relationship management and accounting operations. It is incredibly challenging to use and extract valuable and meaningful information from these data and records because they are frequently enormous and continuously growing in size and complexity. Data mining is the act of sorting through large data sets to find patterns and relationships that might aid in the data analysis process of resolving business issues. Using data mining techniques, enterprises can forecast future trends and make better business decisions. The Apriori algorithm has been …
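Since the abstract cuts off at the Apriori algorithm, a toy, from-scratch sketch of the Apriori idea follows: grow itemsets level by level and prune anything below a minimum support. The transactions and threshold are made-up placeholders.

```python
# Apriori-style frequent-itemset mining on a handful of toy transactions.
transactions = [
    {"invoice", "payment", "customer_record"},
    {"invoice", "payment"},
    {"customer_record", "support_ticket"},
    {"invoice", "payment", "support_ticket"},
]
min_support = 0.5

def support(itemset):
    """Fraction of transactions containing every item of the itemset."""
    return sum(itemset <= t for t in transactions) / len(transactions)

# Level 1: frequent single items.
items = sorted({i for t in transactions for i in t})
frequent = [frozenset([i]) for i in items if support(frozenset([i])) >= min_support]
all_frequent = list(frequent)

# Higher levels: candidates are unions of frequent itemsets from the previous level.
k = 2
while frequent:
    candidates = {a | b for a in frequent for b in frequent if len(a | b) == k}
    frequent = [c for c in candidates if support(c) >= min_support]
    all_frequent.extend(frequent)
    k += 1

for itemset in all_frequent:
    print(set(itemset), round(support(itemset), 2))
```

Association rules (e.g. "invoice implies payment") are then read off the frequent itemsets by checking a confidence threshold.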