Details

Publication Date

Sat Sep 30 2023

Journal Name

Iraqi Journal Of Science

Volume

64

Issue Number

9

Choose Citation Style

Statistics

Abstract Views

30

Galley Views

23

Statistics

An Integrated Information Gain with A Black Hole Algorithm for Feature Selection: A Case Study of E-mail Spam Filtering

E-mail Spam Filtering

Black Hole Algorithm

Feature Selection

Naïve Bayesian Classifier

Amaal Mahmood

Adnan Hadi Mahdi Al-Helali

...Show More Authors

The current issues in spam email detection systems are directly related to spam email classification's low accuracy and feature selection's high dimensionality. However, in machine learning (ML), feature selection (FS) as a global optimization strategy reduces data redundancy and produces a collection of precise and acceptable outcomes. A black hole algorithm-based FS algorithm is suggested in this paper for reducing the dimensionality of features and improving the accuracy of spam email classification. Each star's features are represented in binary form, with the features being transformed to binary using a sigmoid function. The proposed Binary Black Hole Algorithm (BBH) searches the feature space for the best feature subsets, and feature selection is based on a fitness function that is proportional to the accuracy achieved using a Naive Bayesian Classifier (NBC). When measuring the performance of the BBH with the SpamBase dataset, the performance of the classifier and the dimension of the selected feature vector used as a classifier input are considered. The experiments revealed that the BBH can produce good FS results even with a small set of selected features. This shows that when utilizing the NBC-based BBH, good spam email categorization accuracy is possible.

View Publication Preview PDF

Quick Preview PDF