Improved Firefly Algorithm with Variable Neighborhood Search for Data Clustering

Hayder Naser Khraibet Al-Behadili

doi:10.21123/bsj.2022.19.2.0409

Details

Publication Date

Fri Apr 01 2022

Journal Name

Baghdad Science Journal

Volume

19

Issue Number

2

Data clustering

Data mining

Firefly algorithm

Machine learning

Variable neighborhood search.

Among metaheuristic algorithms, population-based algorithms are explorative: they are superior to local search algorithms at exploring the search space to find globally optimal solutions. However, their primary downside is a low exploitative capability, which prevents them from expanding the search into the neighborhood of a solution to find better ones. The firefly algorithm (FA) is a population-based algorithm that has been widely used in clustering problems. However, FA is prone to premature convergence when no neighborhood search strategy is employed to improve the quality of clustering solutions in the neighborhood region while still exploring the global regions of the search space. On this basis, this work aims to improve FA by using variable neighborhood search (VNS) as a local search method, so that the hybrid benefits from a trade-off between exploration and exploitation. The proposed FA-VNS allows fireflies to enhance the clustering solutions while maintaining their diversity during the search process by using the perturbation operators of VNS. To evaluate the performance of the algorithm, eight benchmark datasets are used, and FA-VNS is compared with four well-known clustering algorithms. The comparison, based on internal and external evaluation metrics, indicates that the proposed FA-VNS can produce more compact clustering solutions than the well-known clustering algorithms.
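
The general idea of combining a firefly move with a VNS refinement can be sketched as follows. This is only an illustrative sketch under stated assumptions, not the implementation from the paper: the objective (sum of squared distances to the nearest centroid), the parameter values, the uniform "shaking" perturbation, the choice to refine only the best firefly, and all function names (`sse`, `attract`, `vns`, `fa_vns`) are assumptions made for the example.

```python
import math
import random


def sse(centroids, points):
    """Sum of squared distances from each point to its nearest centroid."""
    return sum(
        min(sum((p - c) ** 2 for p, c in zip(pt, cen)) for cen in centroids)
        for pt in points
    )


def attract(xi, xj, beta0, gamma, alpha, rng):
    """Firefly move: pull xi toward the brighter xj, plus a small random step."""
    r2 = sum((a - b) ** 2 for ci, cj in zip(xi, xj) for a, b in zip(ci, cj))
    beta = beta0 * math.exp(-gamma * r2)  # attractiveness decays with distance
    return [
        [a + beta * (b - a) + alpha * (rng.random() - 0.5) for a, b in zip(ci, cj)]
        for ci, cj in zip(xi, xj)
    ]


def vns(x, points, k_max=3, step=0.5, max_tries=60, rng=random):
    """Basic VNS: shake in neighborhood k, keep improvements, widen k on failure."""
    best, best_cost = x, sse(x, points)
    k = 1
    for _ in range(max_tries):
        if k > k_max:
            break
        # Shaking (assumed operator): perturbation radius grows with k.
        cand = [[a + rng.uniform(-1, 1) * step * k for a in c] for c in best]
        cost = sse(cand, points)
        if cost < best_cost:
            best, best_cost, k = cand, cost, 1  # improvement: back to k = 1
        else:
            k += 1  # no improvement: try a larger neighborhood
    return best, best_cost


def fa_vns(points, n_clusters, n_fireflies=8, n_iter=30, seed=0):
    """Each firefly is a list of centroids; lower SSE means brighter."""
    rng = random.Random(seed)
    swarm = [[list(rng.choice(points)) for _ in range(n_clusters)]
             for _ in range(n_fireflies)]
    costs = [sse(x, points) for x in swarm]
    for _ in range(n_iter):
        for i in range(n_fireflies):
            for j in range(n_fireflies):
                if costs[j] < costs[i]:  # firefly j is brighter than i
                    swarm[i] = attract(swarm[i], swarm[j], 1.0, 0.1, 0.2, rng)
                    costs[i] = sse(swarm[i], points)
        # Exploitation step: refine the current best firefly with VNS.
        b = min(range(n_fireflies), key=costs.__getitem__)
        swarm[b], costs[b] = vns(swarm[b], points, rng=rng)
    b = min(range(n_fireflies), key=costs.__getitem__)
    return swarm[b], costs[b]
```

Calling `fa_vns(points, 2)` on a list of 2-D tuples returns the best centroid set found and its SSE. In this sketch the firefly moves supply the exploration, while the VNS step only ever accepts improvements, so the best cost in the swarm never gets worse from one iteration to the next.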

Publication Date

Sun Dec 01 2019

Journal Name

Baghdad Science Journal

Symmetric-Based Steganography Technique Using Spiral-Searching Method for HSV Color Images

Hierarchical decomposition

Image processing

Information hiding

Security Techniques

Spiral search

Steganography.

Raheem Abdul Sahib

Steganography is defined as hiding confidential information in some other chosen medium without leaving any clear evidence of changing the medium's features. Most traditional hiding methods hide the message directly in the cover medium (text, image, audio, or video). Some hiding techniques leave a negative effect on the cover image, so the change in the carrier medium can sometimes be detected by humans and machines. The purpose of the suggested information hiding is to make this change undetectable. The current research focuses on using a complex method, based on the spiral search method, to prevent the detection of hidden information by humans and machines. The Structural Similarity Index Metrics measures are used to get the accuracy and quality

Publication Date

Thu Feb 01 2018

Journal Name

Journal Of Economics And Administrative Sciences

Comparison of Slice inverse regression with the principal components in reducing high-dimensions data by using simulation

dimensions reduction

Slice inverse regression

principal components.

عمر عبد المحسن

زينة ابراهيم

This research aims to study dimension reduction methods that overcome the curse of dimensionality, which arises when traditional methods fail to provide a good estimation of the parameters, so this problem must be dealt with directly. Two approaches were used to handle high-dimensional data. The first is the non-classical Slice inverse regression (SIR) method, together with the proposed weight standard SIR (WSIR) method; the second is principal components (PCA), which is the general method used in reducing dimensions. SIR and PCA are based on linear combinations of a subset of the original explanatory variables, which may suffer from the problem of heterogeneity and the problem of linear

Publication Date

Fri Oct 02 2015

Journal Name

American Journal Of Applied Sciences

Advances in Document Clustering with Evolutionary-Based Algorithms

Text Document Clustering

Hypertext Clustering

Evolutionary Algorithms

Genetic Algorithms

Text Dimensional Reduction

Sarmad

Document clustering is the process of organizing an electronic corpus of documents into subgroups with similar text features. Previously, a number of conventional algorithms were applied to perform document clustering. There are current endeavors to enhance clustering performance by employing evolutionary algorithms, and such endeavors have become an emerging topic gaining more attention in recent years. The aim of this paper is to present an up-to-date and self-contained review fully devoted to document clustering via evolutionary algorithms. It first provides a comprehensive inspection of the document clustering model, revealing its various components and related concepts. Then it shows and analyzes the principal research wor

Publication Date

Fri Aug 01 2014

Journal Name

Journal Of Economics And Administrative Sciences

Efficiency Measurement Model for Postgraduate Programs and Undergraduate Programs by Using Data Envelopment Analysis

خالد زغيتون

Measuring the efficiency of postgraduate and undergraduate programs is one of the essential elements of the educational process. In this study, the colleges of Baghdad University and data for the academic year 2011-2012 were chosen to measure the relative efficiencies of postgraduate and undergraduate programs in terms of their inputs and outputs. A relevant method for analyzing these data is Data Envelopment Analysis (DEA). The main focus of the study is the effect of academic staff on the number of enrolled and alumni students in the postgraduate and undergraduate programs.

Publication Date

Sun Jan 01 2023

Journal Name

In Press

Man’s Search for Meaning

Abdali Hammood Shihan

Publication Date

Thu Dec 31 2020

Journal Name

Journal Of Accounting And Financial Studies ( Jafs )

Application of data content analysis (DEA) technology to evaluate performance efficiency: applied research in the General Tax Authority

data envelopment analysis

performance efficiency assessment

input and output

عمر عبد الواحد

أ.د. بيداء ستار

The aim of the research is to use the data envelopment analysis (DEA) technique to evaluate the performance efficiency of the eight branches of the General Tax Authority located in Baghdad (Karrada, Karkh parties, Karkh Center, Dora, Bayaa, Kadhimiya, New Baghdad, and Rusafa). The inputs are represented by the number of non-accountable taxpayers according to the categories of professions and commercial business, deduction, transfer of property ownership, real estate, and tenders. The outputs are determined according to a checklist that contains nine dimensions to assess the efficiency of the performance of the investigated branches by investing their available resources T

Publication Date

Sun Mar 01 2015

Journal Name

Journal Of Engineering

Multi-Sites Multi-Variables Forecasting Model for Hydrological Data using Genetic Algorithm Modeling

forecasting

multi-sites

multi-variables

cross sites correlation

serial correlation

cross variables correlations

hydrology.

Rafa H.

A two-time-step stochastic multi-variable, multi-site hydrological data forecasting model was developed and verified using a case study. The philosophy of this model is to use the cross-variable correlations, cross-site correlations, and two-step time-lag correlations simultaneously to estimate the parameters of the model, which are then modified using the mutation process of the genetic algorithm optimization model. The objective function to be minimized is the Akaike test value. The case study has four variables and three sites. The variables are the monthly air temperature, humidity, precipitation, and evaporation; the sites are Sulaimania, Chwarta, and Penjwin, which are located in northern Iraq. The model performance was

Publication Date

Thu Jun 01 2023

Journal Name

Bulletin Of Electrical Engineering And Informatics

A missing data imputation method based on salp swarm algorithm for diabetes disease

Geehan Sabah Hassan

Noora Jamal Ali

Asma Khazaal Abdulsahib

Farah Jasim Mohammed

Most medical datasets suffer from missing data, due to the expense of some tests or human error while recording them. This issue affects the performance of machine learning models because the values of some features will be missing. Therefore, there is a need for specific methods for imputing these missing data. In this research, the salp swarm algorithm (SSA) is used for generating and imputing the missing values in the Pima Indian diabetes disease (PIDD) dataset; the proposed algorithm is called ISSA. The obtained results showed that the classification performance of three different classifiers, which are support vector machine (SVM), K-nearest neighbour (KNN), and Naïve B

Publication Date

Wed Jan 01 2014

Journal Name

Scienceasia

A combined compact genetic algorithm and local search method for optimizing the ARMA(1,1) model of a likelihood estimator

R.D.

Azeddien

Mohd Sapiyan

Saad

In this paper, a compact genetic algorithm (CGA) is enhanced by integrating its selection strategy with a steepest descent algorithm (SDA) as a local search method to give I-CGA-SDA. This system is an attempt to avoid the large CPU time and computational complexity of the standard genetic algorithm. Here, CGA dramatically reduces the number of bits required to store the population and has a faster convergence. Consequently, this integrated system is used to optimize the maximum likelihood function lnL(φ1, θ1) of the mixed model. Simulation results based on MSE were compared with those obtained from the SDA and showed that the hybrid genetic algorithm (HGA) and I-CGA-SDA can give a good estimator of (φ1, θ1) for the ARMA(1,1) model. Anot
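
To illustrate why a compact genetic algorithm needs so little memory, the following sketch shows a plain compact GA on a generic bit-string fitness function. It is a minimal sketch under stated assumptions, not the I-CGA-SDA system described above: there is no steepest descent step and no ARMA likelihood, and the function name `compact_ga`, its parameters, and the integer-counter representation are choices made for this example.

```python
import random


def compact_ga(fitness, n_bits, pop_size=50, max_rounds=5000, seed=0):
    """Compact GA: evolve one counter per bit instead of storing a population."""
    rng = random.Random(seed)
    # counts[i] / pop_size is the probability that bit i is 1.
    counts = [pop_size // 2] * n_bits

    def sample():
        # Draw one individual from the current probability vector.
        return [1 if rng.random() * pop_size < c else 0 for c in counts]

    for _ in range(max_rounds):
        a, b = sample(), sample()
        winner, loser = (a, b) if fitness(a) >= fitness(b) else (b, a)
        for i in range(n_bits):
            if winner[i] != loser[i]:
                # Shift the counter one step (1 / pop_size) toward the winner's bit.
                counts[i] += 1 if winner[i] == 1 else -1
        if all(c in (0, pop_size) for c in counts):
            break  # every bit is fixed: the vector has converged
    probs = [c / pop_size for c in counts]
    return [1 if q >= 0.5 else 0 for q in probs], probs
```

For example, `compact_ga(sum, 8)` maximizes the number of ones in an 8-bit string. A standard GA would store `pop_size * n_bits` bits for the population, whereas this sketch keeps only `n_bits` small counters, which is the source of the memory saving.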

Publication Date

Thu Mar 30 2023

Journal Name

Journal Of Economics And Administrative Sciences

An Artificial Intelligence Algorithm to Optimize the Classification of the Hepatitis Type

Regression Tree Classification (CART)

Radial Basis Function (RBF)

Genetic Algorithm (GA)

Hiba

Sabah

Hepatitis is one of the diseases that has become more widespread in recent years in terms of the high number of infections. Hepatitis causes inflammation that destroys liver cells, and it occurs as a result of viruses, bacteria, blood transfusions, and other causes. There are five types of hepatitis viruses (A, B, C, D, and E), classified according to their severity; the disease varies by type. Accurate and early diagnosis is the best way to prevent the disease, as it allows infected people to take preventive steps so that they do not transmit the infection to other people, and diagnosis using artificial intelligence gives an accurate and rapid diagnostic result. Where the analytical method of the data relied on the radial basis network to diagnose the
