A missing data imputation method based on salp swarm algorithm for diabetes disease

Geehan Sabah Hassan Sabah; Noora Jamal Ali Jamal; Asma Khazaal Abdulsahib Abdulsahib; Farah Jasim Mohammed Mohammed

doi:10.11591/eei.v12i3.4528

Details

Publication Date

Thu Jun 01 2023

Journal Name

Bulletin Of Electrical Engineering And Informatics

Volume

12

DOI

10.11591/eei.v12i3.4528

Choose Citation Style

Statistics

View publication

19

Statistics

(6)

(1)

A missing data imputation method based on salp swarm algorithm for diabetes disease

Geehan Sabah Hassan Sabah

Noora Jamal Ali Jamal

Asma Khazaal Abdulsahib Abdulsahib

Farah Jasim Mohammed Mohammed

...Show More Authors

Most of the medical datasets suffer from missing data, due to the expense of some tests or human faults while recording these tests. This issue affects the performance of the machine learning models because the values of some features will be missing. Therefore, there is a need for a specific type of methods for imputing these missing data. In this research, the salp swarm algorithm (SSA) is used for generating and imputing the missing values in the pain in my ass (also known Pima) Indian diabetes disease (PIDD) dataset, the proposed algorithm is called (ISSA). The obtained results showed that the classification performance of three different classifiers which are support vector machine (SVM), K-nearest neighbour (KNN), and Naïve Bayesian classifier (NBC) have been enhanced as compared to the dataset before applying the proposed method. Moreover, the results indicated that issa was performed better than the statistical imputation techniques such as deleting the samples with missing values, replacing the missing values with zeros, mean, or random values.

View Publication

Publication Date

Mon Dec 21 2020

Journal Name

Bulletin Of The Iraq Natural History Museum (p-issn: 1017-8678 , E-issn: 2311-9799)

MONITORING OF THE WILD MAMMAL FAUNA IN BAMO MOUNTAIN IN NORTHERN IRAQ (KURDISTAN) FOR THE FIRST TIME USING CAMERA TRAP METHOD AND RAISING AWARENESS FOR ITS CONSERVATION

Fauna of Iraq

Key species

Persian leopard

Protected areas

Sustainable management.

Soran H.

Soma I.

...Show More Authors

Mammals are under threat worldwide due to deforestation, hunting, and other human activities. In Iraq, a total of 93 species of wild mammals have been recorded including species with global conservation concern. Bamo Mountain is situated within the Zagros Mountains in northern Iraq which is a suitable habitat for wild mammals. Due to scarcity of the field survey efforts and cryptic behavior, monitoring of the wild mammals fauna in Zagros Mountain seems challenging. Therefore, we used a camera trap which seems to be an ideal way to determine species diversity of wild mammals in Bamo Mountain. Moreover, interviews with local villagers were performed. The mammalian diversity of Bamo Mountain is not fully explored but seemed threatened by lo

View Publication Preview PDF

(3)

Publication Date

Wed Jan 01 2020

Journal Name

Advances In Science, Technology And Engineering Systems Journal

Bayes Classification and Entropy Discretization of Large Datasets using Multi-Resolution Data Aggregation

Safaa

Li

...Show More Authors

Big data analysis has important applications in many areas such as sensor networks and connected healthcare. High volume and velocity of big data bring many challenges to data analysis. One possible solution is to summarize the data and provides a manageable data structure to hold a scalable summarization of data for efficient and effective analysis. This research extends our previous work on developing an effective technique to create, organize, access, and maintain summarization of big data and develops algorithms for Bayes classification and entropy discretization of large data sets using the multi-resolution data summarization structure. Bayes classification and data discretization play essential roles in many learning algorithms such a

View Publication

Publication Date

Tue Dec 01 2015

Journal Name

Journal Of Engineering

Ten Years of OpenStreetMap Project: Have We Addressed Data Quality Appropriately? – Review Paper

OpenStreetMap

VGI

spatial data quality

geometrical similarity

positional accuracy.

Maythm

...Show More Authors

It has increasingly been recognised that the future developments in geospatial data handling will centre on geospatial data on the web: Volunteered Geographic Information (VGI). The evaluation of VGI data quality, including positional and shape similarity, has become a recurrent subject in the scientific literature in the last ten years. The OpenStreetMap (OSM) project is the most popular one of the leading platforms of VGI datasets. It is an online geospatial database to produce and supply free editable geospatial datasets for a worldwide. The goal of this paper is to present a comprehensive overview of the quality assurance of OSM data. In addition, the credibility of open source geospatial data is discussed, highlight

View Publication Preview PDF

Publication Date

Sun May 11 2025

Journal Name

Iraqi Statisticians Journal

Estimating General Linear Regression Model of Big Data by Using Multiple Test Technique

Ahmed Mahdi

Munaf Yousif

...Show More Authors

View Publication

Publication Date

Tue Dec 01 2015

Journal Name

Journal Of Engineering

Ten Years of OpenStreetMap Project: Have We Addressed Data Quality Appropriately? – Review Paper

Maythm

...Show More Authors

It has increasingly been recognised that the future developments in geospatial data handling will centre on geospatial data on the web: Volunteered Geographic Information (VGI). The evaluation of VGI data quality, including positional and shape similarity, has become a recurrent subject in the scientific literature in the last ten years. The OpenStreetMap (OSM) project is the most popular one of the leading platforms of VGI datasets. It is an online geospatial database to produce and supply free editable geospatial datasets for a worldwide. The goal of this paper is to present a comprehensive overview of the quality assurance of OSM data. In addition, the credibility of open source geospatial data is discussed, highlighting the diff

(3)

Publication Date

Fri Feb 28 2025

Journal Name

The Iraqi Geological Journal

Structural Interpretation of Najaf Basin Using the Magnetic and Gravity Data, Central Iraq

Tuqa

Osamah S.

Ahmed S.

...Show More Authors

Potential data interpretation is significant for subsurface structure characterization. The current study is an attempt to explore the magnetic low lying between Najaf and Diwaniyah Cities, In central Iraq. It aims to understand the subsurface structures that may result from this anomaly and submit a better subsurface structural image of the region. The study area is situated in the transition zone, known as the Abu Jir Fault Zone. This tectonic boundary is an inherited basement weak zone extending towards the NW-SE direction. Gravity and magnetic data processing and enhancement techniques; Total Horizontal Gradient, Tilt Angle, Fast Sigmoid Edge Detection, Improved Logistic, and Theta Map filters highlight source boundaries and the

View Publication Preview PDF

(3)

Publication Date

Sun Jan 01 2023

Journal Name

Journal Of Engineering

State-of-the-Art in Data Integrity and Privacy-Preserving in Cloud Computing

Cloud Computing (CC)

data integrity

privacy-preserving.

Mariam Duraid

Yousra Abdul Alsahib

...Show More Authors

Cloud computing (CC) is a fast-growing technology that offers computers, networking, and storage services that can be accessed and used over the internet. Cloud services save users money because they are pay-per-use, and they save time because they are on-demand and elastic, a unique aspect of cloud computing. However, several security issues must be addressed before users store data in the cloud. Because the user will have no direct control over the data that has been outsourced to the cloud, particularly personal and sensitive data (health, finance, military, etc.), and will not know where the data is stored, the user must ensure that the cloud stores and maintains the outsourced data appropriately. The study's primary goals are to mak

View Publication Preview PDF

(7)

Publication Date

Sun Jan 01 2023

Journal Name

Petroleum And Coal

Analyzing of Production Data Using Combination of empirical Methods and Advanced Analytical Techniques

Sarah

Sameera

...Show More Authors

(1)

Publication Date

Fri Mar 01 2019

Journal Name

Spatial Statistics

Efficient Bayesian modeling of large lattice data using spectral properties of Laplacian matrix

Adaptive specification

Areal spatial data

Conditionally autoregressive prior

Dimension reduction

Plant abundance

Spike and slab prior

Ghadeer J.M.

Avishek

Mark E.

Anthony G.

...Show More Authors

Spatial data observed on a group of areal units is common in scientific applications. The usual hierarchical approach for modeling this kind of dataset is to introduce a spatial random effect with an autoregressive prior. However, the usual Markov chain Monte Carlo scheme for this hierarchical framework requires the spatial effects to be sampled from their full conditional posteriors one-by-one resulting in poor mixing. More importantly, it makes the model computationally inefficient for datasets with large number of units. In this article, we propose a Bayesian approach that uses the spectral structure of the adjacency to construct a low-rank expansion for modeling spatial dependence. We propose a pair of computationally efficient estimati

View Publication

(9)

(6)

Publication Date

Tue Jan 01 2019

Journal Name

Journal Of Southwest Jiaotong University

Recognizing Job Apathy Patterns of Iraqi Higher Education Employees Using Data Mining Techniques

Mustafa S.

Suhad Faisal

...Show More Authors

Psychological research centers help indirectly contact professionals from the fields of human life, job environment, family life, and psychological infrastructure for psychiatric patients. This research aims to detect job apathy patterns from the behavior of employee groups in the University of Baghdad and the Iraqi Ministry of Higher Education and Scientific Research. This investigation presents an approach using data mining techniques to acquire new knowledge and differs from statistical studies in terms of supporting the researchers’ evolving needs. These techniques manipulate redundant or irrelevant attributes to discover interesting patterns. The principal issue identifies several important and affective questions taken from

View Publication

(1)

1 2 ... 202 203 204 205 ... 1581 1582