XGBOOST AND COST-SENSITIVE CART FOR
IMBALANCED MULTICLASS DIABETES
CLASSIFICATION IN IRAQ

Nabila A. Alsharif; Inaam Aboud Hussain; Loaiy F. Naji

Details

Publication Date

Tue Feb 03 2026

Journal Name

Journal Of Mechanics Of Continua And Mathematical Sciences

Volume

21

Issue Number

2


Classification

XGBoost

CART

Class imbalance

Diabetes

Pre-diabetic


Diabetes imposes a substantial public health burden. According to the International Diabetes Federation, there were about 3.4 million diabetes-related deaths worldwide in 2024. For Iraq, the Federation reports that in 2024 one in nine adults was living with diabetes, 14,683 adult deaths were attributable to diabetes, and total diabetes-related health expenditure was 2,078 million United States dollars. The dataset analyzed in this study contains 1,000 records collected in 2020 from two Iraqi teaching hospitals. It includes multiple clinical and laboratory measurements and three outcome classes: Non-diabetic, Pre-diabetic, and Diabetic. The Pre-diabetic class has low prevalence, and the overall class distribution is imbalanced. The data are challenging because they contain many outliers, non-homogeneous covariance matrices across classes, exact duplicate rows, and linear correlations among certain variables. The study objective was to train and evaluate models that discriminate among the three classes and yield accurate, well-calibrated predictions for future cases in similar clinical settings. Because these properties of the data limited the applicability of classical discriminant functions, two supervised learners were employed: Classification and Regression Trees (CART) and Extreme Gradient Boosting (XGBoost). Preprocessing removed exact duplicate rows before modelling and excluded VLDL, which is algebraically derived from triglycerides (in mmol per liter, VLDL equals triglycerides divided by 2.2) and would therefore introduce redundancy and multicollinearity. On the held-out test set, XGBoost achieved higher Accuracy (98.18 percent versus 97.58 percent for CART) and higher Balanced Accuracy (93.84 percent versus 88.16 percent for CART), indicating that XGBoost provided the strongest overall operating point for this three-class task, while CART remains useful when simple and transparent rules are required.
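
The gap between Accuracy and Balanced Accuracy on an imbalanced three-class problem can be made concrete with a minimal Python sketch. It is an illustration under stated assumptions, not the authors' pipeline: the column names (`TG`, `VLDL`, `CLASS`), the helper `preprocess`, and the toy labels are all hypothetical, and the numbers it prints are unrelated to the study's results. Balanced Accuracy is taken here in its usual sense, the unweighted mean of per-class recall.

```python
from collections import Counter


def preprocess(rows, derived_column="VLDL"):
    """Drop exact duplicate rows, then drop a derived column.

    `rows` is a list of dicts; the column names are hypothetical.
    """
    seen = set()
    cleaned = []
    for row in rows:
        key = tuple(sorted(row.items()))  # order-independent row signature
        if key in seen:
            continue  # exact duplicate of an earlier row
        seen.add(key)
        cleaned.append({k: v for k, v in row.items() if k != derived_column})
    return cleaned


def accuracy(y_true, y_pred):
    """Fraction of all predictions that are correct."""
    correct = sum(t == p for t, p in zip(y_true, y_pred))
    return correct / len(y_true)


def balanced_accuracy(y_true, y_pred):
    """Unweighted mean of per-class recall."""
    totals = Counter(y_true)
    hits = Counter(t for t, p in zip(y_true, y_pred) if t == p)
    recalls = [hits[c] / totals[c] for c in totals]
    return sum(recalls) / len(recalls)


# Toy labels (NOT the study's data): D = Diabetic, N = Non-diabetic,
# P = Pre-diabetic. The rare class P is mostly misclassified as D.
y_true = ["D"] * 80 + ["N"] * 15 + ["P"] * 5
y_pred = ["D"] * 80 + ["N"] * 15 + ["D"] * 3 + ["P"] * 2

print(f"accuracy          = {accuracy(y_true, y_pred):.4f}")           # 0.9700
print(f"balanced accuracy = {balanced_accuracy(y_true, y_pred):.4f}")  # 0.8000
```

In this toy case only three of 100 cases are wrong, so Accuracy stays at 0.97, but recall on the rare class is 2/5, which pulls Balanced Accuracy down to 0.80. This is why a model comparison on imbalanced classes benefits from reporting both metrics.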


Publication Date

Sat Jan 19 2019

Journal Name

Artificial Intelligence Review

Survey on supervised machine learning techniques for automatic text classification

Kadhim A.I.


Publication Date

Wed Sep 23 2020

Journal Name

Artificial Intelligence Research

Hybrid approaches to feature subset selection for data classification in high-dimensional feature space

Maysa

John Q


This paper proposes two hybrid feature subset selection approaches based on the combination (union or intersection) of both supervised and unsupervised filter approaches before using a wrapper, aiming to obtain low-dimensional features with high accuracy and interpretability and low time consumption. Experiments with the proposed hybrid approaches have been conducted on seven high-dimensional feature datasets. The classifiers adopted are support vector machine (SVM), linear discriminant analysis (LDA), and K-nearest neighbour (KNN). Experimental results have demonstrated the advantages and usefulness of the proposed methods in feature subset selection in high-dimensional space in terms of the number of selected features and time spe


Publication Date

Sun Mar 26 2017

Journal Name

Iraqi Journal Of Pharmaceutical Sciences (P-ISSN 1683-3597, E-ISSN 2521-3512)

Gestational Diabetes Mellitus and Hormonal Alteration

Sura

Amer

Aufaira


Gestational diabetes mellitus is defined as carbohydrate intolerance first detected during pregnancy. Pregnancy is a period of intense hormonal change. The aim of the present study was to investigate a possible relation between gestational diabetes mellitus and changes in serum hormones such as luteinizing hormone (LH), follicle-stimulating hormone (FSH), progesterone, and prolactin. Thirty patients with gestational diabetes mellitus aged 22-40 years, attending the National Center for Treatment and Research of Diabetes, Al-Mustansiriya University, in Baghdad, and 29 controls aged 20-39 years participated. Hormonal tests, including FSH, LH, progesterone, and prolactin, were detected using Enzyme Linked Fluorescent Assay (ELFA) k


Publication Date

Sat Jun 01 2024

Journal Name

Alexandria Engineering Journal

U-Net for genomic sequencing: A novel approach to DNA sequence classification

DNA sequence classification; U-net architecture; Deep learning; Genomics; Sequence data

Raghad K

Azmi Tawfeq Hussein

Ali Jbaeer


The precise classification of DNA sequences is pivotal in genomics, holding significant implications for personalized medicine. The stakes are particularly high when classifying key genetic markers such as BRCA, related to breast cancer susceptibility; BRAF, associated with various malignancies; and KRAS, a recognized oncogene. Conventional machine learning techniques often necessitate intricate feature engineering and may not capture the full spectrum of sequence dependencies. To ameliorate these limitations, this study employs an adapted U-Net architecture, originally designed for biomedical image segmentation, to classify DNA sequences. The attention mechanism was also tested along with the U-Net architecture to precisely classify DNA sequences


Publication Date

Fri Mar 01 2013

Journal Name

Journal Of Economics And Administrative Sciences

Integration The Cost Techniques with Balanced Scorecard for The Purposes of Measuring and Evaluating Performance

Performance Measures

Performance Evaluation

Balanced Scorecard (BSC)

Financial and Non-Financial Measures

Cost Management Techniques

منال جبار

لينا كرابيت


Effective application of performance measurement and evaluation under the Balanced Scorecard requires a comprehensive, integrated information system covering the internal and external environment. This in turn requires developing the accounting information system in general, and cost management information systems in particular, to suit the requirements of that environment, including modern measurement methods that have proven effective in measuring and evaluating performance.

The research problem lies in management's need to develop methods of measuring and evaluating performance through the use of both financial measures and non


Publication Date

Sat Jan 01 2022

Journal Name

IEEE Access

Wrapper and Hybrid Feature Selection Methods Using Metaheuristic Algorithms for English Text Classification: A Systematic Review

Metaheuristics

Feature extraction

Text categorization

Classification algorithms

Systematics

Search problems

Business

Osamah Mohammed

Yu-N

Ammar Kamal

Omar Mustafa


Feature selection (FS) constitutes a series of processes used to decide which relevant features/attributes to include and which irrelevant features to exclude for predictive modeling. It is a crucial task that helps machine learning classifiers reduce error rates, computation time, and overfitting, and improve classification accuracy. It has demonstrated its efficacy in a myriad of domains, including text classification (TC), text mining, and image recognition. While there are many traditional FS methods, recent research efforts have been devoted to applying metaheuristic algorithms as FS techniques for the TC task. However, there are few literature reviews concerning TC. Therefore, a comprehensive overview was systematicall


Publication Date

Wed Dec 08 2021

Journal Name

Scientific Reports

Weakly Supervised Sensitive Heatmap framework to classify and localize diabetic retinopathy lesions

Mohammed

Ameer Hussein

Mustafa

MD Samiul


Vision loss happens due to diabetic retinopathy (DR) in severe stages. Thus, an automatic detection method applied to diagnose DR in an earlier phase may help medical doctors to make better decisions. DR is considered one of the main risks, leading to blindness. Computer-Aided Diagnosis systems play an essential role in detecting features in fundus images. Fundus images may include blood vessels, exudates, micro-aneurysm, hemorrhages, and neovascularization. In this paper, our model combines automatic detection for the diabetic retinopathy classification with localization methods depending on weakly-supervised learning. The model has four stages; in stage one, various preprocessing techniques are app


Publication Date

Mon Jun 01 2026

Journal Name

Iraqi Journal For Computers And Informatics

Explainable Federated Learning for Brain Tumor Classification Using Multi-Source MRI Data

Brain Tumor Classification

Magnetic Resonance Imaging (MRI)

Federated Learning FL

Non-IID

Suhad

Belal


Early diagnosis and clinical decision-making depend on accurate brain tumor classification using magnetic resonance imaging (MRI). However, traditional deep learning methods usually rely on centralized medical data, which raises privacy concerns and limits the use of distributed clinical data. This research proposes a privacy-preserving federated learning framework for MRI image-based binary brain tumor classification using a decentralized ResNet-18 architecture that enables collaborative training without sharing raw patient data. To reflect realistic clinical conditions, the framework integrates heterogeneous multi-source datasets in different image formats (PNG and JPG) and evaluates performance under both IID and non-IID settings


Publication Date

Thu Jan 04 2024

Journal Name

Journal Of Accounting And Financial Studies (JAFS)

The effect of material flow cost accounting in reducing the cost of products - an applied study in Diyala State Company

material flow cost accounting

product cost reduction

Asst. Lect. Hassan Nayeb Dahi

Prof. Dr. Hanan Sahabat Abdallah


The current research sought to demonstrate the effect of material flow cost accounting on reducing product costs through the application of the material flow cost accounting technique, which works on the optimal utilization of materials and energy and the reduction of environmental impacts. The research aims to clarify the knowledge foundations for material flow cost accounting, in addition to studying the material flow cost accounting technique that helps reduce the cost of products and make them environmentally friendly. To achieve this, the research relied on the descriptive approach with regard to the theoretical aspect of the resea


Publication Date

Fri Jul 29 2022

Journal Name

Journal For Vascular Ultrasound

A Comparative Study of the Right and Left Carotid Arteries in Relation to Age for Patients With Diabetes and Hypertension

Introduction: Age, hypertension, and diabetes can cause significant alterations in arterial structure and function, including changes in lumen diameter (LD), intimal-medial thickness (IMT), flow velocities, and arterial compliance. These are also considered risk markers of atherosclerosis and cerebrovascular disease. A difference between right and left carotid artery blood flow and IMT has been reported by some researchers, and a difference in the incidence of nonlacunar stroke has been reported between the right and left brain hemispheres. The aim of this study was to determine whether there are differences between the right and left common carotid arteries and internal carotid arteries in patients with hypertension and diabetes for 2 age groups. Methods: We studied 250 patients with both diabetes and hypertension. Patients were divided into 2 age groups with the old age group being 56 to 75 years and the young age group 35 to 55 years. The bilateral common carotid and internal carotid arteries were evaluated with B-mode ultrasound and Doppler examinations. The LD and IMT were measured for both common carotid arteries, and spectral waveform parameters and indices were recorded for both internal carotid arteries. Results: The difference in LD between the left and right common carotid arteries for the old age group was 11.64% and for the young age group was 6.42%, with significant P values of <.05 for both age groups. The difference in IMT between the left and right common carotid arteries was 18.27% in the old age group compared with 15.38% in the young age group, with significant P values of <.05. There was a difference in peak systolic velocity between the left and right internal carotid arteries of 4.85% in the old age group which was not significant, compared with 14.28% in the young age group with a significant P value <.05, whereas the difference in end-diastolic velocity between the left and right internal carotid arteries was not significant for both age groups. Differences between the right and left internal carotid arteries for resistive index, pulsatility index, and pressure gradient were significant only in the young age group. Conclusion: We found significant differences between the right and left common carotid and internal carotid arteries in patients with diabetes and hypertension which were more prominent in the young age group. Values for common carotid IMT and LD were significantly higher in the left common carotid artery versus the right common carotid artery in both age groups. Differences between the 2 carotid sides may be attributed to anatomic variations in the common carotid artery origins which lead to differences in stress between the 2 sides.

Ahmed Abduljabar

Samar I.

Anmar Z


