Classification of imbalanced data is an important problem. Many algorithms have been developed for classification, such as Back Propagation (BP) neural networks, decision trees, and Bayesian networks, and they have been applied in many fields. These algorithms struggle with imbalanced data, where some classes have far more samples than others. Imbalanced data result in poor performance and a bias toward one class at the expense of the others. In this paper, we propose three techniques based on the Over-Sampling (O.S.) approach for processing an imbalanced dataset, redistributing it, and converting it into a balanced dataset. These techniques are the Improved Synthetic Minority Over-Sampling Technique (Improved SMOTE), Borderline-SMOTE + Imbalance Ratio (IR), and Adaptive Synthetic Sampling (ADASYN) + IR. These techniques generate synthetic samples for the minority class to achieve balance between the minority and majority classes, and then calculate the IR between the minority and majority classes. Experimental results show that the Improved SMOTE algorithm outperforms the Borderline-SMOTE + IR and ADASYN + IR algorithms because it achieves a higher balance between the minority and majority classes.
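The abstract does not spell out the sampling step, so the following minimal sketch (in Python with NumPy and scikit-learn, tooling assumed rather than stated by the paper) shows only the baseline SMOTE interpolation and the imbalance-ratio calculation that the three named variants build on; Improved SMOTE, Borderline-SMOTE, and ADASYN differ in how candidate points and sample counts are chosen.

```python
import numpy as np
from sklearn.neighbors import NearestNeighbors

def smote_sketch(X_min, n_synthetic, k=5, seed=0):
    """Baseline SMOTE: interpolate between a random minority sample
    and one of its k nearest minority neighbors (the paper's Improved
    SMOTE variant is not detailed in the abstract)."""
    rng = np.random.default_rng(seed)
    nn = NearestNeighbors(n_neighbors=k + 1).fit(X_min)
    _, idx = nn.kneighbors(X_min)           # idx[:, 0] is the point itself
    synthetic = []
    for _ in range(n_synthetic):
        i = rng.integers(len(X_min))        # pick a minority sample
        j = idx[i, rng.integers(1, k + 1)]  # pick one of its k neighbors
        gap = rng.random()                  # interpolation factor in [0, 1)
        synthetic.append(X_min[i] + gap * (X_min[j] - X_min[i]))
    return np.asarray(synthetic)

def imbalance_ratio(n_majority, n_minority):
    """IR = majority size / minority size; a value near 1 after
    oversampling indicates the balance the abstract refers to."""
    return n_majority / n_minority
```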
We present the four-parameter exponentiated expanded power function (EEPF) distribution. This distribution is constructed by the exponentiation method introduced by Gupta to expand a distribution by adding a new shape parameter to its cumulative distribution function, resulting in a new distribution; this method is characterized by producing a distribution that belongs to the exponentiated family. We also obtain the survival function and failure rate function for this distribution, derive some of its mathematical properties, and then use the maximum likelihood (ML) method and the developed least squares (LSD) method.
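The truncated abstract does not give the EEPF's four-parameter CDF, but the exponentiation step it attributes to Gupta is standard: a new shape parameter (written here generically as α) is applied to a baseline CDF F(x). A generic statement of that construction, with the survival and failure-rate functions it yields:

\[
G(x;\alpha) = \bigl[F(x)\bigr]^{\alpha}, \qquad \alpha > 0,
\]
\[
S(x) = 1 - \bigl[F(x)\bigr]^{\alpha}, \qquad
h(x) = \frac{g(x)}{S(x)} = \frac{\alpha\, f(x)\,[F(x)]^{\alpha-1}}{1 - [F(x)]^{\alpha}},
\]

where f and F denote the density and CDF of the baseline (here, the expanded power function) distribution.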
Clustered data are common in the social, health, and behavioral sciences; this type of data exhibits dependence among its observations, and the clusters can be expressed through the relationship between measurements on units within the same group.
In this research, I estimate the reliability function for clustered data by using the seemingly unrelated regression (SUR) method.
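The abstract is cut off, but the seemingly unrelated regression estimator it appears to reference is the standard generalized least squares form for a system of M equations with errors correlated across equations (a generic statement, not the paper's specific cluster model):

\[
\hat{\beta}_{\mathrm{SUR}}
= \left( X^{\top} \hat{\Omega}^{-1} X \right)^{-1} X^{\top} \hat{\Omega}^{-1} y,
\qquad
\hat{\Omega} = \hat{\Sigma} \otimes I_n,
\]

where \(\hat{\Sigma}\) is the estimated M-by-M cross-equation error covariance matrix and n is the number of observations per equation.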
This research aims to determine the effectiveness of teaching with a proposed strategy based on the Common Knowledge Construction Model on the mathematical proficiency of second intermediate class students. The researchers adopted the experimental approach, using an experimental design with two independent, equivalent groups and a post-test. The experiment was applied to a sample of (83) students from Badr Shaker Al-Sayyab Intermediate School for Boys during the first semester of the academic year (2021-2022), divided into two groups: an experimental group comprising (42) students and a control group comprising (41) students. The two groups were equated on four variables (chronological age calculated in months, ...).
Background: Various fluids in the oral environment can affect the surface roughness of resin composites. This in vitro study was conducted to determine the influence of mouth rinses on the surface roughness of two methacrylate-based resin composites (a nanofilled and a packable composite) and a silorane-based resin composite.
Materials and methods: Disc-shaped specimens (12 mm in diameter and 2 mm in height) were prepared from three types of composite.
In this paper, we study a non-parametric model in which the response variable has missing data (non-response) in its observations under the MCAR missing-data mechanism. We then suggest kernel-based non-parametric single imputation in place of the missing values and compare it with nearest-neighbor imputation through simulation over several models and different cases of sample size, variance, and rate of missing data.
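As one plausible reading of the kernel-based single imputation the abstract describes (the paper's exact estimator is not shown here), the sketch below imputes each missing response with a Nadaraya-Watson locally weighted mean, alongside the nearest-neighbor rule it is compared against; the function names and the bandwidth h are illustrative.

```python
import numpy as np

def gaussian_kernel(u):
    return np.exp(-0.5 * u**2) / np.sqrt(2.0 * np.pi)

def kernel_impute(x_obs, y_obs, x_miss, h=0.5):
    """Nadaraya-Watson kernel regression as a single-imputation rule:
    each missing response is replaced by a locally weighted average
    of the observed responses (bandwidth h is illustrative)."""
    y_hat = []
    for x0 in x_miss:
        w = gaussian_kernel((x_obs - x0) / h)
        y_hat.append(np.sum(w * y_obs) / np.sum(w))
    return np.asarray(y_hat)

def nn_impute(x_obs, y_obs, x_miss):
    """Nearest-neighbor imputation: copy the response of the closest
    observed covariate value."""
    return np.asarray([y_obs[np.argmin(np.abs(x_obs - x0))] for x0 in x_miss])
```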
The Weibull distribution is considered one of the Type-I Generalized Extreme Value (GEV) distributions, and it plays a crucial role in modeling extreme events in various fields, such as hydrology, finance, and the environmental sciences. Bayesian methods play a strong, decisive role in estimating the parameters of the GEV distribution due to their ability to incorporate prior knowledge and handle small sample sizes effectively. In this research, we compare several shrinkage Bayesian estimation methods based on the squared error and linear exponential (LINEX) loss functions; they were adopted and compared by the Monte Carlo simulation method. The performance of these methods is assessed based on their accuracy and computational efficiency in estimation.
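For reference, the two loss functions the abstract names are standard, and their Bayes estimators have well-known forms: the posterior mean under squared error loss, and a closed form involving the posterior moment-generating function under the LINEX loss:

\[
L_{\mathrm{SE}}(\hat\theta,\theta) = (\hat\theta - \theta)^2,
\qquad
L_{\mathrm{LINEX}}(\hat\theta,\theta) = e^{c(\hat\theta-\theta)} - c(\hat\theta-\theta) - 1,
\]
\[
\hat\theta_{\mathrm{SE}} = E[\theta \mid \text{data}],
\qquad
\hat\theta_{\mathrm{LINEX}} = -\frac{1}{c}\,\ln E\!\left[e^{-c\theta} \mid \text{data}\right],
\]

where c ≠ 0 controls the asymmetry of the LINEX loss.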
We propose two simple, rapid, and convenient spectrophotometric methods for the determination of cephalexin in bulk form and in its pharmaceutical preparations. They are based on measuring the flame atomic emission of the potassium ion (in the first method) and the colorimetric determination at 610 nm of the green colored solution formed after the reaction of cephalexin with potassium permanganate as an oxidizing agent in basic medium (in the second method). The working conditions of the methods are investigated and optimized. The Beer's law plot shows a good correlation in the concentration range of 5-40 µg ml⁻¹. The detection limits are 2.573 and 2.814 µg ml⁻¹ for the flame emission photometric method and 1.844 and 2.016 µg ml⁻¹ for the colorimetric method.
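The calibration workflow behind the reported linear range can be sketched as an ordinary least-squares fit of absorbance against concentration; the data points and blank noise below are purely illustrative, not the paper's measurements.

```python
import numpy as np

# Illustrative calibration points inside the reported 5-40 ug/ml
# linear range; these absorbances are NOT the paper's measurements.
conc = np.array([5.0, 10.0, 20.0, 30.0, 40.0])      # concentration, ug/ml
absorb = np.array([0.11, 0.22, 0.45, 0.66, 0.89])   # absorbance at 610 nm

# Beer's law calibration line: A = slope * C + intercept
slope, intercept = np.polyfit(conc, absorb, 1)
r = np.corrcoef(conc, absorb)[0, 1]                 # linearity check

# A common detection-limit convention: LOD = 3 * sd_blank / slope
sd_blank = 0.005                                    # assumed blank noise
lod = 3.0 * sd_blank / slope
print(f"slope={slope:.4f}  r={r:.4f}  LOD={lod:.2f} ug/ml")
```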
Permeability is the most important parameter indicating how efficiently reservoir fluids flow through the rock pores to the wellbore. Well-log evaluation and core measurement techniques are typically used to estimate it. In this paper, permeability has been predicted using the classical and Flow Zone Indicator (FZI) methods. A comparison between the two methods shows the superiority of the FZI method correlations; these correlations can be used to estimate permeability in un-cored wells with a good approximation.
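The FZI method the paper favors follows the standard Amaefule et al. reservoir quality relations; a minimal sketch in Python (assuming permeability in mD and porosity as a fraction):

```python
import numpy as np

def fzi(k_md, phi):
    """Flow Zone Indicator from core permeability (mD) and porosity
    (fraction): RQI = 0.0314*sqrt(k/phi), phi_z = phi/(1-phi),
    FZI = RQI/phi_z (Amaefule et al. relations)."""
    rqi = 0.0314 * np.sqrt(k_md / phi)
    phi_z = phi / (1.0 - phi)
    return rqi / phi_z

def permeability_from_fzi(fzi_val, phi):
    """Invert the relation to predict permeability (mD) in un-cored
    intervals from a rock type's mean FZI; 1014.24 = 1/0.0314**2."""
    return 1014.24 * fzi_val**2 * phi**3 / (1.0 - phi)**2
```

Once cored intervals are grouped into hydraulic flow units by their FZI values, permeability_from_fzi applied to log-derived porosity gives the un-cored-well estimates the abstract describes.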