doi:10.24996/ijs.2021.62.7.32

Details

Publication Date

Sat Jul 31 2021

Journal Name

Iraqi Journal Of Science

Issue Number

7

DOI

10.24996/ijs.2021.62.7.32

Choose Citation Style

Statistics

View publication

7

Abstract Views

374

Galley Views

290

Statistics

(3)

(1)

A Parallel Clustering Analysis Based on Hadoop Multi-Node and Apache Mahout

Big Data

Hadoop

Mahout

Predictive Analytics

Parallel K-means

Noor S.

Suhad A.

...Show More Authors

The conventional procedures of clustering algorithms are incapable of overcoming the difficulty of managing and analyzing the rapid growth of generated data from different sources. Using the concept of parallel clustering is one of the robust solutions to this problem. Apache Hadoop architecture is one of the assortment ecosystems that provide the capability to store and process the data in a distributed and parallel fashion. In this paper, a parallel model is designed to process the k-means clustering algorithm in the Apache Hadoop ecosystem by connecting three nodes, one is for server (name) nodes and the other two are for clients (data) nodes. The aim is to speed up the time of managing the massive scale of healthcare insurance dataset with the size of 11 GB and also using machine learning algorithms, which are provided by the Mahout Framework. The experimental results depict that the proposed model can efficiently process large datasets. The parallel k-means algorithm outperforms the sequential k-means algorithm based on the execution time of the algorithm, where the required time to execute a data size of 11 GB is around 1.847 hours using the parallel k-means algorithm, while it equals 68.567 hours using the sequential k-means algorithm. As a result, we deduce that when the nodes number in the parallel system increases, the computation time of the proposed algorithm decreases.

View Publication Preview PDF

Quick Preview PDF

Publication Date

Sat Dec 30 2023

Journal Name

Iraqi Journal Of Science

Agriculture Cadaster Map of Al-Shehimea

cadastral information

digital cadastral map

design

spatial data

land parcel identification system.

Nawal

Khalid

Hashem

...Show More Authors

The cadastral map is very important because it has technical and materialist
specification of the property borders and these maps which are land registration
based on it in Iraq, the problem is an ancient maps and unfit for use, despite its
importance, Therefor the updating and digitize the cadastral map is very pivotal, this
is what we have done in the present work.
In the present work, we have an old cadastral map (as a paper) was made in 1932
with modern satellite image (Quick Bird ) 2006, which has 61 cm resolution for the
same area after. Geometric correction technique has been applied by using image-toimage
method or (image registration ) and after that we get new agricultural cadaster
map and connect the

View Publication Preview PDF

Publication Date

Mon Jun 01 2020

Journal Name

Journal Of Engineering

An An Accurate Estimation of Shear Wave Velocity Using Well Logging Data for Khasib Carbonate Reservoir - Amara Oil Field

shear velocity

compressional velocity

well log data

dynamic modules

multiple regression

geomechanic.

Rwaida K.

Ayad A.

...Show More Authors

Shear and compressional wave velocities, coupled with other petrophysical data, are vital in determining the dynamic modules magnitude in geomechanical studies and hydrocarbon reservoir characterization. But, due to field practices and high running cost, shear wave velocity may not available in all wells. In this paper, a statistical multivariate regression method is presented to predict the shear wave velocity for Khasib formation - Amara oil fields located in South- East of Iraq using well log compressional wave velocity, neutron porosity and density. The accuracy of the proposed correlation have been compared to other correlations. The results show that, the presented model provides accurate

View Publication Preview PDF

Publication Date

Tue Mar 01 2022

Journal Name

The International Journal Of Nonlinear Analysis And Applications

Improved optimality checkpoint for decision making by using the sub-triangular form

Assignment problems Decision making Imprecise data Optimality check Point Sub-Triangular form

Zeina

...Show More Authors

Decision-making in Operations Research is the main point in various studies in our real-life applications. However, these different studies focus on this topic. One drawback some of their studies are restricted and have not addressed the nature of values in terms of imprecise data (ID). This paper thus deals with two contributions. First, decreasing the total costs by classifying subsets of costs. Second, improving the optimality solution by the Hungarian assignment approach. This newly proposed method is called fuzzy sub-Triangular form (FS-TF) under ID. The results obtained are exquisite as compared with previous methods including, robust ranking technique, arithmetic operations, magnitude ranking method and centroid ranking method. This

View Publication Preview PDF

Publication Date

Mon Jan 01 2018

Journal Name

International Journal Of Data Mining, Modelling And Management

Association rules mining using cuckoo search algorithm

data mining

ARM

association rules mining

DCS

discrete cuckoo search

metaheuristic algorithm

Mohammed R.A.

MEHDI G. DUAIMI

...Show More Authors

Association rules mining (ARM) is a fundamental and widely used data mining technique to achieve useful information about data. The traditional ARM algorithms are degrading computation efficiency by mining too many association rules which are not appropriate for a given user. Recent research in (ARM) is investigating the use of metaheuristic algorithms which are looking for only a subset of high-quality rules. In this paper, a modified discrete cuckoo search algorithm for association rules mining DCS-ARM is proposed for this purpose. The effectiveness of our algorithm is tested against a set of well-known transactional databases. Results indicate that the proposed algorithm outperforms the existing metaheuristic methods.

View Publication Preview PDF

(7)

(3)

Publication Date

Mon May 11 2020

Journal Name

Baghdad Science Journal

Proposing Robust LAD-Atan Penalty of Regression Model Estimation for High Dimensional Data

Atan penalty

High dimensional data

Least absolute deviation

Robust regression

Variable selection.

Ali Hameed

Omar Abdulmohsin

...Show More Authors

The issue of penalized regression model has received considerable critical attention to variable selection. It plays an essential role in dealing with high dimensional data. Arctangent denoted by the Atan penalty has been used in both estimation and variable selection as an efficient method recently. However, the Atan penalty is very sensitive to outliers in response to variables or heavy-tailed error distribution. While the least absolute deviation is a good method to get robustness in regression estimation. The specific objective of this research is to propose a robust Atan estimator from combining these two ideas at once. Simulation experiments and real data applications show that the p

View Publication Preview PDF

(4)

(1)

Publication Date

Sun Oct 01 2017

Journal Name

Diyala Journal For Pure Science

Employing difference technique in some Liu estimators to semiparametric regression model

Difference based liu estimator (DBL)

Difference based almost unbiased liu estimator (DBAUL)

K-nearest neighbor smoother .

Saja

Arshad

...Show More Authors

Semiparametric methods combined parametric methods and nonparametric methods ,it is important in most of studies which take in it's nature more progress in the procedure of accurate statistical analysis which aim getting estimators efficient, the partial linear regression model is considered the most popular type of semiparametric models, which consisted of parametric component and nonparametric component in order to estimate the parametric component that have certain properties depend on the assumptions concerning the parametric component, where the absence of assumptions, parametric component will have several problems for example multicollinearity means (explanatory variables are interrelated to each other) , To treat this problem we use

View Publication

Publication Date

Sun May 17 2020

Journal Name

Iraqi Journal Of Science

Multicomponent Inverse Lomax Stress-Strength Reliability

Reliability

Multicomponent system

s-out of-k model

Invers Lomax distribution

Maximum likelihood estimation

Regression estimation

Nada S.

Shahbaa M.

Bushra J.

...Show More Authors

In this article we derive two reliability mathematical expressions of two kinds of s-out of -k stress-strength model systems; and . Both stress and strength are assumed to have an Inverse Lomax distribution with unknown shape parameters and a common known scale parameter. The increase and decrease in the real values of the two reliabilities are studied according to the increase and decrease in the distribution parameters. Two estimation methods are used to estimate the distribution parameters and the reliabilities, which are Maximum Likelihood and Regression. A comparison is made between the estimators based on a simulation study by the mean squared error criteria, which revealed that the maximum likelihood estimator works the best.

View Publication Preview PDF

(4)

Publication Date

Fri Mar 01 2013

Journal Name

Journal Of Economics And Administrative Sciences

Stability testing of time series data for CT Large industrial establishments in Iraq

احمد سلطان

...Show More Authors

Abstract: -
The concept of joint integration of important concepts in macroeconomic application, the idea of cointegration is due to the Granger (1981), and he explained it in detail in Granger and Engle in Econometrica (1987). The introduction of the joint analysis of integration in econometrics in the mid-eighties of the last century, is one of the most important developments in the experimental method for modeling, and the advantage is simply the account and use it only needs to familiarize them selves with ordinary least squares.

Cointegration seen relations equilibrium time series in the long run, even if it contained all the sequences on t

View Publication Preview PDF

Publication Date

Thu Feb 01 2024

Journal Name

Baghdad Science Journal

Estimating the Parameters of Exponential-Rayleigh Distribution for Progressively Censoring Data with S- Function about COVID-19

COVID-19

Exponential-Rayleigh distribution (ERD)

Progressively censored data

Ranking function

S-function.

Rihaam N.

Iden H.

...Show More Authors

The two parameters of Exponential-Rayleigh distribution were estimated using the maximum likelihood estimation method (MLE) for progressively censoring data. To find estimated values for these two scale parameters using real data for COVID-19 which was taken from the Iraqi Ministry of Health and Environment, AL-Karkh General Hospital. Then the Chi-square test was utilized to determine if the sample (data) corresponded with the Exponential-Rayleigh distribution (ER). Employing the nonlinear membership function (s-function) to find fuzzy numbers for these parameters estimators. Then utilizing the ranking function transforms the fuzzy numbers into crisp numbers. Finally, using mean square error (MSE) to compare the outcomes of the survival

View Publication Preview PDF

Publication Date

Sat Jan 01 2022

Journal Name

The International Journal Of Nonlinear Analysis And Applications

Developing Bulk Arrival Queuing Models with Constant Batch Policy Under Uncertainty Data Using (0-1) Variables

Constant batch size Uncertainty data Mixed-integer Non-linear programming (0 - 1) variables

Zeina

...Show More Authors

This paper delves into some significant performance measures (PMs) of a bulk arrival queueing system with constant batch size b, according to arrival rates and service rates being fuzzy parameters. The bulk arrival queuing system deals with observation arrival into the queuing system as a constant group size before allowing individual customers entering to the service. This leads to obtaining a new tool with the aid of generating function methods. The corresponding traditional bulk queueing system model is more convenient under an uncertain environment. The α-cut approach is applied with the conventional Zadeh's extension principle (ZEP) to transform the triangular membership functions (Mem. Fs) fuzzy queues into a family of conventional b

1 2 3 4 ... 2306 2307 2308 2309