Article - ijs-2747 - Digital Repository

Details

Publication Date

Sat Jul 31 2021

Journal Name

Iraqi Journal Of Science

DOI

10.24996/ijs.2021.62.7.32

Keywords

Big Data

Hadoop

Mahout

Predictive Analytics

Parallel K-means

Authors (2)

Noor S.

Suhad A.

A Parallel Clustering Analysis Based on Hadoop Multi-Node and Apache Mahout

Conventional clustering algorithms cannot cope with the rapid growth of data generated from different sources. Parallel clustering is one robust solution to this problem. The Apache Hadoop architecture is one of several ecosystems that can store and process data in a distributed, parallel fashion. In this paper, a parallel model is designed to run the k-means clustering algorithm in the Apache Hadoop ecosystem on three connected nodes: one server (name) node and two client (data) nodes. The aim is to reduce the time needed to process a massive healthcare insurance dataset of 11 GB using the machine learning algorithms provided by the Mahout framework. The experimental results show that the proposed model can process large datasets efficiently. The parallel k-means algorithm outperforms the sequential k-means algorithm in execution time: processing the 11 GB dataset takes around 1.847 hours with the parallel algorithm, compared with 68.567 hours with the sequential algorithm. As a result, we deduce that as the number of nodes in the parallel system increases, the computation time of the proposed algorithm decreases.
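The idea of splitting k-means across data nodes can be illustrated with a minimal, single-process sketch. It is not the paper's Hadoop or Mahout implementation: the function names (`map_phase`, `reduce_phase`, `kmeans`), the toy two-dimensional points, and the two in-memory "chunks" standing in for the two data nodes are all assumptions made for illustration. Each mapper assigns its points to the nearest centroid and emits per-cluster partial sums; the reducer merges those sums and recomputes the centroids.

```python
from collections import defaultdict


def nearest(point, centroids):
    """Return the index of the centroid closest to point (squared Euclidean distance)."""
    return min(
        range(len(centroids)),
        key=lambda i: sum((p - c) ** 2 for p, c in zip(point, centroids[i])),
    )


def map_phase(chunk, centroids):
    """Mapper: emit per-cluster partial sums and counts for one data chunk."""
    dim = len(centroids[0])
    partial = defaultdict(lambda: [[0.0] * dim, 0])
    for point in chunk:
        idx = nearest(point, centroids)
        sums = partial[idx][0]
        for d, value in enumerate(point):
            sums[d] += value
        partial[idx][1] += 1
    return partial


def reduce_phase(partials, centroids):
    """Reducer: merge the partial sums from all mappers and recompute centroids."""
    dim = len(centroids[0])
    totals = [[0.0] * dim for _ in centroids]
    counts = [0] * len(centroids)
    for partial in partials:
        for idx, (sums, n) in partial.items():
            counts[idx] += n
            for d in range(dim):
                totals[idx][d] += sums[d]
    # Keep the previous centroid if a cluster received no points.
    return [
        [t / counts[i] for t in totals[i]] if counts[i] else list(centroids[i])
        for i in range(len(centroids))
    ]


def kmeans(chunks, centroids, iterations=10):
    """Run a fixed number of map/reduce rounds over the data chunks."""
    for _ in range(iterations):
        partials = [map_phase(chunk, centroids) for chunk in chunks]
        centroids = reduce_phase(partials, centroids)
    return centroids


# Two chunks stand in for the two data nodes (hypothetical toy data).
chunks = [
    [(1.0, 1.0), (1.0, 2.0), (9.0, 9.0)],
    [(2.0, 1.0), (8.0, 9.0), (9.0, 8.0)],
]
final_centroids = kmeans(chunks, [(0.0, 0.0), (10.0, 10.0)])
```

In a real cluster the mappers would run concurrently on separate nodes, which is where the time saving comes from. For scale, the execution times reported in the abstract correspond to a speedup of roughly 68.567 / 1.847 ≈ 37 times.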

Publication Date

Thu Nov 17 2016

Journal Name

Plos One

Efficient and Stable Routing Algorithm Based on User Mobility and Node Density in Urban Vehicular Network

Vehicular ad hoc networks (VANETs) are considered an emerging technology in the industrial and educational fields. This technology is essential in the deployment of the intelligent transportation system, which aims to improve the safety and efficiency of traffic. VANETs can be implemented effectively by transmitting data among vehicles over multiple hops. However, the intrinsic characteristics of VANETs, such as their dynamic network topology and intermittent connectivity, limit data delivery. One particular challenge of this network is the possibility that the contributing node may only remain in the network for a limited time. Hence, to prevent data loss from that node, the information must reach the destina

Authors (7)

Yusor Rafid Bahar

Mahamod

Nor Fadzilah

Publication Date

Wed Feb 08 2023

Journal Name

Iraqi Journal Of Science

Watershed Transform Based on Clustering Techniques to Extract Brain Tumors in MRI

In this work, the watershed transform method was implemented to detect and extract tumors and abnormalities in skull-stripped brain MRI images. An adaptive technique is proposed to improve the performance of this method. The watershed transform algorithm was combined with two clustering techniques, K-Means and FCM, to reduce the oversegmentation problem. The K-Means and FCM clustered images were used as input images to the watershed algorithm, in addition to the original image. The relative surface area of the extracted tumor region was calculated for each application. The results showed that the watershed transform algorithm succeeded in detecting and extracting the brain tumor regions very well according to the consultation of a specialist doctor a

Authors (1)

Rabab

Publication Date

Thu Nov 29 2018

Journal Name

Iraqi Journal Of Science

A new Color image Encryption based on multi Chaotic Maps

This paper presents a new RGB image encryption scheme using multiple chaotic maps. The image is encrypted via chaotic maps to ensure that the properties of a secure cipher, namely confusion and diffusion, are satisfied. The key sequence for encrypting an image is generated using a combination of 1D logistic and sine chaotic maps. Experimental and comparison results indicate that the suggested scheme provides high security against several types of attack, a large secret keyspace, and high sensitivity.
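As a rough illustration of how a key sequence might be derived from a logistic map and a sine map, the sketch below iterates both maps, quantizes each state to a byte, and XORs the two bytes. The abstract does not say how the two maps are combined, so the XOR combination, the parameter values (`r`, `a`, `x0`, `y0`, `burn_in`), and the function names are all assumptions. It is a toy stream cipher for illustration only, not the authors' scheme and not suitable for real security.

```python
import math


def keystream(length, x0=0.3141, y0=0.5926, r=3.99, a=0.99, burn_in=100):
    """Generate `length` key bytes from a logistic map and a sine map.

    Logistic map: x <- r * x * (1 - x)
    Sine map:     y <- a * sin(pi * y)
    The combination by XOR is an assumption made for this sketch.
    """
    x, y = x0, y0
    out = bytearray()
    for i in range(burn_in + length):
        x = r * x * (1.0 - x)
        y = a * math.sin(math.pi * y)
        if i >= burn_in:  # discard the transient iterations
            bx = int(x * 256) % 256
            by = int(y * 256) % 256
            out.append(bx ^ by)
    return bytes(out)


def xor_cipher(data, **params):
    """Encrypt or decrypt bytes by XOR with the chaotic keystream (symmetric)."""
    ks = keystream(len(data), **params)
    return bytes(d ^ k for d, k in zip(data, ks))


# Hypothetical flattened RGB pixel values for a 2x2 image.
pixels = bytes([255, 0, 0, 0, 255, 0, 0, 0, 255, 128, 128, 128])
cipher = xor_cipher(pixels)
recovered = xor_cipher(cipher)
```

Because XOR is its own inverse, applying `xor_cipher` twice with the same initial values and parameters restores the original pixels. The initial values and map parameters play the role of the secret key here.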

Authors (2)

Ibtisam A.

Sarab M.

Publication Date

Sun Jan 30 2022

Journal Name

Iraqi Journal Of Science

A New Image Encryption Algorithm Based on Multi Chaotic System

In recent years, encryption technology has developed rapidly and many image encryption methods have been put forward. The chaos-based image encryption technique is a modern encryption system for images. To encrypt images, it uses chaotic random sequences, which is an efficient way to solve the intractable problem of simple and highly protected image encryption. There are, however, some shortcomings in chaos-based image encryption, such as limited accuracy. The approach in this paper uses the chaotic system to construct a dynamic IP permutation and S-Box substitution through the following steps. First of all, use of a new IP table for more diffusion of al

Authors (2)

Azhaar Akram

Alaa Kadhim

Publication Date

Wed Sep 26 2018

Journal Name

Communications In Computer And Information Science

A New RGB Image Encryption Based on DNA Encoding and Multi-chaotic Maps

Authors (2)

Sarab

Ibtisam

Publication Date

Sun Apr 30 2023

Journal Name

Iraqi Journal Of Science

Fuzzy Based Clustering for Grayscale Image Steganalysis

Authors (3)

Sarab

Rasha

Baraa'

Publication Date

Thu Jan 20 2022

Journal Name

Webology

Hybrid Intrusion Detection System based on DNA Encoding, Teiresias Algorithm and Clustering Method

Until recently, researchers have utilized and applied various techniques for intrusion detection systems (IDS), including DNA encoding and clustering, which are widely used for this purpose. The other two major detection techniques are anomaly and misuse detection: anomaly detection is based on user behavior, while misuse detection is based on known attack signatures. However, both techniques have some drawbacks, such as a high false alarm rate. Therefore, a hybrid IDS combines the strengths of both techniques to overcome their limitations. In this paper, a hybrid IDS is proposed based on DNA encoding and a clustering method. The proposed DNA encoding is done based on the UNSW-NB15

Authors (2)

Omar Fitian

Mazin S.

Publication Date

Wed Oct 26 2022

Journal Name

Iraqi Journal Of Science

Gene Expression Analysis via Spatial Clustering and Evaluation Indexing

Density-based spatial clustering of applications with noise (DBSCAN) is one of the most popular clustering methods in data mining, and it is used to identify useful patterns and interesting distributions in the underlying data. Aggregation methods are used for classifying nonlinearly aggregated data, in particular DNA methylation and gene expression data, which are differentially skewed by site distance and grouped nonlinearly by cancer disease and by the changes in gene expression associated with it. Under these conditions, DBSCAN is expected to have a desirable clustering feature that can be used to show the results of the changes. This research reviews DBSCAN and compares its performance with other algorithms, such as the tradit

Authors (1)

Basad

Publication Date

Thu Jan 01 2015

Journal Name

Journal Of Theoretical And Applied Information Technology

Graph based text representation for document clustering

Advances in digital technology and the World Wide Web have led to an increase in digital documents used for various purposes such as publishing and digital libraries. This phenomenon raises awareness of the need for effective techniques that can help during the search and retrieval of text. One of the most needed tasks is clustering, which categorizes documents automatically into meaningful groups. Clustering is an important task in data mining and machine learning. The accuracy of clustering depends heavily on the selection of the text representation method. Traditional methods of text representation model documents as bags of words using term frequency-inverse document frequency (TF-IDF). This method ignores the relationship an

Authors (2)

Asma Khazaal Abdulsahib

Siti Sakira Kamaruddin

Publication Date

Thu May 28 2020

Journal Name

Iraqi Journal Of Science

Genetic Algorithm-Based Anisotropic Diffusion Filter and Clustering Algorithms for Thyroid Tumor Detection

Medical imaging is a technique used for the diagnosis and treatment of a large number of diseases. It has therefore become necessary to apply good image processing to extract the best possible results and information. In this study, genetic algorithm (GA)-based clustering techniques (K-means and Fuzzy C-Means (FCM)) were used to segment thyroid computed tomography (CT) images in order to extract thyroid tumors. The traditional GA, K-means, and FCM algorithms were applied separately to the original images and to the images enhanced with an Anisotropic Diffusion Filter (ADF). The resulting cluster centers from K-means and FCM were used as the initial population in the GA for the implementation of GAK-Mean and GAFCM. Jaccard index was used to s

Authors (1)

W. A.
