doi:10.24996/ijs.2022.63.7.37

Details

Publication Date

Sun Jul 31 2022

Journal Name

Iraqi Journal Of Science

Volume

63

Issue Number

7

DOI

10.24996/ijs.2022.63.7.37

Statistics

Abstract Views

322

Galley Views

96

A Review of Data Mining and Knowledge Discovery Approaches for Bioinformatics

Bioinformatics

Data Mining

Knowledge Discovery Database

Gene Ontology

Similarity Function

Fatin Kadhim

Suhad Faisal

This review explores the Knowledge Discovery Database (KDD) approach, which helps the bioinformatics domain progress efficiently, and illustrates its relationship with data mining. It is therefore important to identify the advantages of Data Mining (DM) strategies, such as their role in cost control, which is a principle of competitive intelligence, their role in information management, and their ability to discover hidden knowledge. However, there are many challenges, such as inaccurate or hand-written data and the need to analyze large amounts of varied information to extract useful knowledge with DM strategies. These strategies have been applied successfully in several areas, including data warehouses, predictive analytics, business intelligence, bioinformatics, and decision support systems. Many DM techniques are applied to disease diagnosis and treatment; for example, cancer has been investigated using Multilayer Perceptron, Naïve Bayes, Decision Tree, Simple Logistic, and K-Nearest Neighbor, as explored in this paper. As a future perspective, research is in progress on real Iraqi breast cancer data using data mining techniques, specifically the Decision Tree and K-Nearest Neighbor algorithms.

Publication Date

Tue May 30 2023

Journal Name

Iraqi Journal Of Science

Application of Data Mining and Imputation Algorithms for Missing Value Handling: A Study Case Car Evaluation Dataset

C5.0

k-NNI

Data Mining

Missing Value Handling

R Studio

Wahyu

Muhammad Fauzan Edy

Muhammad

Panca

Sholeh Hadi

Data mining is a data analysis process that uses software to find patterns or rules in large amounts of data, with the aim of providing knowledge to support decisions. However, missing values in data mining often lead to a loss of information. The purpose of this study is to improve the precision and accuracy of data classification when values are missing. Testing is carried out on the Car Evaluation dataset from the UCI Machine Learning Repository, using the RStudio and RapidMiner tools to run the algorithms. The study produces an analysis of the tested parameters to measure the performance of the algorithm. Using test variations: performance at C5.0, C4.5, and k-NN at 0% missi
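
The idea behind k-nearest-neighbour imputation (k-NNI) for categorical data can be sketched as follows. This is a minimal illustration in Python, not the authors' RStudio or RapidMiner workflow. The MISSING marker, the mismatch-count distance, the function names, and the toy rows are all assumptions made for the example, and the sketch assumes at least one complete row exists.

```python
from collections import Counter

MISSING = None  # assumed marker for a missing categorical value


def hamming(a, b):
    """Count mismatches over positions where both rows have a value."""
    return sum(
        1
        for x, y in zip(a, b)
        if x is not MISSING and y is not MISSING and x != y
    )


def knn_impute(rows, k=3):
    """Fill each missing cell with the most common value among the
    k nearest complete rows (nearest by the mismatch count above)."""
    complete = [r for r in rows if MISSING not in r]
    imputed = []
    for row in rows:
        if MISSING not in row:
            imputed.append(list(row))
            continue
        # Rank complete rows by distance to the incomplete row, keep k.
        neighbours = sorted(complete, key=lambda c: hamming(row, c))[:k]
        filled = [
            value
            if value is not MISSING
            else Counter(n[i] for n in neighbours).most_common(1)[0][0]
            for i, value in enumerate(row)
        ]
        imputed.append(filled)
    return imputed


# Hypothetical toy rows; these are not records from the Car Evaluation dataset.
toy_rows = [
    ["low", "high", "good"],
    ["low", "high", "good"],
    ["high", "low", "bad"],
    ["low", MISSING, "good"],
]
filled_rows = knn_impute(toy_rows, k=2)
```

A classifier such as a decision tree would then be trained on the filled rows; the choice of k and of the distance measure are tuning decisions that this sketch does not address.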

Publication Date

Mon Oct 30 2023

Journal Name

Traitement Du Signal

A Comprehensive Review on Machine Learning Approaches for Enhancing Human Speech Recognition

Maha

Husam

Publication Date

Tue Dec 20 2022

Journal Name

2022 International Conference On Computer And Applications (icca)

Improve Data Mining Techniques with a High-Performance Cluster

Fadhil H.M.

Publication Date

Mon Apr 11 2011

Journal Name

Icgst

Employing Neural Network and Naive Bayesian Classifier in Mining Data for Car Evaluation

Data mining

Backpropagation Neural Network

Naïve Bayesian Classifier

Classification

Sarmad

Aida

Junaidah

Ealaf

Mohammed

In data mining, classification is a form of data analysis that can be used to extract models describing important data classes. Two of the well-known algorithms used in data mining classification are Backpropagation Neural Network (BNN) and Naïve Bayesian (NB). This paper investigates the performance of these two classification methods using the Car Evaluation dataset. Two models were built for both algorithms and the results were compared. Our experimental results indicated that the BNN classifier yields higher accuracy than the NB classifier, but it is less efficient because it is time-consuming and difficult to analyze due to its black-box implementation.

Publication Date

Fri Sep 30 2022

Journal Name

Iraqi Journal Of Science

Educational Data Mining For Predicting Academic Student Performance Using Active Classification

Educational Data Mining

Active classification

Students’ Prediction

Feature Importance

Random Forest

Multilayer Perceptron

Rasha H.

The amount of educational data has increased rapidly in the last few years. Educational Data Mining (EDM) techniques are used to detect valuable patterns in order to improve the educational process and to obtain high performance from all educational elements. The proposed work contains three stages: preprocessing, feature selection, and active classification. The dataset, collected using EDM, lacked labeled data; it contained 2050 records collected through questionnaires and the students’ academic records. There are twenty-five features, combined from the following five factors: curriculum, teacher, student, the educational environment, and the family. Active learning ha

Publication Date

Fri Apr 26 2019

Journal Name

Journal Of Contemporary Medical Sciences

Breast Cancer Decisive Parameters for Iraqi Women via Data Mining Techniques

CA 15-3

CEA

Breast Cancer

Saliva

MLP

SLR

J48

data mining

OneR

Iraq

Suhad Faisal

Mustafa S.

Iyden Kamil

Maha Mohammed

Objective: This research investigates real breast cancer data for Iraqi women, acquired manually from several Iraqi hospitals for the early detection of breast cancer. Data mining techniques are used to discover hidden knowledge, unexpected patterns, and new rules from the dataset, which contains a large number of attributes. Methods: Data mining techniques handle redundant or simply irrelevant attributes to discover interesting patterns. The dataset is processed via the Weka (Waikato Environment for Knowledge Analysis) platform. The OneR technique is used as a machine learning classifier to evaluate the worth of each attribute according to the class value. Results: The evaluation is performed using
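
The OneR idea mentioned in the Methods can be sketched in a few lines: for each attribute, build a rule that predicts the majority class for each attribute value, then rank attributes by how accurate that single rule is. The Python sketch below is a simplified illustration under stated assumptions, not the Weka implementation used in the study. The function name, the attribute values, and the labels are hypothetical, and numeric attributes (which Weka discretizes first) are not handled.

```python
from collections import Counter, defaultdict


def one_r(rows, labels):
    """Return (accuracy, attribute index, rule) for the best
    single-attribute rule on categorical data."""
    best = None
    for i in range(len(rows[0])):
        # Tally class labels for every value this attribute takes.
        counts = defaultdict(Counter)
        for row, label in zip(rows, labels):
            counts[row[i]][label] += 1
        # The rule predicts the majority class for each attribute value.
        rule = {value: c.most_common(1)[0][0] for value, c in counts.items()}
        # Training accuracy: rows that fall in the majority class of their value.
        correct = sum(c.most_common(1)[0][1] for c in counts.values())
        accuracy = correct / len(rows)
        if best is None or accuracy > best[0]:
            best = (accuracy, i, rule)
    return best


# Hypothetical toy data; these are not values from the study's dataset.
toy_rows = [("high", "yes"), ("high", "no"), ("low", "yes"), ("low", "no")]
toy_labels = ["pos", "pos", "neg", "neg"]
accuracy, attribute_index, rule = one_r(toy_rows, toy_labels)
```

Here the accuracy of each attribute's rule serves as a simple measure of that attribute's worth with respect to the class; it is computed on the training rows only, so a held-out evaluation would be needed before drawing conclusions.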

Publication Date

Sat Oct 01 2022

Journal Name

Baghdad Science Journal

A Crime Data Analysis of Prediction Based on Classification Approaches

Crime

Crime Prediction

Decision Tree

Logistic Regression

Naïve Bayes

Fatima Shaker

Abbas Fadhil

Crime is unlawful activity of any kind and is punishable by law. Crimes affect a society's quality of life and economic development. With a large rise in crime globally, there is a need to analyze crime data to bring down the crime rate. This encourages the police and the public to take the required measures and to restrict crime more effectively. The purpose of this research is to develop predictive models that can aid in crime pattern analysis and thus support the Boston department's crime prevention efforts. The geographical location factor has been adopted in our model because it is an influential factor in several situations, whether it is traveling to a specific area or livin

Publication Date

Thu Mar 30 2023

Journal Name

Iraqi Journal Of Computer, Communication, Control And System Engineering

Data Analytics and Blockchain: A Review

Safa S.

Alaa K.

Rana F.

Blockchain technology relies on cryptographic techniques that provide various advantages, such as trustworthiness, collaboration, organization, identification, integrity, and transparency. Meanwhile, data analytics refers to the process of utilizing techniques to analyze big data and comprehend the relationships between data points to draw meaningful conclusions. The field of data analytics in Blockchain is relatively new, and few studies have been conducted to examine the challenges involved in Blockchain data analytics. This article presents a systematic analysis of how data analytics affects Blockchain performance, with the aim of investigating the current state of Blockchain-based data analytics techniques in research fields and

Publication Date

Tue Jan 18 2022

Journal Name

Iraqi Journal Of Science

Proposed Approach for Analysing General Hygiene Information Using Various Data Mining Algorithms

Data Mining

Association Rule

Apriori

Naïve Bayesian

Hygiene Information

Tareef K.

Medical fields and computer science are often combined to produce impressive results in both fields, using applications, programs, and algorithms provided by the data mining field. The title of the present research contains the term hygiene, which may be described as the principle of maintaining the cleanliness of the external body. Environmental hygienic hazards can present themselves in various media, e.g. air, water, and soil. The influence they can exert on our health is very complex and may be modulated by our genetic makeup, psychological factors, and our perceptions of the risks that they present. Our main concern in this research is not to improve general health but rather to propose a data mining approach

Publication Date

Wed Aug 31 2022

Journal Name

Iraqi Journal Of Science

Data Mining Methods for Extracting Rumors Using Social Analysis Tools

Machine learning

Text classification

Naïve Bayes

RF

KNN

DT

Natural language processing

SGD

Manahil

Abdulkareem Merhej

Rumors are typically described as remarks whose truth value is unknown. A rumor on social media has the potential to spread erroneous information to a large group of individuals, and such false information can influence decision-making in a variety of societies. In online social media, where enormous amounts of information are easily distributed over a large network of sources with unverified authority, detecting rumors is critical. This research proposes that rumor detection be done using Natural Language Processing (NLP) tools as well as six distinct Machine Learning (ML) methods (Naïve Bayes (NB), Random Forest (RF), K-Nearest Neighbor (KNN), Logistic Regression (LR), Stochastic Gradient Descent (SGD) and Decision Tree (
