Data mining is a data analysis process that uses software to find patterns or rules in large amounts of data, with the aim of providing knowledge to support decisions. However, missing values often lead to a loss of information. The purpose of this study is to improve the accuracy of classification on data with missing values. Testing is carried out on the Car Evaluation dataset from the UCI Machine Learning Repository, using the RStudio and RapidMiner tools. The study analyzes the tested parameters to measure algorithm performance across four test variations: C5.0, C4.5, and k-NN at a 0% missing rate; C5.0, C4.5, and k-NN at 5–50% missing rates; C5.0 + k-NNI, C4.5 + k-NNI, and k-NN + k-NNI at 5–50% missing rates; and C5.0 + CMI, C4.5 + CMI, and k-NN + CMI at 5–50% missing rates. The results show that C5.0 with k-NNI produces better classification accuracy than the other tested imputation and classification algorithms. For example, with 35% of the dataset missing, this method obtains 93.40% validation accuracy and 92% test accuracy. C5.0 with k-NNI also offers fast processing times compared with the other methods.
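As a rough illustration of the k-NN imputation (k-NNI) idea mentioned in the abstract above, the following is a minimal Python sketch, not the RStudio or RapidMiner procedure used in the study. It assumes categorical attributes (as in the Car Evaluation dataset), a mismatch-count distance over attributes observed in both rows, and `None` as the missing-value marker; the names `knn_impute` and `mismatches` are hypothetical.

```python
from collections import Counter

MISSING = None  # assumed marker for a missing value


def mismatches(a, b):
    """Count differing attributes, ignoring positions missing in either row."""
    return sum(
        1
        for x, y in zip(a, b)
        if x is not MISSING and y is not MISSING and x != y
    )


def knn_impute(rows, k=3):
    """Fill each missing cell with the most common value of that attribute
    among the k nearest rows that have the attribute observed."""
    imputed = [list(r) for r in rows]
    for i, row in enumerate(rows):
        for j, value in enumerate(row):
            if value is not MISSING:
                continue
            # Donors: other rows in which attribute j is observed.
            donors = [r for m, r in enumerate(rows) if m != i and r[j] is not MISSING]
            if not donors:
                continue  # nothing to borrow from; leave the gap
            donors.sort(key=lambda r: mismatches(row, r))
            votes = Counter(r[j] for r in donors[:k])
            imputed[i][j] = votes.most_common(1)[0][0]
    return imputed


# Toy data (invented for illustration, not taken from the study).
toy = [
    ("high", "low", "acc"),
    ("high", "low", "acc"),
    ("low", "high", "unacc"),
    ("low", "high", "unacc"),
    ("high", "low", MISSING),
]
filled = knn_impute(toy, k=2)
```

The imputed table can then be passed to any classifier. Distances here come from the original rows only, so one imputed value never influences another; this is a design choice of the sketch, not something the abstract specifies.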
Background: Successful cleaning and shaping of root canals requires high volumes of irrigation solution, which can reach the apical third of the root canal only after enlargement with instruments. The aim of this study was therefore to evaluate and compare the efficiency of the Maxi-I-probe (side-vented needle) in terms of root canal irrigant penetration for five master apical file (MAF) sizes and four degrees of coronal and middle third flaring. Materials and Methods: Two hundred resin blocks with simulated root canals were used in this study and divided into 5 major groups (40 each) based on the size of the master apical file (#20, #25, #30, #35, and #40). Each major group was subdivided into 4 subgroups depending
Solar energy is the major source of power for the future and an important source of renewable energy in Iraq and the world. Iraq has suitable climate conditions for solar energy, especially the high temperatures of the summer season, which extends for more than six months of the year. Hence, global solar radiation is abundant and of high intensity, which is essential for applicable models used by researchers and in solar applications. Therefore, nine first-order empirical regression equations of Angstrom-type correlations were used to identify the most appropriate global solar radiation model for Baghdad city. Two equations were developed empirically in this work, using the most readily available meteorological data
This research aims to study and analyze the theoretical framework of environmental auditing in the industrial environment, given its broad and dangerous environmental effects. It aims to contribute to setting and testing a proposed procedural framework for environmental auditing in that vital activity. The practical aspect focused on testing the proposed framework by applying it in an Iraqi industrial company that has a large effect on the environment, represented by Iraqi state company
Accounting disclosure for non-current intangible assets is necessary for decision makers in the economic unit to rely on accounting information. Two international accounting standards (IAS 16 and IAS 36) were issued with the aim of providing the foundations for the recognition, measurement, and disclosure of non-current tangible assets. IAS 16 allows the revaluation approach to be used for measuring assets, because the accounting information produced by applying the historical cost approach is inadequate under increasing and continuing technical developments that leave clear effects on non-current intangible assets, as well as the requirements of IAS 36 on the importance of accounting for the impairment
Structure type and disorder have become important questions in catalyst design, with the most active catalysts often noted to be “disordered” or “amorphous” in nature. To quantify the effects of disorder and structure type systematically, a test set of manganese(III,IV) oxides was developed and their reactivity as oxidants and catalysts tested against three substrates: methylene blue, hydrogen peroxide, and water. We find that disorder destabilizes the materials thermodynamically, making them stronger chemical oxidants but not necessarily better catalysts. For the disproportionation of H2O2 and the oxidative decomposition of methylene blue, MnOx-mediated direct oxidation competes with catalytically mediated oxidation, making the most
The Hartha Formation is an overburdened horizon in the X-oilfield that generates a lot of Non-Productive Time (NPT) associated with drilling mud losses. This study was conducted to investigate the loss events in this formation and to provide geological interpretations based on datasets from nine wells in the field. The interpretation was based on several analyses, including wireline logs, cuttings descriptions, image logs, and analog data. Seismic and coherency data were also used to formulate the geological interpretations and calibrate them against the loss events of the Hartha Fm.
The results revealed that the upper part of the Hartha Fm. was identified as an interval capable of creating potentia
We studied the effect of Ca doping on the properties of Bi-based superconductors by adding different amounts of CaO to the Bi2Sr2La2-xCaxCu3O10+δ compound. Consequently, we obtained three samples, A, B, and C, with x = 0.0, 0.4, and 0.8, respectively. The usual solid-state reaction method was applied under optimum conditions. X-ray diffraction analysis showed that samples A and B have tetragonal structures, whereas sample C has an orthorhombic structure. In addition, XRD analysis showed that the c-axis lattice constant, and thus the ratio c/a, decreases across samples A, B, and C, respectively. X-ray fluorescence proved that the compositions of samples A, B, and C with the ra
Big data analysis has important applications in many areas, such as sensor networks and connected healthcare. The high volume and velocity of big data bring many challenges to data analysis. One possible solution is to summarize the data and provide a manageable data structure that holds a scalable summarization of the data for efficient and effective analysis. This research extends our previous work on developing an effective technique to create, organize, access, and maintain summarizations of big data, and develops algorithms for Bayes classification and entropy discretization of large data sets using the multi-resolution data summarization structure. Bayes classification and data discretization play essential roles in many learning algorithms such a
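To make the notion of entropy discretization in the abstract above concrete, here is a minimal Python sketch of choosing one binary cut point on a numeric attribute by minimizing the weighted class entropy of the two resulting intervals. It works directly on raw values, which is an assumption made for illustration; the abstract's algorithms operate on a multi-resolution summarization structure that is not described here. The function names `entropy` and `best_cut` are hypothetical.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy, in bits, of a non-empty list of class labels."""
    n = len(labels)
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())


def best_cut(values, labels):
    """Return (cut, weighted_entropy) for the best binary split,
    or None when all values are equal and no split exists."""
    pairs = sorted(zip(values, labels))
    n = len(pairs)
    best = None
    for i in range(1, n):
        if pairs[i - 1][0] == pairs[i][0]:
            continue  # cannot place a cut between equal values
        left = [lab for _, lab in pairs[:i]]
        right = [lab for _, lab in pairs[i:]]
        score = (len(left) * entropy(left) + len(right) * entropy(right)) / n
        if best is None or score < best[1]:
            # Cut at the midpoint between the two neighbouring values.
            best = ((pairs[i - 1][0] + pairs[i][0]) / 2, score)
    return best


# Toy data (invented for illustration).
cut, score = best_cut([1, 2, 3, 10, 11, 12], ["a", "a", "a", "b", "b", "b"])
```

Applying `best_cut` recursively to each resulting interval, with a stopping rule, yields a multi-interval discretization; the stopping rule is left out of this sketch.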
A three-dimensional (3D) model extraction represents the best way to reflect reality in full detail. This explains the tendency of many scientific disciplines to make measurements, calculations, and monitoring in various fields using such models. Although there are many ways to produce a 3D model, such as images, integration techniques, and laser scanning, the quality of their products differs in accuracy and detail. This article aims to assess the accuracy of 3D point cloud models derived from close-range images and laser scan data, using Agisoft PhotoScan and CloudCompare software, to determine the compatibility of both datasets for several applications. College of Scien