Data mining is a data analysis process using software to find certain patterns or rules in a large amount of data, which is expected to provide knowledge to support decisions. However, missing value in data mining often leads to a loss of information. The purpose of this study is to improve the performance of data classification with missing values, precisely and accurately. The test method is carried out using the Car Evaluation dataset from the UCI Machine Learning Repository. RStudio and RapidMiner tools were used for testing the algorithm. This study will result in a data analysis of the tested parameters to measure the performance of the algorithm. Using test variations: performance at C5.0, C4.5, and k-NN at 0% missing rate, performance at C5.0, C4.5, and k-NN at 5–50% missing rate, performance at C5.0 + k-NNI, C4.5 + k-NNI, and k-NN + k-NNI classifier at 5–50% missing rate, and performance at C5.0 + CMI, C4.5 + CMI, and k-NN + CMI classifier at 5–50% missing rate, The results show that C5.0 with k-NNI produces better classification accuracy than other tested imputation and classification algorithms. For example, with 35% of the dataset missing, this method obtains 93.40% validation accuracy and 92% test accuracy. C5.0 with k-NNI also offers fast processing times compared with other methods.
Text categorization refers to the process of grouping text or documents into classes or categories according to their content. Text categorization process consists of three phases which are: preprocessing, feature extraction and classification. In comparison to the English language, just few studies have been done to categorize and classify the Arabic language. For a variety of applications, such as text classification and clustering, Arabic text representation is a difficult task because Arabic language is noted for its richness, diversity, and complicated morphology. This paper presents a comprehensive analysis and a comparison for researchers in the last five years based on the dataset, year, algorithms and the accu
... Show MoreBlockchain technology relies on cryptographic techniques that provide various advantages, such as trustworthiness, collaboration, organization, identification, integrity, and transparency. Meanwhile, data analytics refers to the process of utilizing techniques to analyze big data and comprehend the relationships between data points to draw meaningful conclusions. The field of data analytics in Blockchain is relatively new, and few studies have been conducted to examine the challenges involved in Blockchain data analytics. This article presents a systematic analysis of how data analytics affects Blockchain performance, with the aim of investigating the current state of Blockchain-based data analytics techniques in research fields and
... Show MoreThe research aims to presenting a number of scenarios for the investment of the marshes. The problem of research problem was that there is no in-depth analysis of the marshes environment. The traditional methods of the environmental analysis are insufficient. The research community is represented by the decision makers in Maysan Governorate. The research led to proposing of three scenarios with statement the requirements for the success of each one. The most important conclusions are that the three proposed scenarios for marshes investment depend on the availability of the required volunteers for each scenario. The higher the availability of the requirements, the more optimistic the scenario becomes. If t
... Show MoreThe research aims to develop alternatives to transportation at the entrance to the Educational City (University of Baghdad) during the morning and evening peaks, which result from of the traffic congestion at the entrances to the educational city (the University of Baghdad), and affects the emotional, functional, and social performance of the whole city, and leads to hotbeds of confluence and congestion at the entrances in the morning and evening peaks. This movement was measured on the ground for pedestrians and vehicles. Some criteria were adopted to determine the density of road length to the area and density of roads for the number of users and the rate of the area served by roads. The research reviews the experiences of some
... Show MoreThe research discusses the public relations services, registration, and academic advising at Petra University for the years 2013-2014. Using a field study and surveying Petra University students to be informed about the services and to cover the tiny details that have to do with public relations role in the university as a specialized department interested in serving public and gaining their trust in terms of what is legal and possible to build and enhance the university reputation. And gain mutual trust between the university and its publics.
The public relations is consi
Today, problems of spatial data integration have been further complicated by the rapid development in communication technologies and the increasing amount of available data sources on the World Wide Web. Thus, web-based geospatial data sources can be managed by different communities and the data themselves can vary in respect to quality, coverage, and purpose. Integrating such multiple geospatial datasets remains a challenge for geospatial data consumers. This paper concentrates on the integration of geometric and classification schemes for official data, such as Ordnance Survey (OS) national mapping data, with volunteered geographic information (VGI) data, such as the data derived from the OpenStreetMap (OSM) project. Useful descriptions o
... Show MoreThe major climate changes that have affected the planet in addition to wave the big drought plaguing the study area, including the lack of water for imports Badra River fatigue because of the Iran constructing dams on this river and make use of the waters for the benefitof its territory. The subject of finding sources of water has become available with the possibility of exploiting them in an exemplary manner is one of the key things in order to be exploited somewhere.
The study area was chosen within the eastern border of the province of Wasit within the district of Badra border, an area of (1557.5 km2) almost "to study the characteristics of hydrological and identify possibilities for water harvesting them. In this study was conduct
A network (or formally a graph) can be described by a set of nodes and a set of edges connecting these nodes. Networks model many real-world phenomena in various research domains, such as biology, engineering and sociology. Community mining is discovering the groups in a network where individuals group of membership are not explicitly given. Detecting natural divisions in such complex networks is proved to be extremely NP-hard problem that recently enjoyed a considerable interest. Among the proposed methods, the field of evolutionary algorithms (EAs) takes a remarkable interest. To this end, the aim of this paper is to present the general statement of community detection problem in social networks. Then, it visits the problem as an optim
... Show MoreDust storms are among the most important weather phenomena in Middle East. The Shamal dust storms are dominated across Iraq and the whole Middle East, especially in summer. However, frontal type of dust storms is possible in winter and spring. In this research, a comprehensive case study was conducted to a dust storm that occurred on 20 March 2016 from many perspectives: synoptic, satellite imagery, dust concentration analysis, visibility reduction, and aerosol optical depth. The study shows that the dust storm initiated inside Syria and moved eastward with the movement of the front. Dust concentrations and aerosol optical depth were also discussed that simulate the dust storm over Iraq in a reasonable way with some differences. The dust
... Show MoreDue to increased consumption of resources, especially energy it was necessary to find alternatives characterized by the same quality as well as being of less expensive, and most important of these alternatives are characterized by waste and the fact that humancannot stop consumption. So we have consideredwaste as an alternative and cheap economic resources and by using environmental index the MIP (input materials per unit ,unit / service) is based on the grounds that the product is not the end of itselfit is a product to meet the need of a product or service, awarded a resource input and output within the five basic elements are the raw materials is ecological, Raw materials ecological, water, air and soil erosion for a
... Show More