Data mining is a data analysis process using software to find certain patterns or rules in a large amount of data, which is expected to provide knowledge to support decisions. However, missing value in data mining often leads to a loss of information. The purpose of this study is to improve the performance of data classification with missing values, precisely and accurately. The test method is carried out using the Car Evaluation dataset from the UCI Machine Learning Repository. RStudio and RapidMiner tools were used for testing the algorithm. This study will result in a data analysis of the tested parameters to measure the performance of the algorithm. Using test variations: performance at C5.0, C4.5, and k-NN at 0% missing rate, performance at C5.0, C4.5, and k-NN at 5–50% missing rate, performance at C5.0 + k-NNI, C4.5 + k-NNI, and k-NN + k-NNI classifier at 5–50% missing rate, and performance at C5.0 + CMI, C4.5 + CMI, and k-NN + CMI classifier at 5–50% missing rate, The results show that C5.0 with k-NNI produces better classification accuracy than other tested imputation and classification algorithms. For example, with 35% of the dataset missing, this method obtains 93.40% validation accuracy and 92% test accuracy. C5.0 with k-NNI also offers fast processing times compared with other methods.
Abstract
This study aims to identify the extent to which the criteria of the American Council for Teaching Foreign Languages (ACTFL) are included in the English language books for the fifth and sixth graders. To achieve the objective of the study, a content analysis card was prepared, where the classification of language proficiencies was divided into five main levels (beginner, intermediate, advanced, superior, and distinguished) of the four language skills (listening, speaking, reading, and writing), The content analysis card consisted of (89) indicators distributed at the four levels of language skills as follows: Listening (17), speaking (33), reading (15), and writing (26). The study sample consisted of Engl
... Show MoreThere are many problems facing the economic entities as a result of its mass production &variation of its products , the matter which had increased the need & importance of cost accounting which is regarded a main tool for the managerial control.
The actual costing system is unable to meet the contemporary management needs ,so the Standard costing system appear to provide the management with required information to perform its functions by the best use& way.
This research aims to determine the standard cost for the direct material for oil extraction activity by applying it in the north oil company.
When it comes to changing one's profession after a several years of professional experience in a given field, It must be admitted that the person who adopts this approach is very persevering in the quest for self-realization. Thus, the search for improvement of the working environment and its conditions are often behind this type of choice. To try to understand this phenomenon of professional mobility, we conducted a study that concerning 1st cycle secondary school teachers who became guidance counselors and we focus on two parameters concerning this population: the causes that led them make this choice, and their socio-cultural characteristics (entry profiles). This information was collected via a questionary administered to all active
... Show MoreThis paper deals with testing a numerical solution for the discrete classical optimal control problem governed by a linear hyperbolic boundary value problem with variable coefficients. When the discrete classical control is fixed, the proof of the existence and uniqueness theorem for the discrete solution of the discrete weak form is achieved. The existence theorem for the discrete classical optimal control and the necessary conditions for optimality of the problem are proved under suitable assumptions. The discrete classical optimal control problem (DCOCP) is solved by using the mixed Galerkin finite element method to find the solution of the discrete weak form (discrete state). Also, it is used to find the solution for the discrete adj
... Show MoreText documents are unstructured and high dimensional. Effective feature selection is required to select the most important and significant feature from the sparse feature space. Thus, this paper proposed an embedded feature selection technique based on Term Frequency-Inverse Document Frequency (TF-IDF) and Support Vector Machine-Recursive Feature Elimination (SVM-RFE) for unstructured and high dimensional text classificationhis technique has the ability to measure the feature’s importance in a high-dimensional text document. In addition, it aims to increase the efficiency of the feature selection. Hence, obtaining a promising text classification accuracy. TF-IDF act as a filter approach which measures features importance of the te
... Show MoreBackground: Apicoectomy and retrograde filling is indicated when conventional endodontic treatment is impossible or failed to achieve apical seal. The aim of this study was to evaluate the effect of ER: YAG laser on apical microleakage. Materials and Methods: Sixty extracted single-rooted teeth were used in this study. The roots were divided into six groups. Group 1: apicoectomy by fissure bur, and apical cavities prepared by round bur, then cavities were filled with MTA. Group 2: the roots preparations and fillings were the same as group 1, then the apical areas were treated by Er:YAG Laser. Group 3: apicoectomy by fissure bur, and apical cavities prepared by ultrasound retrotip and cavities were filled with MTA. Group 4: the roots prepara
... Show MoreThe equation of Kepler is used to solve different problems associated with celestial mechanics and the dynamics of the orbit. It is an exact explanation for the movement of any two bodies in space under the effect of gravity. This equation represents the body in space in terms of polar coordinates; thus, it can also specify the time required for the body to complete its period along the orbit around another body. This paper is a review for previously published papers related to solve Kepler’s equation and eccentric anomaly. It aims to collect and assess changed iterative initial values for eccentric anomaly for forty previous years. Those initial values are tested to select the finest one based on the number of iterations, as well as the
... Show MoreThe education sector suffers from many problems, including the scarcity of schools that can absorb the increasing number of students in light of the increasing population growth rate, as some regions suffer from a lack of opening of new schools or the expansion of existing schools to increase their capacity so that attention is required. The research sought to identify the level of maturity of project management at the research site (Building Department in Al-Karkh I/ Ministry of Education) Being responsible for educational projects and their implementation and to know that, the ten areas of the knowledge guide to project management PMBOK have been adopted according to the PM3 model (one of the models of maturity
... Show MoreIn this study, two types of local plants were chosen, the first is the plant golden pothos Epipremnum aureum and the second is the Iraqi Sheikh's chin plant Tribulus terrestris L, for the purpose of making a comparison between them in terms of their possession of chemical groups with antioxidant activity in order to use them as a natural alternative to using antioxidants Industrial that cause negative effects on human health, the samples were prepared using the method of water and alcohol extraction (ethanol 70%) for both plants. It revealed the presence of a number of chemical groups (tannins, carbohydrates, phenols, flavonoids, alkaloids) for both plants, the aqueous and alcoholic extracts. Coumarins are only found in the sheikh's chin pl
... Show MoreOne of the most important phenomena facing the athlete is the anxiety of sports competition, as he faces many psychological problems during training and in competitions of psychological tension, fear and anxiety that accompany him sometimes, which leads to affecting his level, and sports competition anxiety is a special type of anxiety that occurs in the athlete It is related to the attitudes of sports competitions and that participation in sports competitions and the associated emotional experiences are among the important factors that motivate the practice of sports activity and try to advance and develop his sports level. It is assumed that when the individual begins to practice any activity, he aims to reach a level or degree of achie
... Show More