Data mining is a data analysis process using software to find certain patterns or rules in a large amount of data, which is expected to provide knowledge to support decisions. However, missing value in data mining often leads to a loss of information. The purpose of this study is to improve the performance of data classification with missing values, precisely and accurately. The test method is carried out using the Car Evaluation dataset from the UCI Machine Learning Repository. RStudio and RapidMiner tools were used for testing the algorithm. This study will result in a data analysis of the tested parameters to measure the performance of the algorithm. Using test variations: performance at C5.0, C4.5, and k-NN at 0% missing rate, performance at C5.0, C4.5, and k-NN at 5–50% missing rate, performance at C5.0 + k-NNI, C4.5 + k-NNI, and k-NN + k-NNI classifier at 5–50% missing rate, and performance at C5.0 + CMI, C4.5 + CMI, and k-NN + CMI classifier at 5–50% missing rate, The results show that C5.0 with k-NNI produces better classification accuracy than other tested imputation and classification algorithms. For example, with 35% of the dataset missing, this method obtains 93.40% validation accuracy and 92% test accuracy. C5.0 with k-NNI also offers fast processing times compared with other methods.
Abstract:
The distribution or retention of profits is the third decision among financial management decisions in terms of priority, whether at the level of theory or practice, as the issue of distribution or retention is multi-party in terms of influence and impact, as determining the optimal percentage for each component is still the subject of intellectual debate because these decisions are linked to the future of the organization and several considerations, The research focus on the nature of the policies followed by the Iraqi banking sector As the sample chosen by the intentional sampling method was represented by the Commercial Bank of
... Show MoreThe Wang-Ball polynomials operational matrices of the derivatives are used in this study to solve singular perturbed second-order differential equations (SPSODEs) with boundary conditions. Using the matrix of Wang-Ball polynomials, the main singular perturbation problem is converted into linear algebraic equation systems. The coefficients of the required approximate solution are obtained from the solution of this system. The residual correction approach was also used to improve an error, and the results were compared to other reported numerical methods. Several examples are used to illustrate both the reliability and usefulness of the Wang-Ball operational matrices. The Wang Ball approach has the ability to improve the outcomes by minimi
... Show MoreIn this paper we estimate the coefficients and scale parameter in linear regression model depending on the residuals are of type 1 of extreme value distribution for the largest values . This can be regard as an improvement for the studies with the smallest values . We study two estimation methods ( OLS & MLE ) where we resort to Newton – Raphson (NR) and Fisher Scoring methods to get MLE estimate because the difficulty of using the usual approach with MLE . The relative efficiency criterion is considered beside to the statistical inference procedures for the extreme value regression model of type 1 for largest values . Confidence interval , hypothesis testing for both scale parameter and regression coefficients
... Show MoreThe current study aims to assess the water quality of the Al-Diwaniyah River in the city of Al-Diwaniyah to drink in terms of chemical properties and heavy metals and their impact on the health of the local population. The results showed that most of the parameters in the river water are of low concentrations due to the limited human activities in polluting the river water. The study concluded that the water quality is suitable for drinking depending on major cations and anions in all seasons. The Heavy Metal Pollution Index (HPI) showed that the river water was clean and safe, except two slightly polluted samples. The study concluded that river water for drinking or various domestic uses does not pose any danger to human heal
... Show MoreA case of angiolymphoid hyperplasia with eosinophilia (ALH) is reported in a 42-year-old woman who developed multiple nodules behind the ear. Angiolymphoid hyperplasia with eosinophilia usually occurs on the head and neck of young adults and is more common in women than in men. Characteristic histologic features of ALH present in this case included proliferation of thick-walled blood vessels lined by prominent endothelial cells, infiltration of the interstitium by chronic inflammatory cells (mainly eosinophils), and presence of lymphoid follicles with germinal centers. The patient referred for surgeon for complete excision. in this context , cases previously described in the literature, and the differential diagnosis of ALH are discussed
... Show MorePhytoplankton community is a model for of monitoring aquatic systems and interpreting the environmental change in aquatic systems. The present study aimed to forecast environmental parameters that drive the change of phytoplankton community structure in the lake. The present study was carried out in Baghdad Tourist Island Lake (BTIL) for the period From October 2021 to May 2022. The study included the quality and quantity of phytoplankton, moreover, the highest and lowest value of the physical and chemical parameters were (Water temperature (13-30 °C), Light penetration (94-275cm), electric conductivity (837-1128 µS/cm), salinity (0.5-0.7 ‰), pH (7-8.2), total alkalinity (126-226 mg CaCO3/L), total Hardness (297-395 mg CaCO3/L), Ca
... Show MoreThe organizational culture is considered as an important topic. In this research, this topic was studied in modern paints Industries Company to assess its role in job performance and to show if there is this relationship between them or no. it is, also, attempted to measure this strength of this relationship if any. The 40 cases research sample was chosen. This sample included the chief executive, his assistants, key managers, and their assistants. The questioner consists of two sets of questions : the first set ( concerning the organizational culture) covers six variables (Physical structures , Symbols
... Show MoreTransportability refers to the ease with which people, goods, or services may be transferred. When transportability is high, distance becomes less of a limitation for activities. Transportation networks are frequently represented by a set of locations and a set of links that indicate the connections between those places which is usually called network topology. Hence, each transmission network has a unique topology that distinguishes its structure. The most essential components of such a framework are the network architecture and the connection level. This research aims to demonstrate the efficiency of the road network in the Al-Karrada area which is located in the Baghdad city. The analysis based on a quantitative evaluation using graph th
... Show MorePhytoplankton community is a model for of monitoring aquatic systems and interpreting the environmental change in aquatic systems. The present study aimed to forecast environmental parameters that drive the change of phytoplankton community structure in the lake. The present study was carried out in Baghdad Tourist Island Lake (BTIL) for the period From October 2021 to May 2022. The study included the quality and quantity of phytoplankton, moreover, the highest and lowest value of the physical and chemical parameters were (Water temperature (13-30 °C), Light penetration (94-275cm), electric conductivity (837-1128 µS/cm), salinity (0.5-0.7 ‰), pH (7-8.2), total alkalinity (126-226 mg CaCO3/L), total Hardness (297-395 mg CaCO3/L
... Show More