Big data analysis has important applications in many areas such as sensor networks and connected healthcare. High volume and velocity of big data bring many challenges to data analysis. One possible solution is to summarize the data and provides a manageable data structure to hold a scalable summarization of data for efficient and effective analysis. This research extends our previous work on developing an effective technique to create, organize, access, and maintain summarization of big data and develops algorithms for Bayes classification and entropy discretization of large data sets using the multi-resolution data summarization structure. Bayes classification and data discretization play essential roles in many learning algorithms such as decision tree and nearest neighbor search. The proposed method can handle streaming data efficiently and, for entropy discretization, provide su the optimal split value.
The Land Use/ Land Cover (LULC) is an essential application in many remotely sensed projects and problems. Land use is simply man-made objects such as urban, road complex targets, etc., while land covers are defined as any target and phenomenon that appear neutral. The LULC study is essential for all current and future engineering projects, as it shows the nature of the land's components, which is evident in studying and modernizing residential areas. One of the essential operations for studying LULC is the heterogeneity detection and classification calculations of satellite images and topographic maps. A part of the Baghdad, Iraq region was selected for the Landsat satellite group at different periods to detect variance and mak
... Show MoreThree Seismic Attributes are used to enhance or delineate geologic feature that cannot be detected within seismic resolution limit. These are Instantaneous Amplitude, Instantaneous Phase and Instantaneous Frequency Attributes. These are applied along two defined picked surface horizons within 3D seismic data for an area in southern Iraq. Two geologic features are deduced, the first represents complex channel system at the top of Saadi Formation and the second represents submarine fan within Mishrif Formation. The semblances of these ancient geological features are dramatically enhanced by using flattening technique.
This paper aims to evaluate large-scale water treatment plants’ performance and demonstrate that it can produce high-level effluent water. Raw water and treated water parameters of a large monitoring databank from 2016 to 2019, from eight water treatment plants located at different parts in Baghdad city, were analyzed using nonparametric and multivariate statistical tools such as principal component analysis (PCA) and hierarchical cluster analysis (HCA). The plants are Al-Karkh, Sharq-Dijlah, Al-Wathba, Al-Qadisiya Al-Karama, Al-Dora, Al-Rasheed, Al-Wehda. PCA extracted six factors as the most significant water quality parameters that can be used to evaluate the variation in drinkin
The paired sample t-test for testing the difference between two means in paired data is not robust against the violation of the normality assumption. In this paper, some alternative robust tests have been suggested by using the bootstrap method in addition to combining the bootstrap method with the W.M test. Monte Carlo simulation experiments were employed to study the performance of the test statistics of each of these three tests depending on type one error rates and the power rates of the test statistics. The three tests have been applied on different sample sizes generated from three distributions represented by Bivariate normal distribution, Bivariate contaminated normal distribution, and the Bivariate Exponential distribution.
n this research, several estimators concerning the estimation are introduced. These estimators are closely related to the hazard function by using one of the nonparametric methods namely the kernel function for censored data type with varying bandwidth and kernel boundary. Two types of bandwidth are used: local bandwidth and global bandwidth. Moreover, four types of boundary kernel are used namely: Rectangle, Epanechnikov, Biquadratic and Triquadratic and the proposed function was employed with all kernel functions. Two different simulation techniques are also used for two experiments to compare these estimators. In most of the cases, the results have proved that the local bandwidth is the best for all the types of the kernel boundary func
... Show MoreThe essential objective of this paper is to introduce new notions of fibrewise topological spaces on D that are named to be upper perfect topological spaces, lower perfect topological spaces, multi-perfect topological spaces, fibrewise upper perfect topological spaces, and fibrewise lower perfect topological spaces. fibrewise multi-perfect topological spaces, filter base, contact point, rigid, multi-rigid, multi-rigid, fibrewise upper weakly closed, fibrewise lower weakly closed, fibrewise multi-weakly closed, set, almost upper perfect, almost lower perfect, almost multi-perfect, fibrewise almost upper perfect, fibrewise almost lower perfect, fibrewise almost multi-perfect, upper* continuous fibrewise upper∗ topol
... Show MorePlayfair cipher is a substitution scheme. The classical playfair scheme has a limited matrix size containing only uppercase letters, so it is prone to hackers and cryptanalysis. To increase the resistance of playfair cipher, a new encipherment and decipherment method is proposed in this work, which depends on the permutation and its inverse, respectively. In addition, a modified key matrix is utilized, which includes capital and small Alphabets, numbers, and 38 special characters collected from ASCII codes. In the proposed method, both substitution and transposition schemes are used, where the first stratum of the cipher is a substitution by using key matrix and the second stratum is a transposi
... Show Morethe study considers the optical classification of cervical nodal lymph cells and is based on research into the development of a Computer Aid Diagnosis (CAD) to detect the malignancy cases of diseases. We consider 2 sets of features one of them is the statistical features; included Mode, Median, Mean, Standard Deviation and Maximum Probability Density and the second set are the features that consist of Euclidian geometrical features like the Object Perimeter, Area and Infill Coefficient. The segmentation method is based on following up the cell and its background regions as ranges in the minimum-maximum of pixel values. The decision making approach is based on applying of Minimum Dista
The purpose of the study is the city of Baghdad, the capital of Iraq, was chosen to study the spectral reflection of the land cover and to determine the changes taking place in the areas of the main features of the city using the temporal resolution of multispectral bands of the satellite Landsat 5 and 8 for MSS and OLI sensors respectively belonging to NASA and for the period 1999-2021, and calculating the increase and decrease in the basic features of Baghdad. The main conclusions of the study were, This study from 1999 to 2021 and in two different seasons: the Spring of the growing season and Summer the dry season. When using the supervised classification method to determine the differences, the results showed remarkable changes. Where h
... Show MoreThe purpose of the study is the city of Baghdad, the capital of Iraq, was chosen to study the spectral reflection of the land cover and to determine the changes taking place in the areas of the main features of the city using the temporal resolution of multispectral bands of the satellite Landsat 5 and 8 for MSS and OLI sensors respectively belonging to NASA and for the period 1999-2021, and calculating the increase and decrease in the basic features of Baghdad .The main conclusions of the study were,
This study from 1999 to 2021 and in two different seasons: the Spring of the growing season and Summer the dry season. When using the supervised classification method to determine the differences, the results showed rema
... Show More