Advances in digital technology and the World Wide Web has led to the increase of digital documents that are used for various purposes such as publishing and digital library. This phenomenon raises awareness for the requirement of effective techniques that can help during the search and retrieval of text. One of the most needed tasks is clustering, which categorizes documents automatically into meaningful groups. Clustering is an important task in data mining and machine learning. The accuracy of clustering depends tightly on the selection of the text representation method. Traditional methods of text representation model documents as bags of words using term-frequency index document frequency (TFIDF). This method ignores the relationship and meanings of words in the document. As a result the sparsity and semantic problem that is prevalent in textual document are not resolved. In this study, the problem of sparsity and semantic is reduced by proposing a graph based text representation method, namely dependency graph with the aim of improving the accuracy of document clustering. The dependency graph representation scheme is created through an accumulation of syntactic and semantic analysis. A sample of 20 news groups, dataset was used in this study. The text documents undergo pre-processing and syntactic parsing in order to identify the sentence structure. Then the semantic of words are modeled using dependency graph. The produced dependency graph is then used in the process of cluster analysis. K-means clustering technique was used in this study. The dependency graph based clustering result were compared with the popular text representation method, i.e. TFIDF and Ontology based text representation. The result shows that the dependency graph outperforms both TFIDF and Ontology based text representation. The findings proved that the proposed text representation method leads to more accurate document clustering results.
This paper reports an evaluation of the properties of medium-quality concrete incorporating recycled coarse aggregate (RCA). Concrete specimens were prepared with various percentages of the RCA (25%, 50%, 75%, and 100%). The workability, mechanical properties, and durability in terms of abrasion of cured concrete were examined at different ages. The results reveal insignificant differences between the recycled concrete (RC) and reference concrete in terms of the mechanical and durability-related measurements. Meanwhile, the workability of the RC reduced vastly since the replacement of the RCA reached 75% and 100%. The ultrasound pulse velocity (UPV) results greatly depend on the porosity of concrete and the RC exhibited higher poros
... Show MoreThe Iraqi sports journalism has paid great attention to sports events through press coverage of all its forms and arts, especially the coverage of the World Cup of football which is one of the most watched events in the world. Thousands of journals are preparing for the immediate coverage of such event which is a daunting task in itself. Newspapers have devoted a wider space to this great event in its pages as well as the weakly sports newspapers work on issuing a daily special issue. The importance of this research sheds light on the coverage of major events such as World Cup in Iraqi newspapers. The topic is new
... Show MoreThe conception and experimental assessment of a removable friction-based shear connector (FBSC) for precast steel-concrete composite bridges is presented. The FBSC uses pre-tensioned high-strength steel bolts that pass through countersunk holes drilled on the top flange of the steel beam. Pre-tensioning of the bolts provides the FBSC with significant frictional resistance that essentially prevents relative slip displacement of the concrete slab with respect to the steel beam under service loading. The countersunk holes are grouted to prevent sudden slip of the FBSC when friction resistance is exceeded. Moreover, the FBSC promotes accelerated bridge construction by fully exploiting prefabrication, does not raise issues relevant to precast co
... Show MoreGlaucoma is a visual disorder, which is one of the significant driving reason for visual impairment. Glaucoma leads to frustrate the visual information transmission to the brain. Dissimilar to other eye illness such as myopia and cataracts. The impact of glaucoma can’t be cured; The Disc Damage Likelihood Scale (DDLS) can be used to assess the Glaucoma. The proposed methodology suggested simple method to extract Neuroretinal rim (NRM) region then dividing the region into four sectors after that calculate the width for each sector and select the minimum value to use it in DDLS factor. The feature was fed to the SVM classification algorithm, the DDLS successfully classified Glaucoma d
HM Al-Dabbas, RA Azeez, AE Ali, Iraqi Journal of Science, 2023
In this paper, we propose a method using continuous wavelets to study the multivariate fractional Brownian motion through the deviations of the transformed random process to find an efficient estimate of Hurst exponent using eigenvalue regression of the covariance matrix. The results of simulations experiments shown that the performance of the proposed estimator was efficient in bias but the variance get increase as signal change from short to long memory the MASE increase relatively. The estimation process was made by calculating the eigenvalues for the variance-covariance matrix of Meyer’s continuous wavelet details coefficients.