Advances in digital technology and the World Wide Web have led to a growing number of digital documents used for purposes such as publishing and digital libraries. This growth has raised awareness of the need for effective techniques to support text search and retrieval. One of the most needed tasks is clustering, which automatically groups documents into meaningful categories. Clustering is an important task in data mining and machine learning, and its accuracy depends closely on the choice of text representation method. Traditional methods model documents as bags of words using term frequency-inverse document frequency (TF-IDF). This representation ignores the relationships between words and their meanings, so the sparsity and semantic problems that are prevalent in textual documents remain unresolved. In this study, these problems are reduced by proposing a graph-based text representation method, the dependency graph, with the aim of improving the accuracy of document clustering. The dependency graph representation is built by combining syntactic and semantic analysis. A sample of the 20 Newsgroups dataset was used. The text documents undergo pre-processing and syntactic parsing to identify sentence structure, and the semantics of words are then modeled using a dependency graph. The resulting dependency graph is used in the cluster analysis, which was performed with the K-means technique. The dependency-graph-based clustering results were compared with those of popular text representation methods, namely TF-IDF and ontology-based text representation. The results show that the dependency graph outperforms both. These findings indicate that the proposed text representation method leads to more accurate document clustering.
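For orientation, the bag-of-words baseline that the abstract compares against can be sketched in a few lines of plain Python. This is only an illustrative sketch of TF-IDF weighting followed by K-means, not the authors' dependency-graph method or their experimental setup. The whitespace tokenizer, the smoothed IDF variant, the farthest-first centroid initialization, the helper names (`tfidf_vectors`, `kmeans`, `sq_dist`) and the four toy documents are all assumptions made for this example.

```python
import math
from collections import Counter


def tfidf_vectors(docs):
    """Return (vocabulary, L2-normalised TF-IDF vectors) for whitespace-tokenised docs."""
    tokenised = [doc.lower().split() for doc in docs]
    vocab = sorted({tok for toks in tokenised for tok in toks})
    n_docs = len(docs)
    # Document frequency: number of documents that contain each term.
    df = Counter(tok for toks in tokenised for tok in set(toks))
    # Smoothed IDF (one common variant, assumed here): ln((1 + N) / (1 + df)) + 1
    idf = {t: math.log((1 + n_docs) / (1 + df[t])) + 1.0 for t in vocab}
    vectors = []
    for toks in tokenised:
        tf = Counter(toks)
        vec = [tf[t] * idf[t] for t in vocab]
        norm = math.sqrt(sum(x * x for x in vec)) or 1.0
        vectors.append([x / norm for x in vec])
    return vocab, vectors


def sq_dist(u, v):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((a - b) ** 2 for a, b in zip(u, v))


def kmeans(vectors, k, n_iter=20):
    """Lloyd's K-means with deterministic farthest-first initialisation (an assumption)."""
    centroids = [list(vectors[0])]
    while len(centroids) < k:
        # Pick the vector farthest from its nearest already-chosen centroid.
        farthest = max(vectors, key=lambda v: min(sq_dist(v, c) for c in centroids))
        centroids.append(list(farthest))
    labels = None
    for _ in range(n_iter):
        # Assignment step: each vector goes to its nearest centroid.
        new_labels = [min(range(k), key=lambda c: sq_dist(v, centroids[c]))
                      for v in vectors]
        if new_labels == labels:
            break  # assignments stopped changing
        labels = new_labels
        # Update step: each centroid becomes the mean of its members.
        for c in range(k):
            members = [v for v, lab in zip(vectors, labels) if lab == c]
            if members:  # keep the old centroid if a cluster is empty
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels


# Toy documents invented for this example (two obvious topics).
docs = [
    "graph parsing syntax tree",
    "syntax tree dependency graph",
    "water treatment plant index",
    "water index corrosion plant",
]
vocab, vectors = tfidf_vectors(docs)
print(kmeans(vectors, k=2))  # → [0, 0, 1, 1]
```

Because each document becomes a long, mostly zero vector indexed only by surface tokens, two documents that express the same idea with different words share no dimensions. That is the sparsity and semantic limitation the abstract attributes to the bag-of-words representation.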
In this research, an Artificial Neural Network (ANN) technique was applied to predict the Ryznar Index (RI) of the water flowing from water treatment plants (WTPs) on the Al-Karakh side (left side) of Baghdad city for the year 2013. Three models (ANN1, ANN2 and ANN3) were developed and tested using data from Baghdad Mayoralty (Amanat Baghdad), including drinking water quality for the period 2004 to 2013. The results indicate that artificial neural networks can predict the stability index (RI) with a good degree of accuracy. The ANN2 model could be used to predict RI for the effluents from the Al-Karakh, Al-Qadisiya and Al-Karama WTPs, for which the highest correlation coefficients were obtained: 92.4, 82.9 and 79.1%, respectively. For ...
This research analyzes the indicators of spatial variation in the health field guide in the Al-Adhamiyah and Rusafa districts, by environmental and administrative unit, in 2016. The analysis was carried out by groups of health guide indicators. The objectives of the study were to identify the spatial variation of health services and to assess the health situation of families across the environmental and administrative units of the study area. These objectives were pursued by determining the extent of families’ satisfaction with the type of services, measuring cases of deprivation, and identifying the most deprived areas. The study concluded that there is a clear spatial variation between the indicators and ...
This study examines the effect of adding two kinds of ceramic particles on the mechanical properties of (Al-7%Si-0.3%Mg) alloy: zirconia with particle size (20μm > P.S ≥ 0.1μm) and alumina with particle size (20μm > P.S ≥ 0.1μm), each added to the alloy at weight ratios of (0.2, 0.4, 0.6, 0.8 and 1%). The stir casting method was used to produce the composite material, using the vortex technique to draw the particles into the molten metal and distribute them homogeneously.
The samples were then solution treated at (520ºC) and artificially aged at (170ºC) for different times. It was noticed that the hardness values increased with the aging time of the o ...
It is important to study history because it offers solutions to contemporary problems in light of past experience, and to study the advantages and disadvantages of those solutions. Starting from the principle that people with visual impairments are a human resource that must be invested in to serve the progress and prosperity of society, it is important to shed light on this group and to help convey an honorable image of it. That image may motivate others whose lives were brought to a halt by the loss of sight, serving as a spark of hope that restores their passion for life, and ...
This research examines the depth and dimensions of the problem of environmental pollution resulting from the combustion of fuel in electric power generators, especially in summer, when the national electric power supplied by the state is almost non-existent. The problem is a local phenomenon with serious consequences for human health. The research also considers the possibility of using tax-system tools (environmental taxes) to reduce these pollutants. It therefore aims to identify the types of gases emitted from burning fuel in electric generators operating in the province of Baghdad, to measure the amount of environmental pollution, and to compare the amounts of some of these gases, those posing the greatest risk to humans, with the levels permitted by the Wor ...
Author: Nuha Mohsin Dhahi. Location: Revista iberoamericana de psicología del ejercicio y el deporte, No. 5, 2022. Journal article in Dialnet.
The research discusses the problem of salaries in the public sector in terms of analyzing the salary structure and the possibility of using the information provided by that analysis in the strategic planning process. The General Authority for Groundwater, a centrally funded formation of the Ministry of Water Resources, was adopted as the field of research, represented by the salary structure of its (1117) employees. The salary structure was analyzed for the period (2014-2019) using a quantitative approach and a number of statistical tools, including arithmetic means, upper limits, lower limits, p ...
Abstract:
Time series models often suffer from outliers that accompany the data collection process for many reasons, and their presence may have a significant impact on the estimation of the parameters of the studied model. Obtaining highly efficient estimators is one of the most important stages of statistical analysis, so it is important to choose appropriate methods that yield good estimators. The aim of this research is to compare the ordinary estimators and the robust estimators of the parameters of ...