Graph based text representation for document clustering

Advances in digital technology and the World Wide Web have led to an increase in digital documents used for purposes such as publishing and digital libraries. This growth calls for effective techniques to support the search and retrieval of text. One of the most needed tasks is clustering, which automatically groups documents into meaningful categories and is an important task in data mining and machine learning. The accuracy of clustering depends strongly on the choice of text representation method. Traditional methods model documents as bags of words weighted by term frequency-inverse document frequency (TFIDF). This representation ignores the relationships between words and their meanings, so the sparsity and semantic problems prevalent in textual documents remain unresolved. This study reduces these problems by proposing a graph-based text representation method, the dependency graph, with the aim of improving the accuracy of document clustering. The dependency graph representation is built by combining syntactic and semantic analysis. A sample of the 20 Newsgroups dataset was used. The text documents undergo pre-processing and syntactic parsing to identify sentence structure, and the semantics of words are then modeled using the dependency graph. The resulting dependency graph is used in cluster analysis with the K-means clustering technique. The dependency-graph-based clustering results were compared with those of popular text representation methods, namely TFIDF and ontology-based text representation. The results show that the dependency graph outperforms both, indicating that the proposed text representation method leads to more accurate document clustering.
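
As a rough illustration of the bag-of-words baseline described in this abstract, the sketch below computes TFIDF weights in plain Python. The weighting variant (term count divided by document length, multiplied by log(N / df)), the function name `tfidf`, and the toy sentences are assumptions made for illustration; the abstract does not state which variant or preprocessing the authors used.

```python
import math
from collections import Counter


def tfidf(docs):
    """Return one {term: weight} dict per document.

    Assumed variant: tf = count / document length, idf = log(N / df).
    """
    tokenized = [doc.lower().split() for doc in docs]
    n_docs = len(tokenized)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for tokens in tokenized for term in set(tokens))
    vectors = []
    for tokens in tokenized:
        counts = Counter(tokens)
        vectors.append({
            term: (count / len(tokens)) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return vectors


# Toy documents (hypothetical): the first two use the same words
# in a different order and describe different events.
docs = [
    "the cat chased the mouse",
    "the mouse chased the cat",
    "stock prices fell sharply",
]
vectors = tfidf(docs)
```

In this toy example the first two documents receive identical vectors even though they describe different events, which is the loss of word relationships that the abstract attributes to bag-of-words representations.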

Publication Date
Thu Oct 18 2018
Journal Name
Proceedings of the Future Technologies Conference (FTC) 2018
Using Mouse Dynamics for Continuous User Authentication
Publication Date
Fri Feb 01 2019
Journal Name
IEEE Transactions on Emerging Topics in Computational Intelligence
Neuromorphic Architecture for the Hierarchical Temporal Memory
Publication Date
Thu Oct 10 2024
Journal Name
Al-mustansiriyah Journal Of Arts
Strategies for Achieving an Equivalent Level of Appellation in Arabic Tourist Texts as Translated into English
Publication Date
Thu Jun 01 2017
Journal Name
Journal Of Economics And Administrative Sciences
Compared of estimating two methods for nonparametric function to cluster data for the white blood cells to leukemia patients
Clustered data arise in the social, health, and behavioral sciences. Observations in this type of data are linked, and the clusters can be expressed through the relationship between measurements on units within the same group.

In this research, I estimate the reliability function of cluster function by using the seemingly unrelate ...
Publication Date
Thu Jan 01 2009
Journal Name
J. Of University Of Anbar For Pure Science
Estimation of the Normalized Difference Vegetation Index (NDVI) Variation for Selected Regions in Iraq for two Years 1990 & 2001

The Normalized Difference Vegetation Index (NDVI) is commonly used as a measure of land surface greenness, based on the assumption that the NDVI value is positively proportional to the amount of green vegetation in an image pixel area. An NDVI data set derived from Landsat remote sensing imagery is used to estimate the area of plant cover in the region west of Baghdad during 1990-2001. The results show that between 1990 and 2001 the plant area in the Baghdad region increased from 44760.25 hectares to 75410.67 hectares, while the exposed area decreased.
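
For reference, NDVI is conventionally computed per pixel as (NIR - Red) / (NIR + Red), where NIR and Red are the near-infrared and red reflectances. The sketch below shows that formula and the arithmetic behind the reported change in plant area. The function name `ndvi`, the sample reflectance values, and the zero-denominator convention are assumptions for illustration; the abstract does not describe the authors' processing chain.

```python
def ndvi(nir, red):
    """NDVI for one pixel from near-infrared and red reflectance."""
    total = nir + red
    if total == 0:
        return 0.0  # assumed convention for pixels with no signal
    return (nir - red) / total


# Hypothetical reflectances: dense vegetation reflects strongly in NIR.
vegetated = ndvi(0.5, 0.1)   # (0.5 - 0.1) / (0.5 + 0.1)
bare_soil = ndvi(0.2, 0.2)   # equal reflectance gives 0.0

# Change in plant area, using the two figures reported in the abstract.
area_1990_ha = 44760.25
area_2001_ha = 75410.67
increase_ha = area_2001_ha - area_1990_ha
increase_pct = 100 * increase_ha / area_1990_ha

print(f"NDVI vegetated: {vegetated:.3f}, bare soil: {bare_soil:.3f}")
print(f"Plant area increase: {increase_ha:.2f} ha ({increase_pct:.1f}%)")
```

The reported figures correspond to an increase of about 30650 hectares, roughly 68% of the 1990 plant area.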

Publication Date
Fri Sep 30 2022
Journal Name
Journal Of Economics And Administrative Sciences
Comparison of Some Methods for Estimating the Survival Function and Failure Rate for the Exponentiated Expanded Power Function Distribution
We present the exponentiated expanded power function (EEPF) distribution with four parameters. It was created using the exponentiated expansion method introduced by Gupta to extend the exponential distribution: a new shape parameter is added to the cumulative function of the distribution, resulting in a new distribution that belongs to the exponential family. We also obtained the survival function and failure rate for this distribution and derived some of its mathematical properties. We then used the maximum likelihood (ML) method and the developed least squares (LSD) method ...
Publication Date
Tue Dec 31 2019
Journal Name
Journal Of Economics And Administrative Sciences
Measuring the Range Application of Internal Marketing for HRM Philosophy in the Public Company for Electrical and Electronic Industries

The reason for choosing this topic, internal marketing (IM) of human resource management (HRM), is to highlight the advantages of using IM within the organizational framework. The research problem lies in insufficient attention to employees' genuine needs as they interact with each other for the sake of the organization's prosperity. This research paper can be used as an indicator to expose the weaknesses that the organization encounters daily. It examines the possibility of developing a philosophy of internal marketing of human resources and its main practices (empowering staff, training courses, motivation and recognition, and communication within departments) in order to reach targeted res ...
Publication Date
Sun Mar 01 2020
Journal Name
IOP Conference Series: Materials Science and Engineering
Prepare Maps For Greenhouse Gases With Some Weather Elements For Baghdad City Using Data Observation And Arc-GIS Techniques

Air pollution refers to the release of pollutants into the air that are detrimental to human health and the planet as a whole. In this research, measurements of air pollutant concentrations, including total suspended particles (TSP), carbon monoxide (CO), and carbon dioxide (CO2), and of meteorological parameters, including temperature (T), relative humidity (RH), and wind speed and direction, were conducted in Baghdad city at 22 measuring stations located in different regions and classified as industrial, commercial, or residential. Using the Arc-GIS program (spatial analysis), different maps have been prepared for the distribution of different pollutant ...
Publication Date
Sat Dec 03 2022
Journal Name
Iraqi Journal Of Science
Detection of Spectral Reflective Changes for Temporal Resolution of Land Cover (LC) for Two Different Seasons in central Iraq

The city of Baghdad, the capital of Iraq, was chosen to study the spectral reflectance of the land cover and to determine the changes in the areas of the city's main features, using the temporal resolution of multispectral bands from the Landsat 5 and Landsat 8 satellites (MSS and OLI sensors, respectively) belonging to NASA for the period 1999-2021, and to calculate the increase and decrease in the basic features of Baghdad. The study covers 1999 to 2021 in two different seasons: spring, the growing season, and summer, the dry season. When the supervised classification method was used to determine the differences, the results showed remarkable changes. Where h ...