Advances in digital technology and the World Wide Web has led to the increase of digital documents that are used for various purposes such as publishing and digital library. This phenomenon raises awareness for the requirement of effective techniques that can help during the search and retrieval of text. One of the most needed tasks is clustering, which categorizes documents automatically into meaningful groups. Clustering is an important task in data mining and machine learning. The accuracy of clustering depends tightly on the selection of the text representation method. Traditional methods of text representation model documents as bags of words using term-frequency index document frequency (TFIDF). This method ignores the relationship and meanings of words in the document. As a result the sparsity and semantic problem that is prevalent in textual document are not resolved. In this study, the problem of sparsity and semantic is reduced by proposing a graph based text representation method, namely dependency graph with the aim of improving the accuracy of document clustering. The dependency graph representation scheme is created through an accumulation of syntactic and semantic analysis. A sample of 20 news groups, dataset was used in this study. The text documents undergo pre-processing and syntactic parsing in order to identify the sentence structure. Then the semantic of words are modeled using dependency graph. The produced dependency graph is then used in the process of cluster analysis. K-means clustering technique was used in this study. The dependency graph based clustering result were compared with the popular text representation method, i.e. TFIDF and Ontology based text representation. The result shows that the dependency graph outperforms both TFIDF and Ontology based text representation. The findings proved that the proposed text representation method leads to more accurate document clustering results.
In this paper , the CO2 laser receiver system is designed and studied, with wavelength laser 10.6 ?m in room temperature , and to evaluate the performance and discussion it via the package of optical design (ZEMAX), from its output the Spot Diagram is measured through RMS ,and from the Ray fan plot , the aberrations is found which is the normal error for the best focus named (under corrected ) , the other output was the Geometric Encircled Energy in the spot diagram . and found that the radius of spot diagram at 80% (R80%) from the total energy ,and focal shift .The designed system have high efficiency and low cost .
Background: The vaginal microbial ecosystem stability preclude many other organisms but sometimes the vaginal micro biota is disturbed and this cause change in the normal
balance causing symptoms of vulvuvaginitis like abnormal or increased vaginal discharge, redness and itching.
Objective: To prove C. albicans presence in their vagina clinically and laboratory by culture of vaginal swab on two media.
Type of the study: This study is a case control study
Methods: This study is a case control study in which 100 clinically patient women admitted to maternity hospital in kalar city and khanaqin hospital during the pe
... Show MoreImproving the ability of asphalt pavement to survive the heavily repeated axle loads and weathering challenges in Iraq has been the subject of research for many years. The critical need for such data in the design and construction of more durable flexible pavement in bridge deck material is paramount. One of new possible steps is the epoxy asphalt concrete, which is classified as a superior asphalt concrete in roads and greatly imparts the level of design and construction. This paper describes a study on 40-50 penetration graded asphalt cement mixed with epoxy to produce asphalt concrete mixtures. The tests carried out are the Marshall properties, permanent deformation, flexural fatigue cracking and moisture damage. Epoxy asphalt mixes perf
... Show MorePassive optical network (PON) is a point to multipoint, bidirectional, high rate optical network for data communication. Different standards of PONs are being implemented, first of all PON was ATM PON (APON) which evolved in Broadband PON (BPON). The two major types are Ethernet PON (EPON) and Gigabit passive optical network (GPON). PON with these different standards is called xPON. To have an efficient performance for the last two standards of PON, some important issues will considered. In our work we will integrate a network with different queuing models such M/M/1 and M/M/m model. After analyzing IPACT as a DBA scheme for this integrated network, we modulate cycle time, traffic load, throughput, utilization and overall delay
... Show More