Advances in digital technology and the World Wide Web has led to the increase of digital documents that are used for various purposes such as publishing and digital library. This phenomenon raises awareness for the requirement of effective techniques that can help during the search and retrieval of text. One of the most needed tasks is clustering, which categorizes documents automatically into meaningful groups. Clustering is an important task in data mining and machine learning. The accuracy of clustering depends tightly on the selection of the text representation method. Traditional methods of text representation model documents as bags of words using term-frequency index document frequency (TFIDF). This method ignores the relationship and meanings of words in the document. As a result the sparsity and semantic problem that is prevalent in textual document are not resolved. In this study, the problem of sparsity and semantic is reduced by proposing a graph based text representation method, namely dependency graph with the aim of improving the accuracy of document clustering. The dependency graph representation scheme is created through an accumulation of syntactic and semantic analysis. A sample of 20 news groups, dataset was used in this study. The text documents undergo pre-processing and syntactic parsing in order to identify the sentence structure. Then the semantic of words are modeled using dependency graph. The produced dependency graph is then used in the process of cluster analysis. K-means clustering technique was used in this study. The dependency graph based clustering result were compared with the popular text representation method, i.e. TFIDF and Ontology based text representation. The result shows that the dependency graph outperforms both TFIDF and Ontology based text representation. The findings proved that the proposed text representation method leads to more accurate document clustering results.
Receipt date: 12/28/2020 accepted date: 20/1/2021 Publication date: 12/31/2021
This work is licensed under a Creative Commons Attribution 4.0 International License.
Russia has emerged as a rising and influential power in the international arena, especially with Vladimir Putin's assumption of power and his desire for the rise of Russia and the end of the "unipolarism" represented by the hegemony of the United States of
... Show MoreTo ensure that a software/hardware product is of sufficient quality and functionality, it is essential to conduct thorough testing and evaluations of the numerous individual software components that make up the application. Many different approaches exist for testing software, including combinatorial testing and covering arrays. Because of the difficulty of dealing with difficulties like a two-way combinatorial explosion, this brings up yet another problem: time. Using client-server architectures, this research introduces a parallel implementation of the TWGH algorithm. Many studies have been conducted to demonstrate the efficiency of this technique. The findings of this experiment were used to determine the increase in speed and co
... Show MoreThe most important environmental constraints at the present time
is the accumulation of glass waste (transparent glass bottles). A lot of
experiments and research have been made on waste and recycling
glass to get use it as much as possible. This research using recycling
of locally waste colorless glass to turn them into raw materials as
alternative of certain percentages of cement to save the environment
from glass waste and reduce some of the disadvantages of cement
with conserving the mechanical and physical properties of concrete
made. A set of required samples were prepared for mechanical test
with different weight percentage of waste glass (2%, 4%, 5%, 6%,
8%, 10%, 15%, 20% and 25%). American standard
Conservative pipes conveying fluid such as pinned-pinned (p-p), clamped–pinned (c-p) pipes and clamped-clamped (c-c) lose their stability by buckling at certain critical fluid velocities. In order to experimentally evaluate these velocities, high flow-rate pumps that demand complicated fluid circuits must be used.
This paper studies a new experimental approach based on estimating the critical velocities from the measurement of several fundamental natural frequencies .In this approach low flow-rate pumps and simple fluid circuit can be used.
Experiments were carried out on two pipe models at three different boundary conditions. The results showed that the present approach is more accurate for est
... Show MoreRecently, the development of the field of biomedical engineering has led to a renewed interest in detection of several events. In this paper a new approach used to detect specific parameter and relations between three biomedical signals that used in clinical diagnosis. These include the phonocardiography (PCG), electrocardiography (ECG) and photoplethysmography (PPG) or sometimes it called the carotid pulse related to the position of electrode.
Comparisons between three cases (two normal cases and one abnormal case) are used to indicate the delay that may occurred due to the deficiency of the cardiac muscle or valve in an abnormal case.
The results shown that S1 and S2, first and second sound of the
... Show MoreA novel mixed natural coagulant has been developed to remove sewage pollutants and heavy metals from Qanat- al- Jayesh by using low cost adsorbent natural materials. In these materials, significant interaction contains Arabic gum mixed with extracted silica from rice husk ash (natural coagulants) by the Batch device approach, using two variables, pH values ranging from 5-8 and contact times between 0.25-5 hrs. All wastewater samples were collected after treatment by adsorbents and examined for determination of residual heavy metal concentrations: Pb, Ni, Zn and Cu by atomic absorption spectroscopy (AAS), turbidity, pH, total dissolved salts (TDS), electrical conductivity (EC) and total salinity (TS). The results obtained indicate Th
... Show MoreThin films of vanadium oxide nanoparticles doped with different concentrations of europium oxide (2, 4, 6, and 8) wt % are deposited on glass and Si substrates with orientation (111) utilizing by pulsed laser deposition technique using Nd:YAG laser that has a wavelength of 1064 nm, average frequency of 6 Hz and pulse duration of 10 ns. The films were annealed in air at 300 °C for two hours, then the structural, morphological and optical properties are characterized using x-ray diffraction (XRD), Field emission scanning electron microscopy (FESEM) and UV-Vis spectroscopy respectively. The X-ray diffraction results of V2O5:Eu2O3 exhibit that the film has apolycrystalline monoclinic V2O5 and triclinic V4O7 phases. The FESEM image shows a h
... Show MoreThe current research aims to reveal the strength of education and the direction of the relationship between the formal thinking and learning methods of Kindergarten department students. To achieve this objective, the researcher developed a scale of formal thinking according to the theory of (Inhelder & Piaget 1958) consisting of (25) items in the form of declarative phrases derived from the analysis of formal thinking skills based on a professional situation that students are expected to interact with in a professional way. The research sample consisted of (100) female students selected randomly who were divided into four groups based on the academic stages, the results revealed that The level of formal thinking of the main sample is
... Show MoreA confluence of forces has brought journalism and journalism education to a precipice. The rise of fascism, the advance of digital technology, and the erosion of the economic foundation of news media are disrupting journalism and mass communication (JMC) around the world. Combined with the increasingly globalized nature of journalism and media, these forces are posing extraordinary challenges to and opportunities for journalism and media education. This essay outlines 10 core principles to guide and reinvigorate international JMC education. We offer a concluding principle for JMC education as a foundation for the general education of college students.