Graph based text representation for document clustering

Asma Khazaal Abdulsahib; Siti Sakira Kamaruddin

Details

Publication Date

Thu Jan 01 2015

Journal Name

Journal Of Theoretical And Applied Information Technology

Volume

76

Issue Number

1

Text Representation Schemes

Dependency Graph

Document Clustering

Sparsity Problem

Semantic Problem.

Advances in digital technology and the World Wide Web have led to an increase in digital documents used for purposes such as publishing and digital libraries. This growth calls for effective techniques to support the search and retrieval of text. One of the most needed tasks is clustering, which automatically groups documents into meaningful categories and is an important task in data mining and machine learning. The accuracy of clustering depends closely on the choice of text representation method. Traditional methods model documents as bags of words weighted by term frequency-inverse document frequency (TF-IDF). This approach ignores the relationships between words and their meanings, so the sparsity and semantic problems prevalent in textual documents remain unresolved. This study reduces both problems by proposing a graph-based text representation method, the dependency graph, with the aim of improving the accuracy of document clustering. The dependency graph representation is built by combining syntactic and semantic analysis. A sample of the 20 Newsgroups dataset was used. The text documents undergo pre-processing and syntactic parsing to identify sentence structure, and the semantics of words are then modeled using a dependency graph. The resulting dependency graph is used in the cluster analysis, which was performed with the K-means clustering technique. The dependency-graph-based clustering results were compared with those of two popular text representation methods, TF-IDF and ontology-based text representation. The results show that the dependency graph outperforms both, indicating that the proposed text representation method leads to more accurate document clustering.
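
The contrast between a bag-of-words representation and a dependency-graph representation can be illustrated with a minimal Python sketch. It is not the authors' implementation: the TF-IDF weighting is a plain textbook variant, the dependency triples are written by hand rather than produced by a parser, and the names `tfidf_vectors`, `jaccard`, and the relation labels `nsubj` and `obj` are assumptions made for the example.

```python
import math
from collections import Counter


def tfidf_vectors(docs):
    """Return one {term: weight} dict per tokenized document.

    Textbook variant (an assumption): tf = count / doc length,
    idf = log(N / document frequency).
    """
    n_docs = len(docs)
    doc_freq = Counter(term for doc in docs for term in set(doc))
    vectors = []
    for doc in docs:
        counts = Counter(doc)
        vectors.append({
            term: (count / len(doc)) * math.log(n_docs / doc_freq[term])
            for term, count in counts.items()
        })
    return vectors


def jaccard(edges_a, edges_b):
    """Jaccard similarity between two sets of graph edges."""
    union = edges_a | edges_b
    return len(edges_a & edges_b) / len(union) if union else 0.0


# Two sentences with the same words but opposite meanings, plus a third
# document so that the shared terms receive a non-zero idf.
docs = [
    ["dog", "bites", "man"],
    ["man", "bites", "dog"],
    ["cat", "chases", "mouse"],
]
vectors = tfidf_vectors(docs)

# Hand-written (head, relation, dependent) triples; a real pipeline would
# obtain these from a syntactic parser.
graph_a = frozenset({("bites", "nsubj", "dog"), ("bites", "obj", "man")})
graph_b = frozenset({("bites", "nsubj", "man"), ("bites", "obj", "dog")})

# The bag-of-words vectors cannot tell the two sentences apart,
# while the dependency edges share nothing in common.
print(vectors[0] == vectors[1])   # True
print(jaccard(graph_a, graph_b))  # 0.0
```

The sketch only shows why word order and grammatical roles are lost in a bag-of-words model and retained in a dependency graph. How the graphs are turned into features for K-means is not specified in the abstract and is left out here.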

Publication Date

Thu Jun 01 2023

Journal Name

مجلة اداب المستنصرية

The Concept of dialogue, its importance, its goals and its character: A descriptive Study

Dialogue concept

discussion polite

problem of dialogue

Controversy in literature

Arabic literature.

Khaled Ahmed

Dialogue is one of the most important means of calling to the Creator, as it is one of the scientific and verbal activities carried out by a group of interlocutors to present ideas they believe in, and evidence and proofs that express their views and demonstrate the reason for their belief in them, In order to arrive at the truth or a radical solution to a specific problem, so the interlocutor should pay attention to this science, study it and its etiquette, because the purposeful dialogue requires that the funniest of them be the most knowledgeable and knowledgeable about the axis of the hadith, and the funniest must also be able to be convinced of the rule of difference of opinion that does not spoil the issue of friendly They must also c

Publication Date

Mon Jun 22 2020

Journal Name

Baghdad Science Journal

Splitting the One-Dimensional Wave Equation. Part I: Solving by Finite-Difference Method and Separation Variables

Finite difference method

Inverse force problem

Regularization

Separation variables method

Wave equation.

Shilan Othman

In this study, an unknown space-dependent force function in the wave equation is investigated. The wave equation is split numerically into two parts: the first is solved using the finite-difference method (FDM) and the second using the separation of variables method. This work continues the solution of the inverse problem in (1,2) with a different technique: instead of the boundary element method (BEM) used in (1,2), the finite-difference method (FDM) is applied. The boundary data serve as overdetermination data. The second part of the problem is inverse and ill-posed, since small errors in the extra boundary data cause errors in the force solution. Zeroth-order Tikhonov regularization with several regularization parameters is employed to decrease error

Publication Date

Thu Jan 20 2022

Journal Name

Webology

Red Monkey Optimization and Genetic Algorithm to Solving Berth Allocation Problems

The Berth Allocation Problem (BAP)

Red Colobuses Monkey (RCM)

Genetic Algorithm (GA).

Musa Abdullah

Woud M.

Raghad K.

Mohamed

In the past two decades, maritime transport traffic has increased, especially container flow. The Berth Allocation Problem (BAP) is a central problem in optimizing port terminals. This manuscript addresses the DBAP in a typical arrangement that differs from the conventional separate-station design: each berth can simultaneously accommodate several ships when their total length is less than or equal to the length of the serving berth. The problem was then solved by combining Red Colobuses Monkey Optimization (RCM) with the Genetic Algorithm (GA). In conclusion, comparisons and computational experiments are used to demonstrate the effectiveness of the proposed method contrasted with other

Publication Date

Sun Jul 01 2012

Journal Name

Journal Of Techniques مجلة التقني

A STUDY OF SOME TECHNICAL AND ECONOMICAL PARAMETERS FOR MACHINERY UNIT (NEW HOLLAND & DISC PLOW) BY USING THREE DIFFERENT TILT ANGLES دراسة بعض المؤشرات الفنية والأقتصادية للوحدة الميكنية (الجرار نيوهولاند مع المحراث القرصي الثلاثي القلاب) بأستخدام زوايا ميل مختلفة للأقراص

F. F.

Publication Date

Sun Jan 01 2012

Journal Name

Advances In Materials Physics And Chemistry

The Effect of Zn Concentration on the Optical Properties of Cd₁₀₋ₓZnₓS Films for Solar Cells Applications

Zn Doped CdS

Spray Pyrolysis

Nathera

Nada

Sundus

Baha

Publication Date

Fri Jan 01 2016

Journal Name

Diyala Journal For Pure Sciences

Synthesis, Characterization and Biological Activity for Complexes VO(II), Mn(II), Co(II) and Ni(II) With New Multidentate Ligand [2-((E)-3-(2-hydroxyphenylimino)-1,5-dimethyl-2-phenyl-2,3-dihydro-1H-pyrazol-4-ylimino)acetic Acid][H2L] type (N2).

4-aminoantipyrine

glyoxylic acid

complexes

Basima M sarhan

In this work, the precursor [2-(1,5-dimethyl-3-oxo-2-phenyl-2,3-dihydro-1H-pyrazol-4-ylimino)acetic acid] was synthesised from 4-aminoantipyrine and glyoxylic acid. This precursor was then used to synthesise the new multidentate ligand [2-((E)-3-(2-hydroxyphenylimino)-1,5-dimethyl-2-phenyl-2,3-dihydro-1H-pyrazol-4-ylimino)acetic acid][H2L] of type (N2O2). The ligand was refluxed in ethanol with salts of the metal ions [VO(II), Mn(II), Co(II) and Ni(II)] to give complexes of general molecular formula [M(H2L)2(X)(Y)].B, where: M=VO(II), X=0, Y=OSO3-2, B=2H2O; M=Mn(II), Co(II), X=Cl, Y=Cl, B=0; M=Ni(II), X=H2O, Y=Cl, B=Cl. These complexes were characterised by atomic absorption (A.A), FT-IR and UV-Vis spectroscopies (1H, 13C NMR for the ligand only), alon

Publication Date

Fri Dec 01 2023

Journal Name

Chemical Methodologies

Investigations on TiO₂-NiO@In₂O₃ Nanocomposite Thin Films (NCTFs) for Gas Sensing: Synthesis, Physical Characterization, and Detection of NO₂ and H₂S Gas Sensors

M.A.

Fuad Tariq

S.

Publication Date

Fri Apr 01 2016

Journal Name

Swift Journal Of Social Sciences And Humanity

Difficulties encountered in translating Some legal texts from Arabic into English

legal translation

legal text

marriage and divorce contracts

syntactic semantic and cultural difficulties

equivalence

Mahmood

IBTIHAL

Translation is both a social and a cultural phenomenon: it cannot exist outside a social community, and it can be viewed as a medium of cross-cultural fertilization. This paper investigates the difficulties that a translator may face when dealing with legal texts such as marriage and divorce contracts. According to the present paper, these difficulties can be classified as syntactic, semantic, and cultural. The syntactic difficulties include word order, syntactic arrangement, unusual sentence structure, the use of modal verbs in English, and differences in legal systems. The semantic difficulties involve lack of established terminology, finding functional and lexical equivalence, word for word t

Publication Date

Mon Oct 01 2018

Journal Name

Journal Of Educational And Psychological Researches

The ability of solving a mathematical problem and its relation to system thinking among fifth preparatory students

ability of solving a mathematical problem

relation to system thinking among fifth preparatory

Hussein O. Dahwi

kameran molod Fatah

The research examines the ability of fifth preparatory students to solve mathematical problems in relation to system thinking. To this end, the researcher chose (140) fifth preparatory students from four different secondary schools in Kirkuk city for the academic year (2016-2017). Two tests were adopted to collect the study data: a test of (5) items on mathematical problem-solving skills designed by (Al-raihan, 2006), and a test of system thinking skills designed by the researcher, consisting of (14) items. The latter was divided into four skills (analyzing the main system into subsystems, eliminating all inner gaps of the system, identifying the inner connections of the system, and reorganizing the system). The findings indicated a good ability

Publication Date

Sat Sep 30 2023

Journal Name

Journal Of The College Of Education For Women

The Problem of Paradox and its Manifestations: A Study in the Titles of Najm Wali's Novels

The Problem of Irony

the Irony of the Title

Najm Wali's Novel Titles.

Zainab Raad Ahmad Al-Hamdani

Hussein Mirzaei Nia

The irony pushes us to inquire about what is in the text of contradiction, irony, suspense, and other acts of paradox, as well as a departure from what is logical, or familiar, that attracts the attention of the addressee, and this is what drives us to introspect the text and interrogate it in order to get to know the intended product of the text or its real or metaphorical intent. On the other hand, the irony is more in the literary text than in the scientific texts. Therefore, critics add the word literature to it in their definition.

As it is represented by the paradox, we will seek to study the paradox of the title and the problematic that it may pose as the beginning of the text, and i
