Graph based text representation for document clustering

Asma Khazaal Abdulsahib; Siti Sakira Kamaruddin

Details

Publication Date

Thu Jan 01 2015

Journal Name

Journal Of Theoretical And Applied Information Technology

Volume

76

Issue Number

1

Text Representation Schemes

Dependency Graph

Document Clustering

Sparsity Problem

Semantic Problem.

Advances in digital technology and the World Wide Web have led to a growing number of digital documents used for purposes such as publishing and digital libraries. This growth calls for effective techniques to support the search and retrieval of text. One of the most needed tasks is clustering, which automatically groups documents into meaningful categories. Clustering is an important task in data mining and machine learning, and its accuracy depends heavily on the choice of text representation method. Traditional methods model documents as bags of words weighted by term frequency-inverse document frequency (TFIDF). This approach ignores the relationships between words and their meanings, so the sparsity and semantic problems prevalent in textual documents remain unresolved. This study reduces the sparsity and semantic problems by proposing a graph-based text representation method, the dependency graph, with the aim of improving the accuracy of document clustering. The dependency graph representation is built by combining syntactic and semantic analysis. A sample of the 20 Newsgroups dataset was used. The text documents undergo pre-processing and syntactic parsing to identify sentence structure, and the semantics of words are then modeled using a dependency graph. The resulting dependency graph is used for cluster analysis with the K-means clustering technique. The dependency-graph-based clustering results were compared with those of two popular text representation methods, TFIDF and ontology-based text representation. The results show that the dependency graph outperforms both, indicating that the proposed text representation method leads to more accurate document clustering.
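The limitation of the bag-of-words model described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the `tfidf` helper, the toy sentences, and the hand-written dependency edges (with relation labels such as `nsubj` and `obj` chosen for illustration) are all assumptions. The sketch shows that two sentences with opposite meanings receive identical TFIDF vectors, while a set of dependency edges keeps them apart.

```python
import math
from collections import Counter


def tfidf(docs):
    """Return one {term: weight} dict per document, using tf * log(N / df)."""
    tokenized = [doc.lower().split() for doc in docs]
    n_docs = len(tokenized)
    # Document frequency: in how many documents each term occurs.
    df = Counter(term for tokens in tokenized for term in set(tokens))
    vectors = []
    for tokens in tokenized:
        tf = Counter(tokens)
        vectors.append(
            {t: (count / len(tokens)) * math.log(n_docs / df[t]) for t, count in tf.items()}
        )
    return vectors


docs = ["dog bites man", "man bites dog", "stocks fall sharply"]
vecs = tfidf(docs)

# Word order is discarded, so the first two documents look identical.
same_bag = vecs[0] == vecs[1]

# Hypothetical hand-written dependency edges (head, relation, dependent).
# A real pipeline would obtain these from a syntactic parser.
edges_a = {("bites", "nsubj", "dog"), ("bites", "obj", "man")}
edges_b = {("bites", "nsubj", "man"), ("bites", "obj", "dog")}
same_graph = edges_a == edges_b

print("identical bag-of-words vectors:", same_bag)
print("identical dependency graphs:", same_graph)
```

Each TFIDF vector also stores only the terms that occur in its document, which hints at the sparsity problem: over a realistic vocabulary, almost every entry of a document vector is zero.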

Publication Date

Mon Aug 01 2016

Journal Name

Journal Of Economics And Administrative Sciences

User (K-Means) for clustering in Data Mining with application

object

data mining

clustering

machine learning

algorithm

قتيبة نبيل

محي الدين خلف

Rapid scientific progress has led to the accumulation of information in large databases, which makes it important to review and organize this vast amount of data in order to extract hidden information or to classify data according to the relationships among them, so that they can be put to practical use.

Data mining (DM) is well suited to this task. This research applies the K-Means algorithm to cluster real data and observes the effect on the variables of changing the sample size (n) and the number of clusters (K).
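The K-Means procedure referred to above can be sketched in a few lines. This is a minimal illustration, not the paper's experiment: the toy points, the choice of the first K points as initial centroids, and the helper names `kmeans` and `within_cluster_ss` are assumptions made for the example.

```python
def squared_distance(p, q):
    """Squared Euclidean distance between two points given as tuples."""
    return sum((a - b) ** 2 for a, b in zip(p, q))


def kmeans(points, k, max_iters=100):
    """Lloyd's algorithm: alternate assignment and centroid update."""
    # Assumption: the first k points are used as initial centroids.
    centroids = [tuple(p) for p in points[:k]]
    clusters = []
    for _ in range(max_iters):
        # Assignment step: each point joins its nearest centroid.
        clusters = [[] for _ in range(k)]
        for p in points:
            nearest = min(range(k), key=lambda i: squared_distance(p, centroids[i]))
            clusters[nearest].append(p)
        # Update step: each centroid moves to the mean of its members.
        updated = [
            tuple(sum(axis) / len(members) for axis in zip(*members)) if members else centroids[i]
            for i, members in enumerate(clusters)
        ]
        if updated == centroids:
            break  # assignments are stable
        centroids = updated
    return centroids, clusters


def within_cluster_ss(centroids, clusters):
    """Total squared distance of points to their cluster centroid."""
    return sum(
        squared_distance(p, c) for c, members in zip(centroids, clusters) for p in members
    )


points = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10)]
for k in (1, 2):
    centroids, clusters = kmeans(points, k)
    print(k, [len(c) for c in clusters], within_cluster_ss(centroids, clusters))
```

Running the loop with different values of K, or with a larger or smaller list of points, is one simple way to observe how the number of clusters and the sample size affect the result.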

Publication Date

Sun Jan 02 2022

Journal Name

Journal Of The College Of Languages (jcl)

Semantic relations in text and translation: Die semantischen Relationen im Text und Übersetzung

Semantic- relations- text- translation-word.

Semantik-Relationen-Text-Struktur-Satz-Übersetzung

Muafak M.J. Almusleh

Based on the German language department’s theoretical and practical aspects as well as educational programs, the present study discusses the semantic relations in text sentences and their role in the science of translation. Through clarifying the semantic relationship between the text sentence and the methods used to express a news item, a situation or an occurrence and through the statement of the multiple theoretical semantic structures of the text’s construction and interrelation, a translator can easily translate a text into the target language.

It is known that language learners face multiple difficulties in writing and creating an inte

Publication Date

Sat Aug 01 2020

Journal Name

International Journal Of Electrical And Computer Engineering (ijece)

Text hiding in text using invisible character

Nada Abdul Aziz Mustafa

Steganography can be defined as the art and science of hiding information in data that can be read by a computer. The stego-cover should not be distinguishable from the original, whether by eye or by computer analysis of statistical samples. This paper presents a new method for hiding text within text characters. The method uses the structure of invisible characters to hide and extract secret texts. Creating the secret message comprises four main stages, such as using the letters of the original message, selecting a suitable cover text, dividing the cover text into blocks, hiding the secret text using the invisible character, and comparing the cover text with the stego-object. This study uses an invisible character (white space
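The general idea of hiding text behind invisible characters can be sketched as follows. This is not the paper's method, whose details are not given here: the use of the zero-width space (U+200B) and zero-width non-joiner (U+200C) as carriers, the choice to append the payload after the cover text, and the function names `hide` and `extract` are all assumptions for illustration.

```python
ZERO = "\u200b"  # zero-width space, used here to encode bit 0 (assumption)
ONE = "\u200c"   # zero-width non-joiner, used here to encode bit 1 (assumption)


def hide(cover, secret):
    """Append the secret to the cover text as a run of invisible characters."""
    bits = "".join(f"{byte:08b}" for byte in secret.encode("utf-8"))
    payload = "".join(ONE if bit == "1" else ZERO for bit in bits)
    return cover + payload


def extract(stego):
    """Recover the secret by reading back only the invisible carrier characters."""
    bits = "".join("1" if ch == ONE else "0" for ch in stego if ch in (ZERO, ONE))
    data = bytes(int(bits[i:i + 8], 2) for i in range(0, len(bits), 8))
    return data.decode("utf-8")


cover = "Meet me at noon."
stego = hide(cover, "key")
print(stego == cover)    # the strings differ even though they render alike
print(extract(stego))    # the hidden text is recovered
```

A scheme like this changes the character count of the text, so comparing the cover text with the stego-object, as the abstract mentions, is a natural final check.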

Publication Date

Sat May 03 2025

Journal Name

Aip Conference Proceedings

Computational applications on the result involution graph for the held group He

Salwa Mohammed

Ali Abd

In this work, a detailed computational study was conducted to determine several properties of the graph. Furthermore, the number of dihedral subgroups in the Held simple group He was determined using the properties of gamma.

Publication Date

Tue Apr 02 2024

Journal Name

Al-iraqia Journal Of Scientific Engineering Research

Prioritise Five Tafseer Translators Using Clustering Technique for Surah Al-Baqarah

Dictionary

Image compression

Lossless Compression

LZW algorithm

photographs

Mohammed A.

Shahad Mahgoob

Hanif

Puteri Nor Ellyza

Publication Date

Mon Feb 01 2021

Journal Name

International Journal Of Electrical And Computer Engineering (ijece)

Features of genetic algorithm for plain text encryption

Riyadh Bassil

Data communication has grown rapidly in recent years. Data encryption has therefore become essential for secure data transmission and storage and for protecting data contents from intruders and unauthorized persons. In this paper, a fast text encryption technique based on a genetic algorithm is presented. Encryption is achieved with the genetic operators crossover and mutation. The proposed technique divides the plain text characters into pairs, applies the crossover operation to each pair, and then applies the mutation operation to obtain the encrypted text. The experimental results show that the proposal provides an important improvement in encryption rate with comparatively high-speed process
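The pair-wise crossover followed by mutation can be sketched as a toy cipher. The abstract does not specify the operators, so everything concrete here is an assumption: single-point crossover at bit 4 of each byte pair, mutation as an XOR with a fixed mask, and the names `crossover`, `mutate`, `encrypt`, and `decrypt`. The sketch illustrates the mechanics only and offers no real security.

```python
def crossover(a, b, point=4):
    """Single-point crossover: swap the lowest `point` bits of two bytes."""
    mask = (1 << point) - 1
    high = 0xFF & ~mask
    return (a & high) | (b & mask), (b & high) | (a & mask)


def mutate(byte, mask=0b00100100):
    """Mutation: flip the bits selected by a fixed mask (assumed value)."""
    return byte ^ mask


def encrypt(plain):
    out = bytearray(plain)
    # Cross over each consecutive pair; a trailing unpaired byte is left as-is.
    for i in range(0, len(out) - 1, 2):
        out[i], out[i + 1] = crossover(out[i], out[i + 1])
    return bytes(mutate(b) for b in out)


def decrypt(cipher):
    # Both operators are their own inverses, so undo them in reverse order.
    out = bytearray(mutate(b) for b in cipher)
    for i in range(0, len(out) - 1, 2):
        out[i], out[i + 1] = crossover(out[i], out[i + 1])
    return bytes(out)


message = b"hello world"
cipher = encrypt(message)
print(cipher.hex())
print(decrypt(cipher))
```

Because swapping the low bits twice restores the original pair and XOR with the same mask cancels itself, decryption simply applies the two operators in the opposite order.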

Publication Date

Sat Nov 02 2019

Journal Name

Advances In Intelligent Systems And Computing

Spin-Image Descriptors for Text-Independent Speaker Recognition

Suhaila N.

Building a system to identify individuals through their speech recording can find its application in diverse areas, such as telephone shopping, voice mail and security control. However, building such systems is a tricky task because of the vast range of differences in the human voice. Thus, selecting strong features becomes very crucial for the recognition system. Therefore, a speaker recognition system based on new spin-image descriptors (SISR) is proposed in this paper. In the proposed system, circular windows (spins) are extracted from the frequency domain of the spectrogram image of the sound, and then a run length matrix is built for each spin, to work as a base for feature extraction tasks. Five different descriptors are generated fro

Publication Date

Wed Dec 30 2015

Journal Name

College Of Islamic Sciences

Phonological features For spoken text in the language

م. د. بشرى عبد الرزاق

This research attempts to apply modern techniques to the study of the suprasegmental phonetic features of spoken text, using phonetic software to achieve more accurate and objective results that do not rely on self-perception and personal judgment, which vary from person to person.
These phonological features (stress, or nabr; pause, or waqf; and intonation) are performance controls that determine the meaning of a word or sentence, yet they have received little attention in the modern era, and what attention some of them have received has come in studies of composition or style. Therefore, we recommend that more attention be given to the study of

Publication Date

Sun Jan 07 2018

Journal Name

University Of Baghdad, College Of Education For Pure Sciences / Ibn Al-haitham, Department Of Mathematics

On Topological Structures In Graph Theory

M-space

m-derived graphs

m-open graphs

m-closed graphs

m-interior operators

m-closure operators

M-subspace

o-space

i-space.

SARA SAAD

Y. Y.

In this thesis, we study topological structures in graph theory and various related results. Chapter one contains the fundamental concepts of topology and basic definitions of near open sets, gives an account of uncertainty and rough set theories, and introduces the concepts of graph theory. Chapter two deals with the main concepts of topological structures defined using mixed degree systems in graph theory, namely the M-space. In addition, m-derived graphs, m-open graphs, m-closed graphs, m-interior operators, m-closure operators, and the M-subspace are defined and studied. In chapter three, we study supra-approximation spaces using mixed degree systems, and the primary objects in this chapter are two topological

Publication Date

Mon Oct 30 2023

Journal Name

Journal Of Discrete Mathematical Sciences & Cryptography

Associate graph of a commutative ring

Nermen J.

Nabeel E.

Tamadher Arif

In this study, a new graph, called the associate graph of a ring R and denoted Ass(R), is presented. The vertices of the graph are the elements of R, and any two vertices α and β are joined by an edge if and only if α = rβ and β = sα. Some new properties of Ass(R) are studied. Finally, the complement of Ass(R) is defined and a few of its characteristics are investigated.
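The edge condition can be made concrete for a small ring. The following sketch builds the graph for the ring of integers modulo n under two assumptions not stated in the abstract: r and s are read as arbitrary elements of R, and only distinct vertices are joined. The function name `associate_graph_zn` is hypothetical.

```python
from itertools import combinations


def associate_graph_zn(n):
    """Edges {a, b} of the associate graph on Z_n, as pairs with a < b."""
    elements = range(n)

    def is_multiple(x, y):
        # True if x = r * y (mod n) for some r in Z_n.
        return any((r * y) % n == x for r in elements)

    return {
        (a, b)
        for a, b in combinations(elements, 2)
        if is_multiple(a, b) and is_multiple(b, a)
    }


edges = associate_graph_zn(6)
print(sorted(edges))
```

In Z_6 this joins 1 with 5 and 2 with 4, since each is a multiple of the other, while 0 and 3 remain isolated.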
