Graph based text representation for document clustering

Asma Khazaal Abdulsahib Abdulsahib; SITI SAKIRA KAMARUDDIN KAMARUDDIN

Details

Publication Date

Thu Jan 01 2015

Journal Name

Journal Of Theoretical And Applied Information Technology

Volume

76

Issue Number

1

Choose Citation Style

Statistics

View publication

5

View pdf

3

Statistics

(15)

Graph based text representation for document clustering

Text Representation Schemes

Dependency Graph

Document Clustering

Sparsity Problem

Semantic Problem.

Asma Khazaal Abdulsahib Abdulsahib

SITI SAKIRA KAMARUDDIN KAMARUDDIN

...Show More Authors

Advances in digital technology and the World Wide Web has led to the increase of digital documents that are used for various purposes such as publishing and digital library. This phenomenon raises awareness for the requirement of effective techniques that can help during the search and retrieval of text. One of the most needed tasks is clustering, which categorizes documents automatically into meaningful groups. Clustering is an important task in data mining and machine learning. The accuracy of clustering depends tightly on the selection of the text representation method. Traditional methods of text representation model documents as bags of words using term-frequency index document frequency (TFIDF). This method ignores the relationship and meanings of words in the document. As a result the sparsity and semantic problem that is prevalent in textual document are not resolved. In this study, the problem of sparsity and semantic is reduced by proposing a graph based text representation method, namely dependency graph with the aim of improving the accuracy of document clustering. The dependency graph representation scheme is created through an accumulation of syntactic and semantic analysis. A sample of 20 news groups, dataset was used in this study. The text documents undergo pre-processing and syntactic parsing in order to identify the sentence structure. Then the semantic of words are modeled using dependency graph. The produced dependency graph is then used in the process of cluster analysis. K-means clustering technique was used in this study. The dependency graph based clustering result were compared with the popular text representation method, i.e. TFIDF and Ontology based text representation. The result shows that the dependency graph outperforms both TFIDF and Ontology based text representation. The findings proved that the proposed text representation method leads to more accurate document clustering results.

Preview PDF

Quick Preview PDF

Publication Date

Tue Sep 01 2020

Journal Name

Ceramics International

High-performance (K,Na)NbO3-based binary lead-free piezoelectric ceramics modified with acceptor metal oxide

R.

Nabil Janan

Mohammed A.

Amar

Thulfiqar Ali

W.H. Abd

...Show More Authors

View Publication

(38)

(37)

Publication Date

Sun Jun 20 2021

Journal Name

Baghdad Science Journal

Ontological Methodologies for Counselling Intervention: Do’a and Zikr Al-Mā’thur Corpus

Islamic knowledge

knowledge representation

ontology

ontology evaluation

ontology development

semantic technology

Roslina

Siti Fatimah Mohd

...Show More Authors

Do’a and Zikr al-Mā’thur (authentic supplications and remembrance of ALLAH ‘Azza wa Jalla) can be suggested to Muslims to help them deal with challenges or issues in life. Counselling cases affect a person’s feelings. Do’a and Zikr al-Mā’thur are often applied as a counselling intervention. Unfortunately, the authentic Do’a and Zikr al-Mā’thur are dispersed in many resources not visible to users, and the fact that not all online resources offer access to accurate Do’a and Zikr al-Mā’thur to users and the dubious Do’a and Zikr al-Mā’thur frequently credited to the Prophet (pbuh). The goal of this research is to develop an ontology

View Publication Preview PDF

(2)

Publication Date

Thu Sep 29 2022

Journal Name

World Journal Of Clinical Infectious Diseases

Five-year retrospective hospital-based study on epidemiological data regarding human leishmaniasis in West Kordofan state, Sudan

Mohammed

Musa

Ahmed

Adam

Safa

Suad

Mohammed

...Show More Authors

View Publication

(2)

Publication Date

Thu Nov 02 2017

Journal Name

Iraqi Journal Of Laser

Enhancement the Sensitivity of Humidity Sensor Based on an Agarose Coating Transmission-Type Photonic Crystal Fiber Interferometer

Hassan F.

Hanan J.

...Show More Authors

Photonic Crystal Fiber Interferometers (PCFIs) are widely used for sensing applications. This work presents the fabrication and the characterization of a relative humidity sensor based on a polymer-coated photonic crystal fiber that operates in a Mach- Zehnder Interferometer (MZI) transmission mode. The fabrication of the sensor involved splicing a short (1 cm) length of Photonic Crystal Fiber (PCF) between two single-mode fibers (SMF). It was then coated with a layer of agarose solution. Experimental results showed that a high humidity sensitivity of 29.37 pm/%RH was achieved within a measurement range of 27–95%RH. The sensor also showed good repeatability, small size, measurement accuracy and wide humidity range. The RH sensitivity o

View Publication Preview PDF

Publication Date

Sat Jan 01 2022

Journal Name

Journal Of Stomatology

Preferences of treatments and materials used in the management of exposed pulps: a web-based questionnaire study

Ahmed

Anas

Noor

...Show More Authors

View Publication

(4)

Publication Date

Tue Dec 12 2017

Journal Name

Al-khwarizmi Engineering Journal

Model Reference Adaptive Control based on a Self-Recurrent Wavelet Neural Network Utilizing Micro Artificial Immune Systems

Artificial neural network

micro artificial immune system

model reference adaptive control

self-recurrent wavelet neural network

Wavelet neural network.

Omar Farouq

Maryam Hassan

...Show More Authors

Abstract

This paper presents an intelligent model reference adaptive control (MRAC) utilizing a self-recurrent wavelet neural network (SRWNN) to control nonlinear systems. The proposed SRWNN is an improved version of a previously reported wavelet neural network (WNN). In particular, this improvement was achieved by adopting two modifications to the original WNN structure. These modifications include, firstly, the utilization of a specific initialization phase to improve the convergence to the optimal weight values, and secondly, the inclusion of self-feedback weights to the wavelons of the wavelet layer. Furthermore, an on-line training procedure was proposed to enhance the control per

View Publication Preview PDF

(1)

Publication Date

Tue Dec 01 2015

Journal Name

Journal Of Engineering

Digital Image Authentication Algorithm Based on Fragile Invisible Watermark and MD-5 Function in the DWT Domain

fragile watermark

image authentication

dwt

adaptive threshold

hvs

md-5

rsa.

Nehad Hameed

...Show More Authors

Using watermarking techniques and digital signatures can better solve the problems of digital images transmitted on the Internet like forgery, tampering, altering, etc. In this paper we proposed invisible fragile watermark and MD-5 based algorithm for digital image authenticating and tampers detecting in the Discrete Wavelet Transform DWT domain. The digital image is decomposed using 2-level DWT and the middle and high frequency sub-bands are used for watermark and digital signature embedding. The authentication data are embedded in number of the coefficients of these sub-bands according to the adaptive threshold based on the watermark length and the coefficients of each DWT level. These sub-bands are used because they a

View Publication Preview PDF

Publication Date

Mon Apr 01 2019

Journal Name

Journal Of Educational And Psychological Researches

The Teaching Practices of Faculty Members in Northern Border University According to the Brain-Based Learning Theory

Teaching practices

brain based learning theory

northern border university

Musaab bin Mutlaq A-inazi

...Show More Authors

The present study aims to identify the most and the least common teaching practices among faculty members in Northern Border University according to brain-based learning theory, as well as to identify the effect of sex, qualifications, faculty type, and years of experiences in teaching practices. The study sample consisted of (199) participants divided into 100 males and 99 females. The study results revealed that the most teaching practice among the study sample was ‘I am trying to create an Environment of encouragement and support within the classroom which found to be (4.4623). As for the least teaching practice was ‘I use a natural musical sounds to create student's mood to learn’ found to be (2.2965). The study results also in

View Publication Preview PDF

Publication Date

Wed Jan 17 2018

Journal Name

Journal Of Engineering And Applied Sciences

PREPARATION, CHARACTERIZATION AND THERMAL ANALYSIS OF POLYMERIC BLEND NANOCOMPOSITES BASED ON PVA-PVPPEGDOPED WITH ZINC OXIDE NANOPARTICLES

Ahlaam Jassim

...Show More Authors

Publication Date

Sun Mar 01 2026

Journal Name

Case Studies In Thermal Engineering

Dynamic thermal modelling of energy storage unit based on PCM under single or two-phase flow conditions

Karima E.

Abdelhamid

...Show More Authors

Phase change materials are extensively studied for use in low-, mid-, and high-temperature applications due to their melting and solidification temperatures, latent heat, and thermophysical properties. This work aims to explore the energy stored, or released and their duration for the energy storage unit formed of a phase change material surrounding a tube within which a hot or cold, single or Two-Phase fluid flows, serving as a heat source or sink. The 3D axial transient thermal analysis of the energy storage unit is performed using the finite element method via a MATLAB-developed computer program. The effects of single- or Two-Phase fluid flow on temperature distribution, solidification, melting duration, and energy stored within phase ch

View Publication

1 2 ... 123 124 125 126 ... 722 723