Graph based text representation for document clustering

Asma Khazaal Abdulsahib; Siti Sakira Kamaruddin

Details

Publication Date

Thu Jan 01 2015

Journal Name

Journal of Theoretical and Applied Information Technology

Volume

76

Issue Number

1

Text Representation Schemes

Dependency Graph

Document Clustering

Sparsity Problem

Semantic Problem.

Advances in digital technology and the World Wide Web have led to an increase in digital documents used for purposes such as publishing and digital libraries. This growth calls for effective techniques to support the search and retrieval of text. One of the most needed tasks is clustering, which automatically groups documents into meaningful categories. Clustering is an important task in data mining and machine learning, and its accuracy depends heavily on the choice of text representation method. Traditional methods model documents as bags of words weighted by term frequency-inverse document frequency (TF-IDF). This representation ignores the relationships between words and their meanings, so the sparsity and semantic problems prevalent in textual documents remain unresolved. In this study, these problems are reduced by proposing a graph-based text representation method, namely the dependency graph, with the aim of improving the accuracy of document clustering. The dependency graph representation is built by combining syntactic and semantic analysis. A sample of the 20 Newsgroups dataset was used. The text documents undergo pre-processing and syntactic parsing to identify sentence structure, and the semantics of the words are then modeled using a dependency graph. The resulting dependency graph is used in cluster analysis with the k-means clustering technique. The dependency-graph-based clustering results were compared with two popular text representation methods, TF-IDF and ontology-based text representation. The results show that the dependency graph outperforms both, indicating that the proposed text representation method leads to more accurate document clustering.
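The limitation of bag-of-words weighting described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the toy corpus, the hand-written (head, relation, dependent) triples, and the helper names `tfidf_vectors` and `cosine` are all assumptions made for illustration. In a real pipeline the triples would come from a syntactic parser, and the resulting vectors would be passed to k-means, which is omitted here.

```python
import math
from collections import Counter


def tfidf_vectors(token_lists):
    """Return one {term: tf * idf} dict per tokenized document."""
    n_docs = len(token_lists)
    doc_freq = Counter(term for tokens in token_lists for term in set(tokens))
    return [
        {term: count * math.log(n_docs / doc_freq[term])
         for term, count in Counter(tokens).items()}
        for tokens in token_lists
    ]


def cosine(a, b):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(weight * b.get(key, 0.0) for key, weight in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


# Toy corpus (hypothetical). The third document only keeps the idf
# of the shared terms non-zero.
tokens = [
    ["dog", "bites", "man"],
    ["man", "bites", "dog"],
    ["stocks", "fell", "sharply"],
]

# Hand-written (head, relation, dependent) triples standing in for
# parser output; a real system would obtain these from a parser.
edges = [
    [("bites", "nsubj", "dog"), ("bites", "obj", "man")],
    [("bites", "nsubj", "man"), ("bites", "obj", "dog")],
]

bow = tfidf_vectors(tokens)
bow_sim = cosine(bow[0], bow[1])
edge_sim = cosine(Counter(edges[0]), Counter(edges[1]))

print(f"TF-IDF cosine:          {bow_sim:.3f}")
print(f"dependency-edge cosine: {edge_sim:.3f}")
```

The first two documents contain the same words, so their TF-IDF vectors coincide even though the sentences mean different things. The dependency-edge features share no entries, which is the kind of relational information a graph-based representation can preserve.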

Publication Date

Tue Feb 01 2022

Journal Name

Journal of Engineering

Self-Repairing Technique Based on Microcapsules for Cementitious Composites- A Review

Self-repairing

Micro-capsules

Repairing factors

Investigative methodologies

Nondestructive methods

Zainab

Esraa

Tabarek

Self-repairing technology based on micro-capsules is an efficient solution for repairing cracked cementitious composites. Self-repairing based on microcapsules begins with the occurrence of cracks and develops by releasing self-repairing factors in the cracks located in concrete. Based on previous comprehensive studies, this paper provides an overview of various repairing factors and investigative methodologies. There has recently been a lack of consensus on the most efficient criteria for assessing self-repairing based on microcapsules and the smart solutions for improving capsule survival ratios during mixing. The most commonly utilized self-repairing efficiency assessment indicators are mechanical resistance and durab

Publication Date

Sun Jan 01 2023

Journal Name

Computers, Materials & Continua

Severity Based Light-Weight Encryption Model for Secure Medical Information System

Mohammed

Publication Date

Thu Mar 19 2026

Journal Name

International Journal of Mechatronics and Applied Mechanics

Extended State Observer-Based Optimized Synergetic Control for Production-Inventory Systems

Extended state observer

Mountain gazelle optimizer

Production-inventory system

Proportional-integral-derivative controller

Synergetic control

Aws

The efficient exploitation of production-inventory systems is of significant importance in modern industry. This paper explores the dynamic behaviour of such a system when it is controlled by a method called synergetic control (SC). The mathematical model of the system is first constructed, and SC is introduced to improve the responsiveness of the system under time-varying demand. To cope with the unavailability of the system's state signals and to estimate the demand, an extended state observer (ESO) is introduced. Moreover, the mountain gazelle optimizer (MGO) is employed to tune the adjustable design parameters of the SC and the ESO based on

Publication Date

Fri Dec 01 2023

Journal Name

IEEE Antennas and Wireless Propagation Letters

Stabilized and Fast Method for Compressive-Sensing-Based Method of Moments

Yalan

Muhammad Firdaus

Ghassan N.

Publication Date

Sat Dec 31 2011

Journal Name

Al-Khwarizmi Engineering Journal

Back stepping-Based-PID-Controller Designed for an Artificial Pancreas model

Type I diabetes

Backstepping

Bergman’s model

oral glucose tolerance test.

Taghreed M.

Mina Qais

Shaima Mahmou

An artificial pancreas is simulated to handle Type I diabetic patients under intensive care by automatically controlling the insulin infusion rate. A backstepping technique is used to apply the effect of a PID controller to the blood glucose level, since there is no direct relation between insulin infusion (the manipulated variable) and glucose level in Bergman’s system model. The model is subjected to an oral glucose tolerance test in which a meal is applied as a disturbance. The backstepping technique is usually recommended to stabilize and control the states of Bergman's class of nonlinear systems. The results showed very satisfactory behavior of the glucose deviation in response to a sudden rise represented by the meal that increases the blood glucose

Publication Date

Mon Dec 25 2023

Journal Name

IEEE Access

ITor-SDN: Intelligent Tor Networks-Based SDN for Data Forwarding Management

Anonymity

blockchain

ML

SDN

Tor networks

Fouad A.

Nahlah Abdulrahman

Hamed S.

The Tor (The Onion Router) network was designed to enable users to browse the Internet anonymously. It is known for its anonymity and privacy protections against agents who wish to observe users' locations or track their browsing habits. This anonymity stems from the encryption and decryption of Tor traffic. That is, the client’s traffic must be encrypted and decrypted before it is sent and received, which leads to delay and even interruption in data flow. The exchange of cryptographic keys between network devices plays a pivotal role in facilitating secure communication and ensuring the integrity of cryptographic procedures. This essential process is time-consuming, which causes del

Publication Date

Mon Dec 01 2025

Journal Name

Results in Engineering

Kernel-based machine learning intrusion detection systems for ICMPv6 DDoS detection

Abeer Abdullah

Noora Al

Sam

Bilal

Sadiq H.

Abir Jaafar

Publication Date

Tue Aug 27 2024

Journal Name

Algorithms

Multithreading-Based Algorithm for High-Performance Tchebichef Polynomials with Higher Orders

Ahlam Hanoon

Basheera M.

Firas A.

Sadiq H.

Muntadher

Wameedh Nazar

Tchebichef polynomials (TPs) play a crucial role in various fields of mathematics and applied sciences, including numerical analysis, image and signal processing, and computer vision. This is due to the unique properties of the TPs and their remarkable performance. Nowadays, the demand for high-quality images (2D signals) is increasing and is expected to continue growing. The processing of these signals requires the generation of accurate and fast polynomials. The existing algorithms generate the TPs sequentially, and this is considered as computationally costly for high-order and larger-sized polynomials. To this end, we present a new efficient solution to overcome the limitation of sequential algorithms. The presented algorithm us

Publication Date

Mon Nov 09 2020

Journal Name

Journal of the Optical Society of America B

Theoretical study of silicon-based Bragg mirrors for cavity QED applications

J.

S.

R. G.

We conducted a theoretical study on the potential use of amorphous hydrogenated silicon (a-Si:H) as the high-index material in quarter-wave-stack Bragg mirrors for cavity quantum electrodynamics applications. Compared to conventionally employed $\mathrm{Ta}_{2}$

Publication Date

Mon Jan 09 2023

Journal Name

2023 15th International Conference on Developments in eSystems Engineering (DeSE)

Low-Distortion MMSE Estimator for Speech Enhancement Based on Hahn Moments

Ammar S.

Basheera M.

Sadiq H.

Marwah A.

Abir
