Graph based text representation for document clustering

Asma Khazaal Abdulsahib; Siti Sakira Kamaruddin

Details

Publication Date

Thu Jan 01 2015

Journal Name

Journal Of Theoretical And Applied Information Technology

Volume

76

Issue Number

1

Text Representation Schemes

Dependency Graph

Document Clustering

Sparsity Problem

Semantic Problem.

Advances in digital technology and the World Wide Web have led to an increase in digital documents used for purposes such as publishing and digital libraries. This growth creates a need for effective techniques to support text search and retrieval. One of the most needed tasks is clustering, which automatically groups documents into meaningful categories. Clustering is an important task in data mining and machine learning, and its accuracy depends heavily on the chosen text representation method. Traditional methods model documents as bags of words weighted by term frequency-inverse document frequency (TF-IDF). This approach ignores the relationships between words and their meanings, so the sparsity and semantic problems prevalent in textual documents remain unresolved. This study reduces the sparsity and semantic problems by proposing a graph-based text representation method, the dependency graph, with the aim of improving the accuracy of document clustering. The dependency graph representation is built by combining syntactic and semantic analysis. A sample of the 20 Newsgroups dataset was used. The text documents undergo pre-processing and syntactic parsing to identify sentence structure, and the semantics of words are then modeled using a dependency graph. The resulting dependency graph is used for cluster analysis with the K-means clustering technique. The dependency graph based clustering results were compared with two popular text representation methods, TF-IDF and ontology-based text representation. The results show that the dependency graph outperforms both. The findings indicate that the proposed text representation method leads to more accurate document clustering results.
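To make the abstract's central claim concrete, the following is a minimal sketch of why a bag-of-words TF-IDF representation can miss word relationships that a dependency-based representation keeps. It is not the authors' implementation. The smoothed IDF formula, the Jaccard edge overlap, the helper names (`tfidf_vectors`, `cosine`, `edge_jaccard`), and the hand-written dependency triples (standing in for the output of a syntactic parser) are all assumptions made for illustration.

```python
import math
from collections import Counter


def tfidf_vectors(docs):
    """Return one sparse {term: weight} dict per tokenized document."""
    n = len(docs)
    # Document frequency: number of documents containing each term.
    df = Counter(term for doc in docs for term in set(doc))
    vectors = []
    for doc in docs:
        tf = Counter(doc)
        # Smoothed IDF is an assumption; the source does not state its weighting.
        vectors.append({
            t: (c / len(doc)) * (math.log((1 + n) / (1 + df[t])) + 1)
            for t, c in tf.items()
        })
    return vectors


def cosine(a, b):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    na = math.sqrt(sum(w * w for w in a.values()))
    nb = math.sqrt(sum(w * w for w in b.values()))
    return dot / (na * nb) if na and nb else 0.0


def edge_jaccard(e1, e2):
    """Jaccard overlap between two sets of dependency edges."""
    union = e1 | e2
    return len(e1 & e2) / len(union) if union else 1.0


# Three toy "documents"; the first two use the same words in opposite roles.
docs = [
    ["dog", "bites", "man"],
    ["man", "bites", "dog"],
    ["stocks", "fell", "sharply"],
]

# Hypothetical (head, relation, dependent) triples, written by hand to stand in
# for parser output; a real pipeline would obtain these from a parser.
dep_edges = [
    {("bites", "nsubj", "dog"), ("bites", "obj", "man")},
    {("bites", "nsubj", "man"), ("bites", "obj", "dog")},
    {("fell", "nsubj", "stocks"), ("fell", "advmod", "sharply")},
]

vecs = tfidf_vectors(docs)
bow_sim = cosine(vecs[0], vecs[1])                    # same words, so near 1
graph_sim = edge_jaccard(dep_edges[0], dep_edges[1])  # no shared edges
unrelated_sim = cosine(vecs[0], vecs[2])              # no shared terms
```

In this toy case the two sentences with reversed roles are indistinguishable under TF-IDF but share no dependency edges, while the unrelated third document shares no terms with the first, which is the sparsity effect in miniature. A clustering step such as K-means would then operate on whichever representation is chosen.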

Publication Date

Mon Jan 09 2023

Journal Name

2023 15th International Conference On Developments In Esystems Engineering (dese)

Deep Learning-Based Skin Cancer Identification

Sandhua M

Abir

Dhiya

Basheera M.

Sadiq H.

Publication Date

Sun Dec 30 2018

Journal Name

Journal Of Engineering

Knowledge-Based Urban Development: The Impact of Knowledge-Based Urban Development in the Growth of Contemporary Cities

knowledge-based urban development

knowledge

knowledge workers

knowledge-based economy

Knowledge City.

Safaa Aldeen H.

Shatha Saleem

Urban development refers to many topics, such as increased population density, city size, individual production, the distribution of technology, and the growth of commercial, industrial, and service professions. Such development is linked to the coordination of social and cultural trends in order to achieve social progress and economic prosperity. Knowledge is now recognized as intellectual capital, which has led the concept of urban development to be extended into many fields of knowledge, for example cultural, social, and human development, to raise the level of community culture to a better standard.

The research adopted the urban transformation based on knowledge as an important factor in gr

Publication Date

Sat Mar 29 2014

Journal Name

International Journal Of Academic Research In Progressive Education And Development

The Effects of Problem-Based Learning on Self-Directed Learning Skills among Physics Undergraduates

Keywords: Self-Directed Learning Skills

Problem-Based Learning

PBL With Lecture Method

Conventional Teaching

Majed Saleem Aziz

Ahmad Nurulazam Md. Zain

Mohd Ali Bin Samsudin

Salmiza Binti Saleh

The aim of this study is to compare the effects of three methods, problem-based learning (PBL), PBL with lecture method, and conventional teaching, on self-directed learning skills among physics undergraduates. The sample comprises 122 students, who were selected randomly from the Physics Department, College of Education in Iraq. Pre- and post-tests were conducted, and the instruments were administered to the students for data collection. The data were analyzed, and the statistical results rejected the null hypothesis of this study. This study revealed that there are no significant differences between PBL and PBL with lecture method; thus PBL, with or without the lecture method, enhances self-directed learning skills bette

Publication Date

Sun Dec 01 2024

Journal Name

Russian Journal Of General Chemistry

Synthesis, Characterization, and Biological Evaluation for New Derivatives Based on 2-Chloro-N-[4-(5-phenyl-1,3,4-oxadiazol-2-yl)phenyl]acetamide

N. M.

K. A.

H. A.

J. H.

H. S.

R. K.

Publication Date

Sat Dec 02 2017

Journal Name

Al-khwarizmi Engineering Journal

Design of a Programmable System for Failure Modes and Effect Analysis of Steam-Power Plant Based on the Fault Tree Analysis

Keywords: Fault Tree

Reliability

Maintainability

Industrial Systems

Failure Mode and Effect Analysis

Diagnostic Expert System

Steam Power Plant

Soroor K. Hussain

Nihad M. A.

Zuhair I.

In this paper, the power plant system is investigated as a special type of industrial system that plays a significant role in improving societies, since electrical energy has entered all kinds of industries and is considered the artery of modern life.

The aim of this research is to construct a programming system that can be used to identify the most important failure modes that occur in steam power plants. The effects and causes of each failure mode can also be analyzed with this programming system, reaching the basic events (main causes) behind each failure mode. The construction of this system for FMEA is dependi

Publication Date

Wed Jun 26 2019

Journal Name

Journal Of Mechanics Of Continua And Mathematical Sciences

VSM Based Models and Integration of Exact and Fuzzy Similarity For Improving Detection of External Textual Plagiarism

Nasreen J.

Publication Date

Fri Oct 02 2020

Journal Name

International Journal Of Pharmaceutical Research

A turbidimetric method for the quantitative determination of cyproheptadine hydrochloride in tablets using an optoelectronic detector based on the LEDs array

Jalal N.

Nagham S. Turkey

Publication Date

Fri Oct 02 2020

Journal Name

International Journal Of Pharmaceutical Research

A turbidimetric method for the quantitative determination of cyproheptadine hydrochloride in tablets using an optoelectronic detector based on the LEDs array

Flow injection analysis

Turbidity

Cyproheptadine hydrochloride (CPH)

Pharmaceutical preparations

Quality control analysis.

Jalal N.

Publication Date

Tue Sep 30 2025

Journal Name

Gsc Advanced Research And Reviews

A comprehensive review of metal-organic framework based biosensors for detection of reactive oxygen species and hydrogen peroxide in biomedical applications

Russol Abdul Salam

Jasim M. S.

Arash

Metal-organic frameworks (MOFs) have emerged as revolutionary materials for developing advanced biosensors, especially for detecting reactive oxygen species (ROS) and hydrogen peroxide (H₂O₂) in biomedical applications. This comprehensive review explores the current state of the art in MOF-based biosensors, covering fundamental principles, design strategies, performance features, and clinical uses. MOFs offer unique benefits, including exceptional porosity (surface areas up to 10,400 m²/g), tunable structures, biocompatibility, and natural enzyme-mimicking properties, making them ideal platforms for sensitive and selective detection of ROS and H₂O₂. Recent advances have shown significant improvements in detection capabilities, with limit
