Graph based text representation for document clustering

Asma Khazaal Abdulsahib Abdulsahib; SITI SAKIRA KAMARUDDIN KAMARUDDIN

Details

Publication Date

Thu Jan 01 2015

Journal Name

Journal Of Theoretical And Applied Information Technology

Volume

76

Issue Number

1

Choose Citation Style

Statistics

View publication

5

View pdf

3

Statistics

(15)

Graph based text representation for document clustering

Text Representation Schemes

Dependency Graph

Document Clustering

Sparsity Problem

Semantic Problem.

Asma Khazaal Abdulsahib Abdulsahib

SITI SAKIRA KAMARUDDIN KAMARUDDIN

...Show More Authors

Advances in digital technology and the World Wide Web has led to the increase of digital documents that are used for various purposes such as publishing and digital library. This phenomenon raises awareness for the requirement of effective techniques that can help during the search and retrieval of text. One of the most needed tasks is clustering, which categorizes documents automatically into meaningful groups. Clustering is an important task in data mining and machine learning. The accuracy of clustering depends tightly on the selection of the text representation method. Traditional methods of text representation model documents as bags of words using term-frequency index document frequency (TFIDF). This method ignores the relationship and meanings of words in the document. As a result the sparsity and semantic problem that is prevalent in textual document are not resolved. In this study, the problem of sparsity and semantic is reduced by proposing a graph based text representation method, namely dependency graph with the aim of improving the accuracy of document clustering. The dependency graph representation scheme is created through an accumulation of syntactic and semantic analysis. A sample of 20 news groups, dataset was used in this study. The text documents undergo pre-processing and syntactic parsing in order to identify the sentence structure. Then the semantic of words are modeled using dependency graph. The produced dependency graph is then used in the process of cluster analysis. K-means clustering technique was used in this study. The dependency graph based clustering result were compared with the popular text representation method, i.e. TFIDF and Ontology based text representation. The result shows that the dependency graph outperforms both TFIDF and Ontology based text representation. The findings proved that the proposed text representation method leads to more accurate document clustering results.

Preview PDF

Quick Preview PDF

Publication Date

Sat Mar 29 2014

Journal Name

International Journal Of Academic Research In Progressive Education And Development

The Effects of Problem-Based Learning on Self-Directed Learning Skills among Physics Undergraduates

Keywords: Self-Directed Learning Skills

Problem-Based Learning

PBL With Lecture Method

Conventional Teaching

Majed Saleem Aziz

Ahmad Nurulazam Md. Zain

Mohd Ali Bin Samsudin

Salmiza Binti Saleh

...Show More Authors

The aim of this study is to compare the effects of three methods: problem-based learning (PBL), PBL with lecture method, and conventional teaching on self-directed learning skills among physics undergraduates. The actual sample size comprises of 122 students, who were selected randomly from the Physics Department, College of Education in Iraq. In this study, the pre- and post-test were done and the instruments were administered to the students for data collection. The data was analyzed and statistical results rejected null hypothesis of this study. This study revealed that there are no signifigant differences between PBL and PBL with lecture method, thus the PBL without or with lecture method enhances the self-directed learning skills bette

Publication Date

Sun Jan 01 2012

Journal Name

Tikrit Journal For Dental Sciences

Microleakage Evaluation of a Silorane-Based and Methacrylate-Based Packable and Nanofill Posterior Composites (in vitro comparative study)

silorane

Filtek™ P90

Filtek™ P60

microleakage

Manhal

...Show More Authors

This study compared in vitro the microleakage of a new low shrink silorane-based posterior composite (Filtek™ P90) and two methacrylate-based composites: a packable posterior composite (Filtek™ P60) and a nanofill composite (Filtek™ Supreme XT) through dye penetration test. Thirty sound human upper premolars were used in this study. Standardized class V cavities were prepared at the buccal surface of each tooth. The teeth were then divided into three groups of ten teeth each: (Group 1: restored with Filtek™ P90, Group 2: restored with Filtek™ P60, and Group 3: restored with Filtek™ Supreme XT). Each composite system was used according to the manufacturer's instructions with their corresponding adhesive systems. The teeth were th

Preview PDF

Publication Date

Wed Jun 26 2019

Journal Name

Journal Of Mechanics Of Continua And Mathematical Sciences

VSM Based Models and Integration of Exact and Fuzzy Similarity For Improving Detection of External Textual Plagiarism admin June 29, 2019

Nasreen J.

...Show More Authors

View Publication

Publication Date

Sat Dec 02 2017

Journal Name

Al-khwarizmi Engineering Journal

Design of a Programmable System for Failure Modes and Effect Analysis of Steam-Power Plant Based on the Fault Tree Analysis

Keywords: Fault Tree

Reliability

Maintainability

Industrial Systems

Failure Mode and Effect Analysis

Diagnostic Expert System

Steam Power Plant

Soroor K. Hussain

Nihad M. A.

Zuhair I.

...Show More Authors

In this paper, the system of the power plant has been investigated as a special type of industrial systems, which has a significant role in improving societies since the electrical energy has entered all kinds of industries, and it is considered as the artery of modern life.

The aim of this research is to construct a programming system, which could be used to identify the most important failure modes that are occur in a steam type of power plants. Also the effects and reasons of each failure mode could be analyzed through the usage of this programming system reaching to the basic events (main reasons) that causing each failure mode. The construction of this system for FMEA is dependi

View Publication Preview PDF

Publication Date

Thu Jun 01 2023

Journal Name

Journal Of Engineering

A Control Program for Hydropower Operation Based on Minimizing the Principal Stress Values on the Dam Body: Mosul Dam Case Study

Francis Turbine

ANSYS

Principal Stress

Mosul Dam

Hussain Ali

Ameen Mohammed

...Show More Authors

This study examines the vibrations produced by hydropower operations to improve embankment dam safety. This study consists of two parts: In the first part, ANSYS-CFX was used to generate a three-dimensional (3-D) finite volume (FV) model to simulate a vertical Francis turbine unit in the Mosul hydropower plant. The pressure pattern result of the turbine model was transformed into the dam body to show how the turbine unit's operation affects the dam's stability. The upstream reservoir conditions, various flow rates, and fully open inlet gates were considered. In the second part of this study, a 3-D FE Mosul dam model was simulated using an ANSYS program. The operational turbine model's water pressure pattern is conveyed t

View Publication Preview PDF

Publication Date

Tue Mar 21 2023

Journal Name

Biomedical And Pharmacology Journal

Development and Validation of HPLC Method For the Detection of Fusidic Acid Loaded in Non-ionic and Cationic Nanoemulsion-Based Gels

Hasan H.J.

Mwafaq M.

...Show More Authors

Fusidic acid (FA) is a well-known pharmaceutical antibiotic used to treat dermal infections. This experiment aimed for developing a standardized HPLC protocol to determine the accurate concentration of fusidic acid in both non-ionic and cationic nano-emulsion based gels. For this purpose, a simple, precise, accurate approach was developed. A column with reversed-phase C18 (250 mm x 4.6 mm ID x 5 m) was utilized for the separation process. The main constituents of the HPLC mobile phase were composed of water: acetonitrile (1: 4); adjusted at pH 3.3. The flow rate was 1.0 mL/minute. The optimized wavelength was selected at 235 nm. This approach achieved strong linearity for alcoholic solutions of FA when loaded at a serial concentrati

View Publication

(4)

(2)

Publication Date

Wed Jan 02 2019

Journal Name

Journal Of Educational And Psychological Researches

A training program for chemistry teachers based on the knowledge economy and its impact on the productive thinking of their students

productive thinking

training program

knowledge economy

mazin kasim hilal

zainb aziz ahmed

sarmad bahjat deka

...Show More Authors

The current research aims to build a training program for chemistry teachers based on the knowledge economy and its impact on the productive thinking of their students. To achieve the objectives of the research, the following hypothesis was formulated:

There is no statistically significant difference at (0.05) level of significance between the average grades of the students participating in the training program according to the knowledge economy and the average grades of the students who did not participate in the training program in the test of productive thinking. The study sample consisted of (288) second intermediate grade students divided into (152) for the control group

View Publication Preview PDF

Publication Date

Thu Feb 01 2024

Journal Name

International Journal Of Biological Macromolecules

A novel designed nanofibrous mat based on hydroxypropyl methyl cellulose incorporating mango peel extract for potential use in wound care system

Hanan Adnan Shaker

Elham

Marwa M.

Yasir Q.

Mastafa H.

Vahid

Marjan

Fatemeh

...Show More Authors

View Publication

(11)

(8)

Publication Date

Fri Mar 31 2017

Journal Name

Al-khwarizmi Engineering Journal

Design of Nonlinear PID Neural Controller for the Speed Control of a Permanent Magnet DC Motor Model based on Optimization Algorithm

Nonlinear PID Controller

DC Motor

Particle Swarm Optimization

Neural Networks

MATLAB

LabVIEW

Ahmed Sabah

...Show More Authors

In this paper, the speed control of the real DC motor is experimentally investigated using nonlinear PID neural network controller. As a simple and fast tuning algorithm, two optimization techniques are used; trial and error method and particle swarm optimization PSO algorithm in order to tune the nonlinear PID neural controller's parameters and to find best speed response of the DC motor. To save time in the real system, a Matlab simulation package is used to carry out these algorithms to tune and find the best values of the nonlinear PID parameters. Then these parameters are used in the designed real time nonlinear PID controller system based on LabVIEW package. Simulation and experimental results are compared with each other and showe

View Publication Preview PDF

Publication Date

Tue Sep 30 2025

Journal Name

Gsc Advanced Research And Reviews

A comprehensive review of metal-organic framework based biosensors for detection of reactive oxygen species and hydrogen peroxide in biomedical applications

Russol Abdul Salam

Jasim M. S.

Arash

...Show More Authors

Metal-organic frameworks (MOFs) have emerged as revolutionary materials for developing advanced biosensors, especially for detecting reactive oxygen species (ROS) and hydrogen peroxide (H₂O₂) in biomedical applications. This comprehensive review explores the current state-of-the-art in MOF-based biosensors, covering fundamental principles, design strategies, performance features, and clinical uses. MOFs offer unique benefits, including exceptional porosity (up to 10,400 m²/g), tunable structures, biocompatibility, and natural enzyme-mimicking properties, making them ideal platforms for sensitive and selective detection of ROS and H₂O₂. Recent advances have shown significant improvements in detection capabilities, with limit

View Publication

1 2 ... 54 55 56 57 ... 696 697