Graph based text representation for document clustering

Asma Khazaal Abdulsahib Abdulsahib; SITI SAKIRA KAMARUDDIN KAMARUDDIN

Details

Publication Date

Thu Jan 01 2015

Journal Name

Journal Of Theoretical And Applied Information Technology

Volume

76

Issue Number

1

Choose Citation Style

Statistics

View publication

5

View pdf

3

Statistics

(15)

Graph based text representation for document clustering

Text Representation Schemes

Dependency Graph

Document Clustering

Sparsity Problem

Semantic Problem.

Asma Khazaal Abdulsahib Abdulsahib

SITI SAKIRA KAMARUDDIN KAMARUDDIN

...Show More Authors

Advances in digital technology and the World Wide Web has led to the increase of digital documents that are used for various purposes such as publishing and digital library. This phenomenon raises awareness for the requirement of effective techniques that can help during the search and retrieval of text. One of the most needed tasks is clustering, which categorizes documents automatically into meaningful groups. Clustering is an important task in data mining and machine learning. The accuracy of clustering depends tightly on the selection of the text representation method. Traditional methods of text representation model documents as bags of words using term-frequency index document frequency (TFIDF). This method ignores the relationship and meanings of words in the document. As a result the sparsity and semantic problem that is prevalent in textual document are not resolved. In this study, the problem of sparsity and semantic is reduced by proposing a graph based text representation method, namely dependency graph with the aim of improving the accuracy of document clustering. The dependency graph representation scheme is created through an accumulation of syntactic and semantic analysis. A sample of 20 news groups, dataset was used in this study. The text documents undergo pre-processing and syntactic parsing in order to identify the sentence structure. Then the semantic of words are modeled using dependency graph. The produced dependency graph is then used in the process of cluster analysis. K-means clustering technique was used in this study. The dependency graph based clustering result were compared with the popular text representation method, i.e. TFIDF and Ontology based text representation. The result shows that the dependency graph outperforms both TFIDF and Ontology based text representation. The findings proved that the proposed text representation method leads to more accurate document clustering results.

Preview PDF

Quick Preview PDF

Publication Date

Fri Jan 01 2021

Journal Name

Ieee Access

IFFT-Based Microwave Non-Destructive Testing for Delamination Detection and Thickness Estimation

Ghassan N.

Muhammad Firdaus

...Show More Authors

View Publication

(22)

(20)

Publication Date

Sun Jul 01 2018

Journal Name

Ieee Transactions On Intelligent Transportation Systems

Real-Time Intersection-Based Segment Aware Routing Algorithm for Urban Vehicular Networks

Communication overhead

VANETs

segment aware.

Yusor Rafid Bahar

Nor Fadzilah

Omar Adil

Suleman

Mahamod

Mohsen

Syed Hassan

...Show More Authors

High vehicular mobility causes frequent changes in the density of vehicles, discontinuity in inter-vehicle communication, and constraints for routing protocols in vehicular ad hoc networks (VANETs). The routing must avoid forwarding packets through segments with low network density and high scale of network disconnections that may result in packet loss, delays, and increased communication overhead in route recovery. Therefore, both traffic and segment status must be considered. This paper presents real-time intersection-based segment aware routing (RTISAR), an intersection-based segment aware algorithm for geographic routing in VANETs. This routing algorithm provides an optimal route for forwarding the data packets toward their destination

View Publication

(70)

(61)

Publication Date

Sat Mar 31 2018

Journal Name

Journal Of Engineering

Estimation of Minimum Miscibility Pressure for 〖CO〗_2 Flood Based on EOS

minimum miscibility pressure

CO2

PR-EOS

differential liberation.

Sameera

Samaher A.

Ali A.

...Show More Authors

CO₂ Gas is considered one of the unfavorable gases and it causes great air pollution. It’s possible to decrease this pollution by injecting gas in the oil reservoirs to provide a good miscibility and to increase the oil recovery factor. MMP was estimated by Peng Robinson equation of state (PR-EOS). South Rumila-63 (SULIAY) is involved for which the miscible displacement by is achievable based on the standard criteria for success EOR processes. A PVT report was available for the reservoir under study. It contains deferential liberation (DL) and constant composition expansion (CCE) tests. PVTi software is one of the (Eclipse V.2010) software’s packages, it has been used to achieve the goal.

View Publication Preview PDF

Publication Date

Sun Dec 01 2019

Journal Name

Baghdad Science Journal

Symmetric- Based Steganography Technique Using Spiral-Searching Method for HSV Color Images

Hierarchal decomposition Image processing

Information hiding

Security Techniques

Spiral search

Steganography.

Raheem Abdul Sahib

...Show More Authors

Steganography is defined as hiding confidential information in some other chosen media without leaving any clear evidence of changing the media's features. Most traditional hiding methods hide the message directly in the covered media like (text, image, audio, and video). Some hiding techniques leave a negative effect on the cover image, so sometimes the change in the carrier medium can be detected by human and machine. The purpose of suggesting hiding information is to make this change undetectable. The current research focuses on using complex method to prevent the detection of hiding information by human and machine based on spiral search method, the Structural Similarity Index Metrics measures are used to get the accuracy and quality

View Publication Preview PDF

(5)

Publication Date

Mon Jan 09 2023

Journal Name

2023 15th International Conference On Developments In Esystems Engineering (dese)

Inverse Kinematics Optimization for Humanoid Robotic Legs Based on Particle Swarm Optimization

inverse kinematics

D-H parameters

optimization

humanoid robotic legs.

Radeaf H.S.

Mohammed Z.

...Show More Authors

Calculating the Inverse Kinematic (IK) equations is a complex problem due to the nonlinearity of these equations. Choosing the end effector orientation affects the reach of the target location. The Forward Kinematics (FK) of Humanoid Robotic Legs (HRL) is determined by using DenavitHartenberg (DH) method. The HRL has two legs with five Degrees of Freedom (DoF) each. The paper proposes using a Particle Swarm Optimization (PSO) algorithm to optimize the best orientation angle of the end effector of HRL. The selected orientation angle is used to solve the IK equations to reach the target location with minimum error. The performance of the proposed method is measured by six scenarios with different simulated positions of the legs. The proposed

View Publication

(2)

Publication Date

Mon Jan 28 2013

Journal Name

Spie Proceedings

Enhancement of security for free space optics based on reconfigurable chaotic technique

Lwaa Faisal

...Show More Authors

Free Space Optical (FSO) technology offers highly directional, high bandwidth communication channels. This technology can provide fiber-like data rate over short distances. In order to improve security associated with data transmission in FSO networks, a secure communication method based on chaotic technique is presented. In this paper, we have turned our focus on a specific class of piece wise linear one-dimensional chaotic maps. Simulation results indicate that this approach has the advantage of possessing excellent correlation property. In this paper we examine the security vulnerabilities of single FSO links and propose a solution to this problem by implementing the chaotic signal generator “reconfigurable tent map”. As synchronizat

View Publication

(3)

(2)

Publication Date

Sat Mar 31 2018

Journal Name

Journal Of Engineering

Estimation of Minimum Miscibility Pressure for 〖CO〗_2 Flood Based on EOS

Sameera

Samaher A.

Ali A.

...Show More Authors

CO2 Gas is considered one of the unfavorable gases and it causes great air pollution. It’s possible to decrease this pollution by injecting gas in the oil reservoirs to provide a good miscibility and to increase the oil recovery factor. MMP was estimated by Peng Robinson equation of state (PR-EOS). South Rumila-63 (SULIAY) is involved for which the miscible displacement by is achievable based on the standard criteria for success EOR processes. A PVT report was available for the reservoir under study. It contains deferential liberation (DL) and constant composition expansion (CCE) tests. PVTi software is one of the (Eclipse V.2010) software’s packages, it has been used to achieve the goal. Many trials have been done to ma

Publication Date

Sun Jun 20 2021

Journal Name

Baghdad Science Journal

PDCNN: FRAMEWORK for Potato Diseases Classification Based on Feed Foreword Neural Network

K-means

Gray Level Run Length Matrix

First Order Histogram Features

Scaled Conjugate Gradient Backpropagation

Israa Mohammed

Samar Amil

Musaab

...Show More Authors

The economy is exceptionally reliant on agricultural productivity. Therefore, in domain of agriculture, plant infection discovery is a vital job because it gives promising advance towards the development of agricultural production. In this work, a framework for potato diseases classification based on feed foreword neural network is proposed. The objective of this work is presenting a system that can detect and classify four kinds of potato tubers diseases; black dot, common scab, potato virus Y and early blight based on their images. The presented PDCNN framework comprises three levels: the pre-processing is first level, which is based on K-means clustering algorithm to detect the infected area from potato image. The s

View Publication Preview PDF

(9)

(4)

Publication Date

Tue Jun 02 2026

Journal Name

Journal Of Engineering

Design and Implementation of ICT-Based Recycle-Rewarding System for Green Environment

Mohammed

...Show More Authors

View Publication

Publication Date

Wed Jun 01 2022

Journal Name

Baghdad Science Journal

Advanced GIS-based Multi-Function Support System for Identifying the Best Route

Suhiar Mohammed Zeki

...Show More Authors

Geographic Information Systems (GIS) are obtaining a significant role in handling strategic applications in which data are organized as records of multiple layers in a database. Furthermore, GIS provide multi-functions like data collection, analysis, and presentation. Geographic information systems have assured their competence in diverse fields of study via handling various problems for numerous applications. However, handling a large volume of data in the GIS remains an important issue. The biggest obstacle is designing a spatial decision-making framework focused on GIS that manages a broad range of specific data to achieve the right performance. It is very useful to support decision-makers by providing GIS-based decision support syste

View Publication Preview PDF

(7)

(4)

1 2 ... 29 30 31 32 ... 720 721