Graph based text representation for document clustering

Asma Khazaal Abdulsahib Abdulsahib; SITI SAKIRA KAMARUDDIN KAMARUDDIN

Details

Publication Date

Thu Jan 01 2015

Journal Name

Journal Of Theoretical And Applied Information Technology

Volume

76

Issue Number

1

Choose Citation Style

Statistics

View publication

5

View pdf

3

Statistics

(15)

Graph based text representation for document clustering

Text Representation Schemes

Dependency Graph

Document Clustering

Sparsity Problem

Semantic Problem.

Asma Khazaal Abdulsahib Abdulsahib

SITI SAKIRA KAMARUDDIN KAMARUDDIN

...Show More Authors

Advances in digital technology and the World Wide Web has led to the increase of digital documents that are used for various purposes such as publishing and digital library. This phenomenon raises awareness for the requirement of effective techniques that can help during the search and retrieval of text. One of the most needed tasks is clustering, which categorizes documents automatically into meaningful groups. Clustering is an important task in data mining and machine learning. The accuracy of clustering depends tightly on the selection of the text representation method. Traditional methods of text representation model documents as bags of words using term-frequency index document frequency (TFIDF). This method ignores the relationship and meanings of words in the document. As a result the sparsity and semantic problem that is prevalent in textual document are not resolved. In this study, the problem of sparsity and semantic is reduced by proposing a graph based text representation method, namely dependency graph with the aim of improving the accuracy of document clustering. The dependency graph representation scheme is created through an accumulation of syntactic and semantic analysis. A sample of 20 news groups, dataset was used in this study. The text documents undergo pre-processing and syntactic parsing in order to identify the sentence structure. Then the semantic of words are modeled using dependency graph. The produced dependency graph is then used in the process of cluster analysis. K-means clustering technique was used in this study. The dependency graph based clustering result were compared with the popular text representation method, i.e. TFIDF and Ontology based text representation. The result shows that the dependency graph outperforms both TFIDF and Ontology based text representation. The findings proved that the proposed text representation method leads to more accurate document clustering results.

Preview PDF

Quick Preview PDF

Publication Date

Tue Mar 31 2015

Journal Name

Iraqi Journal Of Chemical And Petroleum Engineering

Formation Evaluation for Nasiriyah Oil Field Based on The Non-Conventional Techniques

Nasiriyah Oil Field

quick look techniques

Ayad A.

Antwan M.

Haider Alwan

...Show More Authors

The unconventional techniques called “the quick look techniques”, have been developed to present well log data calculations, so that they may be scanned easily to identify the zones that warrant a more detailed analysis, these techniques have been generated by service companies at the well site which are among the useful, they provide the elements of information needed for making decisions quickly when time is of essence^.The techniques used in this paper are:

Apparent resistivity R_wa
R_xo /R_t

The above two methods had been used to evaluate Nasiriyah oil field formations (well-NS-3) to discover the hydrocarbon bearing formations. A compu

View Publication Preview PDF

Publication Date

Wed Sep 30 2020

Journal Name

Association Of Arab Universities Journal Of Engineering Sciences

Estimation of Minimum Miscibility Pressure for Hydrocarbon Gas Injection Based on EOS

Liqaa I.

Sameera

...Show More Authors

The important parameter used for determining the probable application of miscible displacement is the MMP (minimum miscibility pressure). In enhanced oil recovery, the injection of hydrocarbon gases can be a highly efficient method to improve the productivity of the well especially if miscibility developed through the displacement process. There are a lot of experiments for measuring the value of the miscibility pressure, but they are expensive and take a lot of time, so it's better to use the mathematical equations because of it inexpensive and fast. This study focused on calculating MMP required to inject hydrocarbon gases into two reservoirs namely Sadi and Tanomaa/ East Baghdad field. Modified Peng Robenson Equation of State was

View Publication

Publication Date

Tue Jun 02 2026

Journal Name

Journal Of Advanced Research Design

Sustainable Leaf Plant Disease Based on Salp Swarm Algorithm for Feature Selection

Leaf

Plant Disease

Salp Swarm Algorithm

Hamsa

Yossra

Tarik

Janmenjoy

...Show More Authors

Sustainable plant protection and the economy of plant crops worldwide depend heavily on the health of agriculture. In the modern world, one of the main factors influencing economic growth is the quality of agricultural produce. The need for future crop protection and production is growing as disease-affected plants have caused considerable agricultural losses in several crop categories. The crop yield must be increased while preserving food quality and security and having the most negligible negative environmental impact. To overcome these obstacles, early discovery of satisfactory plants is critical. The use of Advances in Intelligent Systems and information computer science effectively helps find more efficient and low-cost solutions. Thi

View Publication Preview PDF

Publication Date

Sun Mar 01 2020

Journal Name

Materials Research Express

Spray pyrolysis of graphene oxide based composite for optical and wettability applications

Abdulkareem A

Imad H

Hussein A.

...Show More Authors

Abstract<p>In this study, silica-graphene oxide nano–composites were prepared by sol-gel technique and deposited by spray pyrolysis method on glass substrate. The effect of changing the graphene/silica ratio on the optical properties and wetting of these nano–structures has been investigated. The structural and morphological properties of the thin films have been studied by x-ray diffraction spectroscopy (XRD), field emission scanning electron microscope (FESEM), energy dispersive x-ray spectroscopy (EDS) and atomic force microscope (AFM). XRD results show that silica structures present in the synthesized films exhibit amorphous character and there is a poor arrangement in graphene plates al</p> ... Show More

View Publication

(4)

Publication Date

Sat Jun 29 2013

Journal Name

Wireless Personal Communications

A Low Cost Route Optimization Scheme for Cluster-Based Proxy MIPv6 Protocol

Adnan J.

S.

Z.

N. A. W. A.

...Show More Authors

View Publication

(3)

Publication Date

Mon Nov 01 2010

Journal Name

Journal Of Systems And Software

Development of Java based RFID application programmable interface for heterogeneous RFID system

Mohammed F.M.

Mohammed I.

Kamal Z.

Widad

...Show More Authors

View Publication

(7)

(4)

Publication Date

Wed Jul 01 2020

Journal Name

Journal Of Engineering

Bat Algorithm Based an Adaptive PID Controller Design for Buck Converter Model

Bat Algorithm

Buck Converter

State Space Averaging

On-Line-Tuning Optimization

Luay Thamir

...Show More Authors

The aim of this paper is to design a PID controller based on an on-line tuning bat optimization algorithm for the step-down DC/DC buck converter system which is used in the battery operation of the mobile applications. In this paper, the bat optimization algorithm has been utilized to obtain the optimal parameters of the PID controller as a simple and fast on-line tuning technique to get the best control action for the system. The simulation results using (Matlab Package) show the robustness and the effectiveness of the proposed control system in terms of obtaining a suitable voltage control action as a smooth and unsaturated state of the buck converter input voltage of ( ) volt that will stabilize the buck converter sys

View Publication Preview PDF

(1)

Publication Date

Sat Jan 01 2022

Journal Name

Computers, Materials & Continua

An Optimal Method for Supply Chain Logistics Management Based on Neural Network

Mohammed

...Show More Authors

View Publication

(6)

(4)

Publication Date

Sat Dec 01 2018

Journal Name

2018 Third Scientific Conference Of Electrical Engineering (scee)

An Intelligent Cognitive System Design for Mobile Robot based on Optimization Algorithm

Ahmed S.

Khulood E.

Bakir A.

...Show More Authors

View Publication

(8)

Publication Date

Tue Mar 25 2014

Journal Name

Sensors

Minimal Camera Networks for 3D Image Based Modeling of Cultural Heritage Objects

Bashar

M.

G.

Afrah

Luma K.

...Show More Authors

View Publication

(40)

(30)

1 2 ... 28 29 30 31 ... 720 721