Graph based text representation for document clustering

Asma Khazaal Abdulsahib; Siti Sakira Kamaruddin

Details

Publication Date

Thu Jan 01 2015

Journal Name

Journal Of Theoretical And Applied Information Technology

Volume

76

Issue Number

1

Text Representation Schemes

Dependency Graph

Document Clustering

Sparsity Problem

Semantic Problem.

Advances in digital technology and the World Wide Web have led to an increase in digital documents used for purposes such as publishing and digital libraries. This growth calls for effective techniques to support the search and retrieval of text. One of the most needed tasks is clustering, which automatically groups documents into meaningful categories. Clustering is an important task in data mining and machine learning, and its accuracy depends heavily on the chosen text representation method. Traditional methods model documents as bags of words using term frequency-inverse document frequency (TFIDF). This approach ignores the relationships between words and their meanings, so the sparsity and semantic problems prevalent in textual documents remain unresolved. This study reduces the sparsity and semantic problems by proposing a graph-based text representation method, the dependency graph, with the aim of improving the accuracy of document clustering. The dependency graph representation is built by combining syntactic and semantic analysis. A sample of the 20 Newsgroups dataset was used. The text documents undergo pre-processing and syntactic parsing to identify sentence structure, and the semantics of words are then modeled using a dependency graph. The resulting dependency graph is used in cluster analysis with the K-means clustering technique. The dependency graph based clustering results were compared with those of popular text representation methods, namely TFIDF and ontology-based text representation. The results show that the dependency graph outperforms both. The findings indicate that the proposed text representation method leads to more accurate document clustering results.
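
The contrast the abstract draws between bag-of-words TFIDF and a dependency graph can be illustrated with a minimal sketch. It is not the authors' implementation: the toy corpus, the hand-written dependency triples (standing in for real parser output), and the Jaccard edge-overlap similarity are all assumptions made for illustration, since the abstract does not specify the graph construction or the similarity measure.

```python
import math
from collections import Counter

# Toy corpus (hypothetical; not taken from the 20 Newsgroups sample).
docs = ["dog bites man", "man bites dog", "rocket reaches orbit"]

# Hand-written (head, relation, dependent) triples standing in for the
# output of a syntactic parser; the relation labels are assumptions.
parses = [
    [("bites", "nsubj", "dog"), ("bites", "obj", "man")],
    [("bites", "nsubj", "man"), ("bites", "obj", "dog")],
    [("reaches", "nsubj", "rocket"), ("reaches", "obj", "orbit")],
]

def tfidf(corpus):
    """Return one {term: weight} dict per document, weight = tf * log(N / df)."""
    tokenized = [doc.split() for doc in corpus]
    n_docs = len(tokenized)
    df = Counter(term for tokens in tokenized for term in set(tokens))
    vectors = []
    for tokens in tokenized:
        tf = Counter(tokens)
        vectors.append(
            {t: (c / len(tokens)) * math.log(n_docs / df[t]) for t, c in tf.items()}
        )
    return vectors

def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0

def jaccard(a, b):
    """Overlap of two labelled edge sets (one simple graph similarity)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0

vecs = tfidf(docs)
graphs = [set(triples) for triples in parses]

# Documents 0 and 1 contain the same words, so their TFIDF vectors coincide,
# while their labelled dependency edges share nothing.
print("TFIDF cosine, doc 0 vs doc 1:", round(cosine(vecs[0], vecs[1]), 3))
print("Graph Jaccard, doc 0 vs doc 1:", jaccard(graphs[0], graphs[1]))
```

In this sketch the two sentences with swapped subject and object are indistinguishable under TFIDF but fully distinct as labelled graphs, which is the kind of relational information the abstract says bag-of-words models discard. Pairwise similarities of this sort could then feed a clustering step such as the K-means technique the study reports using.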

Publication Date

Sun Sep 11 2022

Journal Name

Electronics

IoT-Based Motorbike Ambulance: Secure and Efficient Transportation

Halah Hasan

Abed Saif

Marwan Kadhim Mohammed

Gehad Abdullah

Khaled H.

Mohammed A. A.

The predilection for 5G telemedicine networks has piqued the interest of industry researchers and academics. The most significant barrier to global telemedicine adoption is to achieve a secure and efficient transport of patients, which has two critical responsibilities. The first is to get the patient to the nearest hospital as quickly as possible, and the second is to keep the connection secure while traveling to the hospital. As a result, a new network scheme has been suggested to expand the medical delivery system, which is an agile network scheme to securely redirect ambulance motorbikes to the nearest hospital in emergency cases. This research provides a secured and efficient telemedicine transport strategy compatible with the

Publication Date

Mon Jan 01 2018

Journal Name

International Journal Of Electronic Security And Digital Forensics

LSB based audio steganography preserving minimum sample SNR

Mohammed A.

Publication Date

Sat Oct 01 2022

Journal Name

Therapeutic Delivery

Particles-based Medicated Wound Dressings: A Comprehensive Review

Kawther K

Amaraporn

Publication Date

Mon Jan 01 2024

Journal Name

Journal Of Engineering

Face-based Gender Classification Using Deep Learning Model

Alex-Net

CLAHE

Deep learning

Gender Classification

Buraq Abed Ruda

Faten Abed Ali

Gender classification is a critical task in computer vision. This task holds substantial importance in various domains, including surveillance, marketing, and human-computer interaction. In this work, the proposed face gender classification model consists of three main phases. The first phase applies the Viola-Jones algorithm to detect facial images, which includes four steps: 1) Haar-like features, 2) Integral Image, 3) Adaboost Learning, and 4) Cascade Classifier. In the second phase, four pre-processing operations are employed, namely cropping, resizing, converting the image from RGB color space to LAB color space, and enhancing the images using (HE, CLAHE). The final phase involves utilizing Transfer lea

Publication Date

Tue Nov 01 2016

Journal Name

Research Journal Of Pharmaceutical, Biological And Chemical Sciences

Treating of oil-based drill cuttings by earthworms

Bio treatment

Drill cutting

Earthworms

Environmental Protection

AA

Khalid M.

This study assessed the advantage of using earthworms in combination with punch waste and nutrients in remediating drill cuttings contaminated with hydrocarbons. Analyses were performed on days 0, 7, 14, 21, and 28 of the experiment. Two hydrocarbon concentrations were used (20000 mg/kg and 40000 mg/kg) with three groups of five, ten, and twenty earthworms. After 28 days, the total petroleum hydrocarbon (TPH) concentration of 20000 mg/kg was reduced to 13200 mg/kg, 9800 mg/kg, and 6300 mg/kg in treatments with five, ten, and twenty earthworms respectively. Likewise, the TPH concentration of 40000 mg/kg was reduced to 22000 mg/kg, 10100 mg/kg, and 4200 mg/kg in treatments with the same numbers of earthworms respectively. The p

Publication Date

Mon Mar 01 2021

Journal Name

Iop Conference Series: Materials Science And Engineering

Speech Enhancement Algorithm Based on a Hybrid Estimator

Basheera M.

Sadiq H.

Marwah A.

Muntadher

Jamila

Speech is the essential way to interact between humans or between human and machine. However, it is always contaminated with different types of environmental noise. Therefore, speech enhancement algorithms (SEA) have emerged as a significant approach in the speech processing field to suppress background noise and recover the original speech signal. In this paper, a new efficient two-stage SEA with low distortion is proposed based on the minimum mean square error sense. The estimation of the clean signal is performed by taking advantage of Laplacian speech and noise modeling based on the distribution of orthogonal transform (Discrete Krawtchouk-Tchebichef transform) coefficients. The Discrete Kra

Publication Date

Fri Sep 01 2023

Journal Name

Journal Of Engineering

EMG-Based Control of Active Ankle-Foot Prosthesis

Prosthetic

Dorsiflexion

Plantar flexion

Inversion

Eversion

Ankle joint

Ruaa

Mohsin A.

Most below-knee prostheses are manufactured in Iraq without considering the fast progress in smart prostheses, which can offer movements in the desired directions according to the type of control system designed for this purpose. The proposed design appears to have the advantages of simplicity, affordability, better load distribution, suitability for subjects with transtibial amputation, and viability in countries with people having low socio-economic status. The designed prosthetics consisted of foot, ball, and socket joints, two stepper motors, a linkage system, and an EMG shield. All these materials were available in the local markets in Iraq. The experimental results showed t

Publication Date

Sat Jul 31 2021

Journal Name

Brain Sciences

Robust EEG Based Biomarkers to Detect Alzheimer’s Disease

Ali

Marina

Shaymaa S.

Chima S.

Emmanuel

Lingfen

Emmanuel

Biomarkers to detect Alzheimer’s disease (AD) would enable patients to gain access to appropriate services and may facilitate the development of new therapies. Given the large numbers of people affected by AD, there is a need for a low-cost, easy to use method to detect AD patients. Potentially, the electroencephalogram (EEG) can play a valuable role in this, but at present no single EEG biomarker is robust enough for use in practice. This study aims to provide a methodological framework for the development of robust EEG biomarkers to detect AD with a clinically acceptable performance by exploiting the combined strengths of key biomarkers. A large number of existing and novel EEG biomarkers associated with slowing of EEG, reductio

Publication Date

Tue Jun 23 2020

Journal Name

Baghdad Science Journal

Content Based Image Retrieval (CBIR) by Statistical Methods

Content Based Image Retrieval

Histogram statistical characteristics

Test of- T

Trademark Image Retrieval

Fathala

An image retrieval system is a computer system for browsing, searching, and retrieving images from a large database of digital images. The objective of Content-Based Image Retrieval (CBIR) methods is essentially to extract, from large image databases, a specified number of images similar in visual and semantic content to a so-called query image. The researchers developed a new mechanism for retrieval systems that is based mainly on two procedures. The first procedure extracts the statistical features of both the original and traditional images using the histogram and statistical characteristics (mean, standard deviation). The second procedure relies on the T-

Publication Date

Thu Aug 01 2019

Journal Name

2019 2nd International Conference On Engineering Technology And Its Applications (iiceta)

Human Gait Identification System Based on Average Silhouette

Mohanad Hazim Nsaif

Nawaf Hazim

Sinan Sameer Mahmood
