Advances in Document Clustering with Evolutionary-Based Algorithms

Sarmad Makki

doi:10.3844/ajassp.2015.689.708

Details

Publication Date

Fri Oct 02 2015

Journal Name

American Journal Of Applied Sciences

Volume

12

Issue Number

12

DOI

10.3844/ajassp.2015.689.708

Choose Citation Style

Statistics

View publication

17

Statistics

(2)

Advances in Document Clustering with Evolutionary-Based Algorithms

Text Document Clustering

Hypertext Clustering

Evolutionary Algorithms

Genetic Algorithms

Text Dimensional Reduction

Sarmad Makki

...Show More Authors

Document clustering is the process of organizing a particular electronic corpus of documents into subgroups of similar text features. Formerly, a number of conventional algorithms had been applied to perform document clustering. There are current endeavors to enhance clustering performance by employing evolutionary algorithms. Thus, such endeavors became an emerging topic gaining more attention in recent years. The aim of this paper is to present an up-to-date and self-contained review fully devoted to document clustering via evolutionary algorithms. It firstly provides a comprehensive inspection to the document clustering model revealing its various components with its related concepts. Then it shows and analyzes the principle research work in this topic. Finally, it compiles and classifies various objective functions, the core of the evolutionary algorithms, from the related collection of research papers. The paper ends up by addressing some important issues and challenges that can be subject of future work.

View Publication

Publication Date

Mon Dec 20 2021

Journal Name

Baghdad Science Journal

Recurrent Stroke Prediction using Machine Learning Algorithms with Clinical Public Datasets: An Empirical Performance Evaluation

Artificial Neural Network

Bayesian Rule List

Machine Learning

Recurrent Stroke Prediction

Support Vector Machine

Fadratul Hafinaz

Mohd Adib

...Show More Authors

Recurrent strokes can be devastating, often resulting in severe disability or death. However, nearly 90% of the causes of recurrent stroke are modifiable, which means recurrent strokes can be averted by controlling risk factors, which are mainly behavioral and metabolic in nature. Thus, it shows that from the previous works that recurrent stroke prediction model could help in minimizing the possibility of getting recurrent stroke. Previous works have shown promising results in predicting first-time stroke cases with machine learning approaches. However, there are limited works on recurrent stroke prediction using machine learning methods. Hence, this work is proposed to perform an empirical analysis and to investigate machine learning al

View Publication Preview PDF

(13)

(7)

Publication Date

Sat Jan 01 2022

Journal Name

Ieee Access

Wrapper and Hybrid Feature Selection Methods Using Metaheuristic Algorithms for English Text Classification: A Systematic Review

Metaheuristics

Feature extraction

Text categorization

Classification algorithms

Systematics

Search problems

Business

Osamah Mohammed

Yu-N

Ammar Kamal

Omar Mustafa

...Show More Authors

Feature selection (FS) constitutes a series of processes used to decide which relevant features/attributes to include and which irrelevant features to exclude for predictive modeling. It is a crucial task that aids machine learning classifiers in reducing error rates, computation time, overfitting, and improving classification accuracy. It has demonstrated its efficacy in myriads of domains, ranging from its use for text classification (TC), text mining, and image recognition. While there are many traditional FS methods, recent research efforts have been devoted to applying metaheuristic algorithms as FS techniques for the TC task. However, there are few literature reviews concerning TC. Therefore, a comprehensive overview was systematicall

View Publication Preview PDF

(57)

(51)

Publication Date

Mon Apr 01 2024

Journal Name

Chemical Engineering Research And Design

Treatment of petroleum refinery wastewater by a combination of anodic oxidation with photocatalyst process: Recent advances, affecting factors and future perspectives

Husham M.

Khalid A.

Ali H.

...Show More Authors

View Publication

(11)

(10)

Publication Date

Wed Oct 26 2022

Journal Name

Iraqi Journal Of Science

Gene Expression Analysis via Spatial Clustering and Evaluation Indexing

Basad

...Show More Authors

The density-based spatial clustering for applications with noise (DBSCAN) is one of the most popular applications of clustering in data mining, and it is used to identify useful patterns and interesting distributions in the underlying data. Aggregation methods for classifying nonlinear aggregated data. In particular, DNA methylations, gene expression. That show the differentially skewed by distance sites and grouped nonlinearly by cancer daisies and the change Situations for gene excretion on it. Under these conditions, DBSCAN is expected to have a desirable clustering feature i that can be used to show the results of the changes. This research reviews the DBSCAN and compares its performance with other algorithms, such as the tradit

View Publication

(4)

Publication Date

Thu Apr 01 2021

Journal Name

Applied Soft Computing

Evolutionary multi-objective set cover problem for task allocation in the Internet of Things

Hussein M.

Bara’a A.

Amenah D.

Mustafa N.

Mayyadah

...Show More Authors

(7)

(6)

Publication Date

Thu Apr 01 2021

Journal Name

Applied Soft Computing

Evolutionary multi-objective set cover problem for task allocation in the Internet of Things

Hussein M.

Bara’a A.

Amenah D.

Mustafa N.

Mayyadah

...Show More Authors

View Publication

(7)

(6)

Publication Date

Fri Mar 01 2024

Journal Name

Iaes International Journal Of Artificial Intelligence (ij-ai)

Analyzing the behavior of different classification algorithms in diabetes prediction

Israa N.

...Show More Authors

<span lang="EN-US">Diabetes is one of the deadliest diseases in the world that can lead to stroke, blindness, organ failure, and amputation of lower limbs. Researches state that diabetes can be controlled if it is detected at an early stage. Scientists are becoming more interested in classification algorithms in diagnosing diseases. In this study, we have analyzed the performance of five classification algorithms namely naïve Bayes, support vector machine, multi layer perceptron artificial neural network, decision tree, and random forest using diabetes dataset that contains the information of 2000 female patients. Various metrics were applied in evaluating the performance of the classifiers such as precision, area under the c

View Publication

(1)

Publication Date

Tue Dec 03 2024

Journal Name

Adab Al-basrah

The Time Machine: Scientific Advances and Social Milieus in H. G. Well's Vision of the Future

Darwinism -H.G. Wells- modernity -science fiction -scientific progress -Victorian Era

Taisir Abdulhafed Abdulrahman

...Show More Authors

This study is qualitative, it illustrates H.G. Wells\\'s The Time Machine through the scientific and social framework of the Victorian Era. Wells\\'s portrayal of future societies examines the rapid technological progress and social changes of the 19th century. The analysis scrutinizes the division between the Eloi and the Morlocks, tracing the consequences of social division. To meet the objective of the study, Victorian frame of mind is utilized to examine the class struggle that is symbolized by the Eloi and the Morlocks. The analysis highlights the economic and social effects of industrialization and how Wells examines the capitalist system and its impact on human relationships and class division. The study also utilizes concepts from D

Preview PDF

Publication Date

Sat Apr 01 2023

Journal Name

Science Of The Total Environment

Recent advances and applicable flexibility potential of electrochemical processes for wastewater treatment

Forat Yasir

Shaymaa A.

Hasan F.

Ahmed Samir

Haider M.

Ali Dawood

Tatjána

Sebestyen

B.

Phuoc-Cuong

D. Duong

S. Woong

Myoung-Jin

Huu Hao

D. Duc

...Show More Authors

This study examined >140 relevant publications from the last few years (2018–2021). In this study, classification was reviewed depending on the operation's progress. Electrocoagulation (EC), electrooxidation (EO), electroflotation (EF), electrodialysis (ED), and electro-Fenton (EFN) processes have received considerable attention. The type of action (individual or hybrid) for each electrochemical procedure was evaluated, and statistical analysis was performed to compare them as a new manner of reviewing cited papers providing a massive amount of information efficiently to the readers. Individual or hybrid operation progress of the electrochemical techniques is critical issues. Their design, operation, and maintenance costs vary depending o

View Publication

(81)

(83)

Publication Date

Wed Aug 27 2025

Journal Name

Baghdad Science Journal

A Clustering Technique Based on the Hard K-Means (H.KM.) Method to Determine the Governorate That Have More Influence for Spreading COVID-19 in the Kingdom of Saudi Arabia

Rand Muhaned

Wurood R. Abd

Iden Hassan

...Show More Authors

View Publication

1 2 ... 10 11 12 13 ... 1777 1778