Advances in Document Clustering with Evolutionary-Based Algorithms

Sarmad Makki

doi:10.3844/ajassp.2015.689.708

Details

Publication Date

Fri Oct 02 2015

Journal Name

American Journal Of Applied Sciences

Volume

12

Issue Number

12

DOI

10.3844/ajassp.2015.689.708

Choose Citation Style

Statistics

View publication

19

Statistics

(2)

Advances in Document Clustering with Evolutionary-Based Algorithms

Text Document Clustering

Hypertext Clustering

Evolutionary Algorithms

Genetic Algorithms

Text Dimensional Reduction

Sarmad Makki

...Show More Authors

Document clustering is the process of organizing a particular electronic corpus of documents into subgroups of similar text features. Formerly, a number of conventional algorithms had been applied to perform document clustering. There are current endeavors to enhance clustering performance by employing evolutionary algorithms. Thus, such endeavors became an emerging topic gaining more attention in recent years. The aim of this paper is to present an up-to-date and self-contained review fully devoted to document clustering via evolutionary algorithms. It firstly provides a comprehensive inspection to the document clustering model revealing its various components with its related concepts. Then it shows and analyzes the principle research work in this topic. Finally, it compiles and classifies various objective functions, the core of the evolutionary algorithms, from the related collection of research papers. The paper ends up by addressing some important issues and challenges that can be subject of future work.

View Publication

Publication Date

Sun Apr 23 2017

Journal Name

International Conference Of Reliable Information And Communication Technology

Classification of Arabic Writer Based on Clustering Techniques

Mohammed S. H.

...Show More Authors

Arabic text categorization for pattern recognitions is challenging. We propose for the first time a novel holistic method based on clustering for classifying Arabic writer. The categorization is accomplished stage-wise. Firstly, these document images are sectioned into lines, words, and characters. Secondly, their structural and statistical features are obtained from sectioned portions. Thirdly, F-Measure is used to evaluate the performance of the extracted features and their combination in different linkage methods for each distance measures and different numbers of groups. Finally, experiments are conducted on the standard KHATT dataset of Arabic handwritten text comprised of varying samples from 1000 writers. The results in the generatio

(6)

Publication Date

Mon Aug 01 2016

Journal Name

2016 38th Annual International Conference Of The Ieee Engineering In Medicine And Biology Society (embc)

Selecting the optimal movement subset with different pattern recognition based EMG control algorithms

Ali H.

Rami N.

Javier

...Show More Authors

View Publication

(4)

(2)

Publication Date

Thu Jan 20 2022

Journal Name

Webology

Hybrid Intrusion Detection System based on DNA Encoding, Teiresias Algorithm and Clustering Method

Intrusion Detection System

DNA Encoding

Clustering Algorithm

UNSW-NB15 Database.

Omar Fitian

Mazin S.

...Show More Authors

Until recently, researchers have utilized and applied various techniques for intrusion detection system (IDS), including DNA encoding and clustering that are widely used for this purpose. In addition to the other two major techniques for detection are anomaly and misuse detection, where anomaly detection is done based on user behavior, while misuse detection is done based on known attacks signatures. However, both techniques have some drawbacks, such as a high false alarm rate. Therefore, hybrid IDS takes advantage of combining the strength of both techniques to overcome their limitations. In this paper, a hybrid IDS is proposed based on the DNA encoding and clustering method. The proposed DNA encoding is done based on the UNSW-NB15

View Publication

(3)

Publication Date

Sun Jun 01 2008

Journal Name

Baghdad Science Journal

Tamper Detection in Text Document

Ali Kadhim

...Show More Authors

Although text document images authentication is difficult due to the binary nature and clear separation between the background and foreground but it is getting higher demand for many applications. Most previous researches in this field depend on insertion watermark in the document, the drawback in these techniques lie in the fact that changing pixel values in a binary document could introduce irregularities that are very visually noticeable. In this paper, a new method is proposed for object-based text document authentication, in which I propose a different approach where a text document is signed by shifting individual words slightly left or right from their original positions to make the center of gravity for each line fall in with the m

View Publication Preview PDF

Publication Date

Fri Aug 23 2024

Journal Name

Aro-the Scientific Journal Of Koya University

Graphical User Authentication Algorithms Based on Recognition

Zena M.

Ahmed T.

Omar Z.

...Show More Authors

In cyber security, the most crucial subject in information security is user authentication. Robust text-based password methods may offer a certain level of protection. Strong passwords are hard to remember, though, so people who use them frequently write them on paper or store them in file for computer .Numerous of computer systems, networks, and Internet-based environments have experimented with using graphical authentication techniques for user authentication in recent years. The two main characteristics of all graphical passwords are their security and usability. Regretfully, none of these methods could adequately address both of these factors concurrently. The ISO usability standards and associated characteristics for graphical

View Publication Preview PDF

(1)

Publication Date

Mon Oct 28 2019

Journal Name

Journal Of Mechanics Of Continua And Mathematical Sciences

Heuristic Initialization And Similarity Integration Based Model for Improving Extractive Multi-Document Summarization

Nasreen

...Show More Authors

View Publication

Publication Date

Mon Dec 05 2022

Journal Name

Baghdad Science Journal

Proposed Framework for Official Document Sharing and Verification in E-government Environment Based on Blockchain Technology

Rana F.

Asia Ali Salman

Shakir Mahmood

...Show More Authors

Progression in Computer networks and emerging of new technologies in this field helps to find out new protocols and frameworks that provides new computer network-based services. E-government services, a modernized version of conventional government, are created through the steady evolution of technology in addition to the growing need of societies for numerous services. Government services are deeply related to citizens’ daily lives; therefore, it is important to evolve with technological developments—it is necessary to move from the traditional methods of managing government work to cutting-edge technical approaches that improve the effectiveness of government systems for providing services to citizens. Blockchain technology is amon

View Publication Preview PDF

(11)

(8)

Publication Date

Fri Apr 01 2022

Journal Name

Baghdad Science Journal

Improved Firefly Algorithm with Variable Neighborhood Search for Data Clustering

Data clustering

Data mining

Firefly algorithm

Machine learning

Variable neighborhood search.

Hayder Naser Khraibet

...Show More Authors

Among the metaheuristic algorithms, population-based algorithms are an explorative search algorithm superior to the local search algorithm in terms of exploring the search space to find globally optimal solutions. However, the primary downside of such algorithms is their low exploitative capability, which prevents the expansion of the search space neighborhood for more optimal solutions. The firefly algorithm (FA) is a population-based algorithm that has been widely used in clustering problems. However, FA is limited in terms of its premature convergence when no neighborhood search strategies are employed to improve the quality of clustering solutions in the neighborhood region and exploring the global regions in the search space. On the

View Publication Preview PDF

(16)

(5)

Publication Date

Tue Sep 08 2020

Journal Name

Baghdad Science Journal

Hiding the Type of Skin Texture in Mice based on Fuzzy Clustering Technique

C-Mean

Extracting

LSB

Information hiding

Steganography

Alaa Noori

Ekhlas Falih

...Show More Authors

A substantial matter to confidential messages' interchange through the internet is transmission of information safely. For example, digital products' consumers and producers are keen for knowing those products are genuine and must be distinguished from worthless products. Encryption's science can be defined as the technique to embed the data in an images file, audio or videos in a style which should be met the safety requirements. Steganography is a portion of data concealment science that aiming to be reached a coveted security scale in the interchange of private not clear commercial and military data. This research offers a novel technique for steganography based on hiding data inside the clusters that resulted from fuzzy clustering. T

View Publication Preview PDF

(5)

Publication Date

Thu May 18 2023

Journal Name

Journal Of Engineering

A Modified Strength Pareto Evolutionary Algorithm 2 based Environmental /Economic Power Dispatch

Genetic algorithm

multi-objectives optimization

power generation dispatch

power generation economic

pareto distributions.

Hassan Abdullah

Saif Sabah

...Show More Authors

A Strength Pareto Evolutionary Algorithm 2 (SPEA 2) approach for solving the multi-objective Environmental / Economic Power Dispatch (EEPD) problem is presented in this paper. In the past fuel cost consumption minimization was the aim (a single objective function) of economic power dispatch problem. Since the clean air act amendments have been applied to reduce SO2 and NOX emissions from power plants, the utilities change their strategies in order to reduce pollution and atmospheric emission as well, adding emission minimization as other objective function made economic power dispatch (EPD) a multi-objective problem having conflicting objectives. SPEA2 is the improved version of SPEA with better fitness assignment, density estimation, an

View Publication Preview PDF

1 2 3 4 ... 1828 1829 1830 1831