Wrapper and Hybrid Feature Selection Methods Using Metaheuristic Algorithms for English Text Classification: A Systematic Review

Osamah Mohammed Alyasiri; Yu-N Cheah; Ammar Kamal Abasi; Omar Mustafa Al-Janabi

doi:10.1109/ACCESS.2022.3165814

Details

Publication Date

Sat Jan 01 2022

Journal Name

IEEE Access

Volume

10

Metaheuristics

Feature extraction

Text categorization

Classification algorithms

Systematics

Search problems

Business

Feature selection (FS) constitutes a series of processes used to decide which relevant features/attributes to include and which irrelevant features to exclude for predictive modeling. It is a crucial task that helps machine learning classifiers reduce error rates, computation time, and overfitting while improving classification accuracy. It has demonstrated its efficacy in a myriad of domains, including text classification (TC), text mining, and image recognition. While there are many traditional FS methods, recent research efforts have been devoted to applying metaheuristic algorithms as FS techniques for the TC task. However, there are few literature reviews concerning TC. Therefore, a comprehensive overview was systematically conducted by exploring available studies of different metaheuristic algorithms used for FS to improve TC. This paper contributes to the body of existing knowledge by answering four research questions (RQs): 1) What are the different approaches of FS that apply metaheuristic algorithms to improve TC? 2) Does applying metaheuristic algorithms for TC lead to better accuracy than the typical FS methods? 3) How effective are the modified, hybridized metaheuristic algorithms for text FS problems? and 4) What are the gaps in the current studies and their future directions? These RQs led to a study of recent works on metaheuristic-based FS methods, their contributions, and limitations. Hence, a final list of thirty-seven (37) related articles was extracted and investigated to align with our RQs to generate new knowledge in the domain of study. Most of the reviewed papers focused on addressing TC in tandem with metaheuristic algorithms based on the wrapper and hybrid FS approaches. Future research should focus on using a hybrid-based FS approach, as it intuitively handles complex optimization problems and potentially provides new research opportunities in this rapidly developing field.
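
To make the wrapper idea in the abstract concrete, the following is a minimal sketch, not a method taken from the reviewed papers. It assumes a small genetic algorithm as the metaheuristic, a nearest-centroid classifier as the wrapped learner, and synthetic "term count" data in which only the first three terms carry class information. All function names, parameters, and the data generator are hypothetical and chosen for illustration only.

```python
import random

def make_data(rng, n_docs=120, n_terms=12, n_informative=3):
    """Synthetic term vectors (assumption): only the first n_informative terms depend on the class."""
    X, y = [], []
    for i in range(n_docs):
        label = i % 2
        row = [rng.gauss(2.0 * label, 1.0) if j < n_informative else rng.gauss(0.0, 3.0)
               for j in range(n_terms)]
        X.append(row)
        y.append(label)
    return X, y

def centroid_accuracy(X_tr, y_tr, X_te, y_te, mask):
    """Wrapper evaluation: fit a nearest-centroid classifier on the selected terms, return held-out accuracy."""
    cols = [j for j, bit in enumerate(mask) if bit]
    if not cols:
        return 0.0  # an empty feature subset cannot classify anything
    cents = {}
    for c in set(y_tr):
        rows = [x for x, t in zip(X_tr, y_tr) if t == c]
        cents[c] = [sum(r[j] for r in rows) / len(rows) for j in cols]
    hits = 0
    for x, t in zip(X_te, y_te):
        pred = min(cents, key=lambda c: sum((x[j] - m) ** 2 for j, m in zip(cols, cents[c])))
        hits += pred == t
    return hits / len(y_te)

def fitness(mask, data, penalty=0.01):
    """Accuracy minus a small penalty on the fraction of selected terms."""
    return centroid_accuracy(*data, mask) - penalty * sum(mask) / len(mask)

def ga_select(data, n_terms, rng, pop_size=20, generations=30, p_mut=0.05):
    """Binary genetic algorithm over feature masks, with elitism."""
    pop = [[rng.randint(0, 1) for _ in range(n_terms)] for _ in range(pop_size)]
    pop[0] = [1] * n_terms  # seed with "no selection" so the result is never worse than it
    best = max(pop, key=lambda m: fitness(m, data))
    for _ in range(generations):
        nxt = [best[:]]  # elitism: carry the best mask forward unchanged
        while len(nxt) < pop_size:
            a = max(rng.sample(pop, 3), key=lambda m: fitness(m, data))  # tournament selection
            b = max(rng.sample(pop, 3), key=lambda m: fitness(m, data))
            child = [p if rng.random() < 0.5 else q for p, q in zip(a, b)]  # uniform crossover
            child = [bit ^ 1 if rng.random() < p_mut else bit for bit in child]  # bit-flip mutation
            nxt.append(child)
        pop = nxt
        best = max(pop, key=lambda m: fitness(m, data))
    return best

rng = random.Random(0)
X, y = make_data(rng)
data = (X[:80], y[:80], X[80:], y[80:])  # simple train / held-out split
best_mask = ga_select(data, n_terms=12, rng=rng)
all_mask = [1] * 12
print("selected terms:", [j for j, bit in enumerate(best_mask) if bit])
print("fitness (selected vs. all terms):", fitness(best_mask, data), fitness(all_mask, data))
```

The defining feature of a wrapper approach is that each candidate subset is scored by actually training and evaluating a classifier, which is why the fitness function calls `centroid_accuracy`. In a real study the score would normally come from cross-validation, with a separate test set held back for the final report, because selecting features on the same split used for reporting gives an optimistic estimate.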

Publication Date

Wed Jan 01 2020

Journal Name

Periodicals Of Engineering And Natural Sciences

Analyzing big data sets by using different panelized regression methods with application: Surveys of multidimensional poverty in Iraq

Big Data

Penalized Regression

Poverty

Ridge Regression

SICA

A.M.

Poverty is a substantial topic that shapes the future of societies and governments and the way they deal with education, health, and the economy. Poverty sometimes takes multidimensional forms through education and health. This research studies multidimensional poverty in Iraq by using penalized regression methods to analyze Big Data sets from demographic surveys collected by the Central Statistical Organization in Iraq. We choose a classical penalized regression method, Ridge Regression, and another penalized method, the Smooth Integration of Counting and Absolute Deviation (SICA), to analyze Big Data sets related to the different forms of poverty in Iraq. Euclidian Distanc

Publication Date

Mon Dec 01 2014

Journal Name

Journal Of Economics And Administrative Sciences

Comparison between some of linear classification models with practical application

Linear discriminant analysis

binary response logistic regression and misclassification probability.

حمزة اسماعيل

Linear discriminant analysis and logistic regression are the most widely used multivariate statistical methods for analyzing data with categorical outcome variables. Both are appropriate for developing linear classification models. Linear discriminant analysis assumes that the explanatory variables follow a multivariate normal distribution, while logistic regression makes no assumptions about the distribution of the explanatory data. Hence, logistic regression is assumed to be the more flexible and more robust method when these assumptions are violated.

In this paper we focus on the comparison between three forms for classification data belongs

Publication Date

Tue Mar 30 2021

Journal Name

Journal Of Economics And Administrative Sciences

Comparison of Some Methods for Estimating Parameters of General Linear Model in Presence of Heteroscedastic Problem and High Leverage Points

Diagnostic Robust Generalized Potential

Robust Heteroscedastic Consistent Covariance Matrix

Masking

Swamping

Qasim Mohammed

Saja Mohammad

Linear regression is one of the most important statistical tools for determining the relationship between a response variable and one or more independent variables, and it is widely used in various fields of science. Heteroscedasticity is one of the problems of linear regression, and it leads to inaccurate conclusions. It may be accompanied by extreme outliers in the independent variables, known as high leverage points (HLPs); the presence of HLPs in the data set results in unrealistic estimates and misleading inferences. In this paper, we review some of the robust

Publication Date

Tue Jan 01 2019

Journal Name

International Journal Of Agricultural And Statistical Sciences

The comparison of several methods for calculating the degree of heritability and calculating the number of genes II. Yield components

Banan

Publication Date

Tue Jan 01 2013

Journal Name

Ibn Al-haitham Journal For Pure And Applied Science

Classification and Construction of (k,3)-Arcs on Projective Plane Over Galois Field GF(7)

A.

Fatema

The purpose of this work is to study the classification and construction of (k,3)-arcs in the projective plane PG(2,7). We found that there are two (5,3)-arcs, four (6,3)-arcs, six (7,3)-arcs, six (8,3)-arcs, seven (9,3)-arcs, six (10,3)-arcs and six (11,3)-arcs. All of these arcs are incomplete. There are six distinct (12,3)-arcs, two of which are complete. There are four distinct (13,3)-arcs, two of which are complete, and one (14,3)-arc, which is incomplete. There exists one complete (15,3)-arc.
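
The abstract does not restate the definitions it relies on, so the sketch below illustrates them under the standard convention: a (k,n)-arc in PG(2,q) is a set of k points with some n collinear but no n+1 collinear, and it is complete when no point of the plane can be added without breaking that property. This is an illustration only, not the authors' classification procedure; the function names and the example point sets are hypothetical.

```python
from itertools import product

Q = 7  # field order; GF(7) is the integers modulo 7

def normalize(v):
    """Scale a nonzero vector so that its first nonzero coordinate is 1."""
    lead = next(c for c in v if c % Q)
    inv = pow(lead, -1, Q)  # modular inverse (Python 3.8+)
    return tuple((c * inv) % Q for c in v)

# Points of PG(2,7): nonzero vectors of GF(7)^3 up to scalar multiples.
POINTS = sorted({normalize(v) for v in product(range(Q), repeat=3) if any(v)})
LINES = POINTS  # by duality, lines use the same normalized coordinates

def on_line(p, line):
    """A point lies on a line when their dot product is zero modulo 7."""
    return sum(a * b for a, b in zip(p, line)) % Q == 0

def max_collinear(points):
    """Largest number of the given points lying on any single line."""
    pts = {normalize(p) for p in points}
    return max(sum(on_line(p, line) for p in pts) for line in LINES)

def is_kn_arc(points, n=3):
    """True if some n of the points are collinear and no n+1 are."""
    return max_collinear(points) == n

def is_complete(points, n=3):
    """True if the set is a (k,n)-arc and every extra point creates n+1 collinear points."""
    pts = {normalize(p) for p in points}
    return is_kn_arc(pts, n) and all(
        max_collinear(pts | {p}) > n for p in POINTS if p not in pts
    )

# Hypothetical examples: three points on the line z = 0 plus two points off it.
example_arc = [(1, 0, 0), (0, 1, 0), (1, 1, 0), (0, 0, 1), (1, 1, 1)]
four_collinear = [(1, 0, 0), (0, 1, 0), (1, 1, 0), (1, 2, 0)]
frame = [(1, 0, 0), (0, 1, 0), (0, 0, 1), (1, 1, 1)]

print(len(POINTS))                 # 57 points in PG(2,7)
print(is_kn_arc(example_arc))      # True: a (5,3)-arc
print(is_kn_arc(four_collinear))   # False: four points on one line
print(max_collinear(frame))        # 2: no three collinear, so not a (4,3)-arc
print(is_complete(example_arc))    # False: it can be extended
```

The last line agrees with the statement above that every (5,3)-arc is incomplete. Classifying arcs up to projective equivalence, as the paper does, needs considerably more machinery than this membership check.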

Publication Date

Thu Dec 03 2015

Journal Name

Iraqi Journal Of Science

New multispectral images classification method based on MSR and Skewness implementing on various sensor scenes

Taghreed

Publication Date

Sun Apr 04 2010

Journal Name

Journal Of Educational And Psychological Researches

Translation & Adaptation of (Patterns) & (Assembly) Scales of The Flanagan Aptitude Classification Tests (FACT)

Translation & Adaptation

The Flanagan Aptitude Classification Tests (FACT)

Adil A. S. Al-Salihy

Huda Jameel Abdul-Ghani

The Flanagan Aptitude Classification Tests (FACT) assesses aptitudes that are important for successful performance of particular job-related tasks. An individual's aptitude can then be matched to the job tasks. The FACT helps to determine the tasks in which a person has proficiency. Each test measures a specific skill that is important for particular occupations. The FACT battery is designed to provide measures of an individual's aptitude for each of 16 job elements.

The FACT consists of 16 tests used to measure aptitudes that are important for the successful performance of many occupational tasks. The tests provide a broad basis for predicting success in various occupational fields. All are paper and pen

Publication Date

Wed Jul 01 2015

Journal Name

Arabian Journal Of Geosciences

Mishrif carbonates facies and diagenesis glossary, South Iraq microfacies investigation technique: types, classification, and related diagenetic impacts

Afrah H.

Govand H.

Publication Date

Mon Mar 23 2020

Journal Name

Baghdad Science Journal

Surfactant Cloud Point Extraction as a Procedure of Preconcentrating for Metoclopramide Determination Using Spectro Analytical Technique

Cloud point extraction

Metoclopramide hydrochloride detection

1-Naphthol

Pharmaceutical products

Spectrophotometry.

maha

In the current article, an easy and selective method is proposed for the spectrophotometric estimation of metoclopramide (MCP) in pharmaceutical preparations using a cloud point extraction (CPE) procedure. The method involves the reaction of MCP with 1-Naphthol under alkaline conditions, using Triton X-114, to form a stable dark purple dye. After optimization, Beer’s law holds in the range 0.34-9 μg mL^-1 of MCP with r = 0.9959 (n = 3). The relative standard deviation (RSD) and percentage recoveries were 0.89% and 96.99–104.11%, respectively. In addition, using surfactant cloud point extraction to extract MCP raised the extinction coefficient (ε) to 1.7333×10^5 L/mol.cm in the surfactant-rich phase. The small volume of organi

Publication Date

Sat Mar 01 2014

Journal Name

Renewable And Sustainable Energy Reviews

Review on the development of natural dye photosensitizer for dye-sensitized solar cells

A.M. Al-Alwani Mahmoud
