Hybrid approaches to feature subset selection for data classification in high-dimensional feature space

Maysa Almulla Khalaf; John Q Gan

doi:10.5430/air.v9n1p45

Details

Publication Date

Wed Sep 23 2020

Journal Name

Artificial Intelligence Research

Volume

9

DOI

10.5430/air.v9n1p45

Choose Citation Style

Statistics

View publication

17

Statistics

Hybrid approaches to feature subset selection for data classification in high-dimensional feature space

Maysa Almulla Khalaf

John Q Gan

...Show More Authors

This paper proposes two hybrid feature subset selection approaches based on the combination (union or intersection) of both supervised and unsupervised filter approaches before using a wrapper, aiming to obtain low-dimensional features with high accuracy and interpretability and low time consumption. Experiments with the proposed hybrid approaches have been conducted on seven high-dimensional feature datasets. The classifiers adopted are support vector machine (SVM), linear discriminant analysis (LDA), and K-nearest neighbour (KNN). Experimental results have demonstrated the advantages and usefulness of the proposed methods in feature subset selection in high-dimensional space in terms of the number of selected features and time spent to achieve the best classification accuracy.

View Publication

Publication Date

Sat Dec 30 2023

Journal Name

Traitement Du Signal

Optimizing Acoustic Feature Selection for Estimating Speaker Traits: A Novel Threshold-Based Approach

Umniah

...Show More Authors

View Publication

(1)

Publication Date

Mon Jan 19 2026

Journal Name

American Journal Of Alzheimer's Disease & Other Dementias®

Comparison Study of Different Feature Selection Techniques for the Diagnosis of Alzheimer’s Disease

Farah

...Show More Authors

Objective : Alzheimer’s disease (AD) continues to be a major challenge because handling high-dimensional data is time-consuming and expensive due to its complexity. A large feature space often increases computational costs and reduces model interpretability. This study addresses this problem by evaluating and comparing multiple feature selection techniques to identify the most informative biomarkers for AD diagnosis.

Methods : Our study used data from the Alzheimer's Disease Neuroimaging Initiative (ADNI) to implement and test three feature selection a

View Publication

Publication Date

Sun Aug 02 2026

Journal Name

International Journal Of Data And Network Science

Multi-objective of wind-driven optimization as feature selection and clustering to enhance text clustering

Text Clustering

Multi-Objectives

Wind Driven Optimization

K-Means

Unsupervised Feature Selection

Meta-heuristics optimization

MEHDI G. DUAIMI

Bsoul,Q.

AL-Gburi, A.

...Show More Authors

Text Clustering consists of grouping objects of similar categories. The initial centroids influence operation of the system with the potential to become trapped in local optima. The second issue pertains to the impact of a huge number of features on the determination of optimal initial centroids. The problem of dimensionality may be reduced by feature selection. Therefore, Wind Driven Optimization (WDO) was employed as Feature Selection to reduce the unimportant words from the text. In addition, the current study has integrated a novel clustering optimization technique called the WDO (Wasp Swarm Optimization) to effectively determine the most suitable initial centroids. The result showed the new meta-heuristic which is WDO was employed as t

View Publication Preview PDF

(1)

Publication Date

Sun Jan 01 2023

Journal Name

Aip Conference Proceedings

The study of the literature review of hybrid classification approaches to credit scoring

Sameer F.

...Show More Authors

View Publication

Publication Date

Tue Nov 03 2015

Journal Name

Journal Of Natural Sciences Research

Implementation of remote sensing for vegetation studying using vegetation indices and automatic feature space plot

Taghreed

...Show More Authors

Publication Date

Sun Feb 25 2024

Journal Name

Baghdad Science Journal

Exploring Important Factors in Predicting Heart Disease Based on Ensemble- Extra Feature Selection Approach

Extra Tree

Feature selection

Feature subsets

Heart Disease Dataset

Machine learning

Howida

Farkhana

Alif Ridzuan

Ahmad Najmi Amerhaider

Zuriahati Mohd

Carolyn

...Show More Authors

Heart disease is a significant and impactful health condition that ranks as the leading cause of death in many countries. In order to aid physicians in diagnosing cardiovascular diseases, clinical datasets are available for reference. However, with the rise of big data and medical datasets, it has become increasingly challenging for medical practitioners to accurately predict heart disease due to the abundance of unrelated and redundant features that hinder computational complexity and accuracy. As such, this study aims to identify the most discriminative features within high-dimensional datasets while minimizing complexity and improving accuracy through an Extra Tree feature selection based technique. The work study assesses the efficac

View Publication Preview PDF

(7)

(5)

Publication Date

Sun Jan 01 2023

Journal Name

Ieee Access

Fuzzy-Based Ensemble Feature Selection for Automated Estimation of Speaker Height and Age Using Vocal Characteristics

Umniah

...Show More Authors

View Publication

(2)

(3)

Publication Date

Mon Jul 01 2024

Journal Name

Journal Of Engineering

Efficient Intrusion Detection Through the Fusion of AI Algorithms and Feature Selection Methods

Intrusion Detection System (IDS)

Machine learning

Naïve bayes

K-Nearest Neighbor (KNN)

Decision tree

Feature selection

Muna Hadi

...Show More Authors

With the proliferation of both Internet access and data traffic, recent breaches have brought into sharp focus the need for Network Intrusion Detection Systems (NIDS) to protect networks from more complex cyberattacks. To differentiate between normal network processes and possible attacks, Intrusion Detection Systems (IDS) often employ pattern recognition and data mining techniques. Network and host system intrusions, assaults, and policy violations can be automatically detected and classified by an Intrusion Detection System (IDS). Using Python Scikit-Learn the results of this study show that Machine Learning (ML) techniques like Decision Tree (DT), Naïve Bayes (NB), and K-Nearest Neighbor (KNN) can enhance the effectiveness of an Intrusi

View Publication Preview PDF

(5)

(1)

Publication Date

Sat Oct 04 2025

Journal Name

Mesopotamian Journal Of Computer Science

Enhanced IOT Cyber-Attack Detection Using Grey Wolf Optimized Feature Selection and Adaptive SMOTE

IoT Security

Cyber-Attack

Grey Wolf Optimizer

Feature Selection

SMOTE

Random Forest

XGBoost

CatBoost

Sura

Mustafa

Nada

...Show More Authors

The Internet of Things (IoT) has significantly transformed modern systems through extensive connectivity but has also concurrently introduced considerable cybersecurity risks. Traditional rule-based methods are becoming increasingly insufficient in the face of evolving cyber threats. This study proposes an enhanced methodology utilizing a hybrid machine-learning framework for IoT cyber-attack detection. The framework integrates a Grey Wolf Optimizer (GWO) for optimal feature selection, a customized synthetic minority oversampling technique (SMOTE) for data balancing, and a systematic approach to hyperparameter tuning of ensemble algorithms: Random Forest (RF), XGBoost, and CatBoost. Evaluations on the RT-IoT2022 dataset demonstrat

View Publication Preview PDF

(1)

Publication Date

Sat Oct 22 2022

Journal Name

Aro-the Scientific Journal Of Koya University

Classification of Different Shoulder Girdle Motions for Prosthesis Control Using a Time-Domain Feature Extraction Technique

Bio-signal analysis

Dimensionality reduction

LDA classifier

Time domain

Huda M.

Alia K.

Ali H.

...Show More Authors

Abstract—The upper limb amputation exerts a significant burden on the amputee, limiting their ability to perform everyday activities, and degrading their quality of life. Amputee patients’ quality of life can be improved if they have natural control over their prosthetic hands. Among the biological signals, most commonly used to predict upper limb motor intentions, surface electromyography (sEMG), and axial acceleration sensor signals are essential components of shoulder-level upper limb prosthetic hand control systems. In this work, a pattern recognition system is proposed to create a plan for categorizing high-level upper limb prostheses in seven various types of shoulder girdle motions. Thus, combining seven feature groups, w

View Publication Preview PDF

(6)

(3)

1 2 3 4 ... 2083 2084 2085 2086