Toward Constructing a Balanced Intrusion Detection Dataset

Amer Abulmajeed Abdulrahman Alsameraee; Mahmood Khalel Ibrahem

doi:10.54153/sjpas.2020.v2i3.86

Details

Publication Date

Wed Sep 22 2021

Journal Name

Samarra Journal Of Pure And Applied Science

Volume

2

DOI

10.54153/sjpas.2020.v2i3.86

Choose Citation Style

Statistics

View publication

24

Statistics

(11)

Toward Constructing a Balanced Intrusion Detection Dataset

Amer Abulmajeed Abdulrahman Alsameraee

Mahmood Khalel Ibrahem

...Show More Authors

Several Intrusion Detection Systems (IDS) have been proposed in the current decade. Most datasets which associate with intrusion detection dataset suffer from an imbalance class problem. This problem limits the performance of classifier for minority classes. This paper has presented a novel class imbalance processing technology for large scale multiclass dataset, referred to as BMCD. Our algorithm is based on adapting the Synthetic Minority Over-Sampling Technique (SMOTE) with multiclass dataset to improve the detection rate of minority classes while ensuring efficiency. In this work we have been combined five individual CICIDS2017 dataset to create one multiclass dataset which contains several types of attacks. To prove the efficiency of our algorithm, several machine learning algorithms have been applied on combined dataset with and without using BMCD algorithm. The experimental results have concluded that BMCD provides an effective solution to imbalanced intrusion detection and outperforms the state-of-the-art intrusion detection methods.

View Publication

Publication Date

Sat Feb 25 2017

Journal Name

International Journal On Advanced Science, Engineering And Information Technology

A Novel DNA Sequence Approach for Network Intrusion Detection System Based on Cryptography Encoding Method

DNA

Horspool algorithm

network intrusion detection system

Teiresas algorithm

Omar Fitian

Zulaiha Ali

Suhaila

...Show More Authors

A novel method for Network Intrusion Detection System (NIDS) has been proposed, based on the concept of how DNA sequence detects disease as both domains have similar conceptual method of detection. Three important steps have been proposed to apply DNA sequence for NIDS: convert the network traffic data into a form of DNA sequence using Cryptography encoding method; discover patterns of Short Tandem Repeats (STR) sequence for each network traffic attack using Teiresias algorithm; and conduct classification process depends upon STR sequence based on Horspool algorithm. 10% KDD Cup 1999 data set is used for training phase. Correct KDD Cup 1999 data set is used for testing phase to evaluate the proposed method. The current experiment results sh

View Publication

(10)

(6)

Publication Date

Sat Dec 01 2018

Journal Name

Journal Of Theoretical And Applied Information Technology

Matching Algorithms for Intrusion Detection System based on DNA Encoding

Intrusion detection

DNA Encoding

Pattern Matching Algorithm

Knuth-Morris-Pratt Algorithm

Boyer-Moore Algorithm

Omar Fitian

Zulaiha Ali

Suhaila

...Show More Authors

Pattern matching algorithms are usually used as detecting process in intrusion detection system. The efficiency of these algorithms is affected by the performance of the intrusion detection system which reflects the requirement of a new investigation in this field. Four matching algorithms and a combined of two algorithms, for intrusion detection system based on new DNA encoding, are applied for evaluation of their achievements. These algorithms are Brute-force algorithm, Boyer-Moore algorithm, Horspool algorithm, Knuth-Morris-Pratt algorithm, and the combined of Boyer-Moore algorithm and Knuth–Morris– Pratt algorithm. The performance of the proposed approach is calculated based on the executed time, where these algorithms are applied o

(5)

Publication Date

Fri May 17 2019

Journal Name

Lecture Notes In Networks And Systems

Features Selection for Intrusion Detection System Based on DNA Encoding

Intrusion detection system

DNA encoding

Feature selection

KDD Cup 99 dataset

NSL-KDD dataset

Omar Fitian

Zulaiha Ali

Suhaila

...Show More Authors

Intrusion detection systems detect attacks inside computers and networks, where the detection of the attacks must be in fast time and high rate. Various methods proposed achieved high detection rate, this was done either by improving the algorithm or hybridizing with another algorithm. However, they are suffering from the time, especially after the improvement of the algorithm and dealing with large traffic data. On the other hand, past researches have been successfully applied to the DNA sequences detection approaches for intrusion detection system; the achieved detection rate results were very low, on other hand, the processing time was fast. Also, feature selection used to reduce the computation and complexity lead to speed up the system

(5)

Publication Date

Sat Dec 01 2012

Journal Name

Journal Of Engineering

Development an Anomaly Network Intrusion Detection System Using Neural Network

Intrusion Detection Systems (IDS)

PAYL

SOM

Randomization

Kais Said

Hamid M.

Elaf Sabah

...Show More Authors

Most intrusion detection systems are signature based that work similar to anti-virus but they are unable to detect the zero-day attacks. The importance of the anomaly based IDS has raised because of its ability to deal with the unknown attacks. However smart attacks are appeared to compromise the detection ability of the anomaly based IDS. By considering these weak points the proposed
system is developed to overcome them. The proposed system is a development to the well-known payload anomaly detector (PAYL). By
combining two stages with the PAYL detector, it gives good detection ability and acceptable ratio of false positive. The proposed system improve the models recognition ability in the PAYL detector, for a filtered unencrypt

View Publication Preview PDF

Publication Date

Fri Jan 01 2021

Journal Name

Ieee Access

DNA Encoding and STR Extraction for Anomaly Intrusion Detection Systems

DNA

intrusion detection

Knuth-MorrisPratt

short tandem repeat

Teiresias algorithm.

Omar Fitian

Zulaiha Ali

Suhaila

Noor Azah

...Show More Authors

View Publication

(11)

(9)

Publication Date

Mon Dec 14 2020

Journal Name

2020 13th International Conference On Developments In Esystems Engineering (dese)

Anomaly Based Intrusion Detection System Using Hierarchical Classification and Clustering Techniques

H.

Suhaila N.

...Show More Authors

With the rapid development of computers and network technologies, the security of information in the internet becomes compromise and many threats may affect the integrity of such information. Many researches are focused theirs works on providing solution to this threat. Machine learning and data mining are widely used in anomaly-detection schemes to decide whether or not a malicious activity is taking place on a network. In this paper a hierarchical classification for anomaly based intrusion detection system is proposed. Two levels of features selection and classification are used. In the first level, the global feature vector for detection the basic attacks (DoS, U2R, R2L and Probe) is selected. In the second level, four local feature vect

View Publication

(5)

Publication Date

Sat May 24 2025

Journal Name

Iraqi Journal For Computer Science And Mathematics

Intrusion Detection System for IoT Based on Modified Random Forest Algorithm

Intrusion detection system

IoT

Modified random forest

IoTID20 dataset

UNSW_NB15 dataset

IoT-23 dataset

Omar Z.

Sura Mazin

Ann F.

Ahmed T.

S. K.

...Show More Authors

An intrusion detection system (IDS) is key to having a comprehensive cybersecurity solution against any attack, and artificial intelligence techniques have been combined with all the features of the IoT to improve security. In response to this, in this research, an IDS technique driven by a modified random forest algorithm has been formulated to improve the system for IoT. To this end, the target is made as one-hot encoding, bootstrapping with less redundancy, adding a hybrid features selection method into the random forest algorithm, and modifying the ranking stage in the random forest algorithm. Furthermore, three datasets have been used in this research, IoTID20, UNSW-NB15, and IoT-23. The results are compared with the three datasets men

View Publication Preview PDF

(6)

(4)

Publication Date

Sun May 01 2022

Journal Name

Journal Of Engineering

Performance Analysis of different Machine Learning Models for Intrusion Detection Systems

salim

Mohammed

...Show More Authors

In recent years, the world witnessed a rapid growth in attacks on the internet which resulted in deficiencies in networks performances. The growth was in both quantity and versatility of the attacks. To cope with this, new detection techniques are required especially the ones that use Artificial Intelligence techniques such as machine learning based intrusion detection and prevention systems. Many machine learning models are used to deal with intrusion detection and each has its own pros and cons and this is where this paper falls in, performance analysis of different Machine Learning Models for Intrusion Detection Systems based on supervised machine learning algorithms. Using Python Scikit-Learn library KNN, Support Ve

View Publication Preview PDF

(19)

(8)

Publication Date

Wed Jul 17 2019

Journal Name

Advances In Intelligent Systems And Computing

A New Arabic Dataset for Emotion Recognition

emotions recognition

text categorization

machine learn-ing

PPM

WEKA

Arabic corpus

Amer J.

William J.

...Show More Authors

In this study, we have created a new Arabic dataset annotated according to Ekman’s basic emotions (Anger, Disgust, Fear, Happiness, Sadness and Surprise). This dataset is composed from Facebook posts written in the Iraqi dialect. We evaluated the quality of this dataset using four external judges which resulted in an average inter-annotation agreement of 0.751. Then we explored six different supervised machine learning methods to test the new dataset. We used Weka standard classifiers ZeroR, J48, Naïve Bayes, Multinomial Naïve Bayes for Text, and SMO. We also used a further compression-based classifier called PPM not included in Weka. Our study reveals that the PPM classifier significantly outperforms other classifiers such as SVM and N

View Publication

(26)

(15)

Publication Date

Fri Nov 01 2019

Journal Name

2019 1st International Informatics And Software Engineering Conference (ubmyk)

Radial Basis Function (RBF) Based on Multistage Autoencoders for Intrusion Detection system (IDS)

Feature extraction

Intrusion detection

Training

Mathematical model

Testing

Support vector machines

Matlab

Yezi

Alok

...Show More Authors

In this paper, RBF-based multistage auto-encoders are used to detect IDS attacks. RBF has numerous applications in various actual life settings. The planned technique involves a two-part multistage auto-encoder and RBF. The multistage auto-encoder is applied to select top and sensitive features from input data. The selected features from the multistage auto-encoder is wired as input to the RBF and the RBF is trained to categorize the input data into two labels: attack or no attack. The experiment was realized using MATLAB2018 on a dataset comprising 175,341 case, each of which involves 42 features and is authenticated using 82,332 case. The developed approach here has been applied for the first time, to the knowledge of the authors, to dete

View Publication

(4)

(3)

1 2 3 4 ... 627 628 629 630