A Survey on Arabic Text Classification Using Deep and Machine Learning Algorithms

Farah A. Abdulghani; Nada A.Z. Abdullah

doi:10.24996/ijs.2022.63.1.37

Details

Publication Date

Sun Jan 30 2022

Journal Name

Iraqi Journal Of Science

DOI

10.24996/ijs.2022.63.1.37

Choose Citation Style

Statistics

View publication

8

Statistics

(14)

(4)

A Survey on Arabic Text Classification Using Deep and Machine Learning Algorithms

Farah A. Abdulghani

Nada A.Z. Abdullah

...Show More Authors

Text categorization refers to the process of grouping text or documents into classes or categories according to their content. Text categorization process consists of three phases which are: preprocessing, feature extraction and classification. In comparison to the English language, just few studies have been done to categorize and classify the Arabic language. For a variety of applications, such as text classification and clustering, Arabic text representation is a difficult task because Arabic language is noted for its richness, diversity, and complicated morphology. This paper presents a comprehensive analysis and a comparison for researchers in the last five years based on the dataset, year, algorithms and the accuracy they got. Deep Learning (DL) and Machine Learning (ML) models were used to enhance text classification for Arabic language. Remarks for future work were concluded.

Publication Date

Sat Jul 06 2024

Journal Name

Multimedia Tools And Applications

Text classification based on optimization feature selection methods: a review and future directions

Text mining Text classification Text categorization Feature selection Optimization algorithms Machine learning classifiers

Osamah Mohammed

Yu-N

Hao

Omar Mustafa

Ammar Kamal

...Show More Authors

A substantial portion of today’s multimedia data exists in the form of unstructured text. However, the unstructured nature of text poses a significant task in meeting users’ information requirements. Text classification (TC) has been extensively employed in text mining to facilitate multimedia data processing. However, accurately categorizing texts becomes challenging due to the increasing presence of non-informative features within the corpus. Several reviews on TC, encompassing various feature selection (FS) approaches to eliminate non-informative features, have been previously published. However, these reviews do not adequately cover the recently explored approaches to TC problem-solving utilizing FS, such as optimization techniques.

View Publication Preview PDF

(2)

(4)

Publication Date

Fri Sep 01 2023

Journal Name

Journal Of Engineering

Iraqi Sentiment and Emotion Analysis Using Deep Learning

Emotion analysis

Sentiment analysis

CNN

GRU

Iraqi dialect

Anwar Abdul-Razzaq

Nada A. Z.

...Show More Authors

Analyzing sentiment and emotions in Arabic texts on social networking sites has gained wide interest from researchers. It has been an active research topic in recent years due to its importance in analyzing reviewers' opinions. The Iraqi dialect is one of the Arabic dialects used in social networking sites, characterized by its complexity and, therefore, the difficulty of analyzing sentiment. This work presents a hybrid deep learning model consisting of a Convolution Neural Network (CNN) and the Gated Recurrent Units (GRU) to analyze sentiment and emotions in Iraqi texts. Three Iraqi datasets (Iraqi Arab Emotions Data Set (IAEDS), Annotated Corpus of Mesopotamian-Iraqi Dialect (ACMID), and Iraqi Arabic Dataset (IAD)) col

View Publication Preview PDF

(4)

Publication Date

Thu Nov 03 2022

Journal Name

Sensors

A Novel Application of Deep Learning (Convolutional Neural Network) for Traumatic Spinal Cord Injury Classification Using Automatically Learned Features of EMG Signal

Masood F.

...Show More Authors

In this study, a traumatic spinal cord injury (TSCI) classification system is proposed using a convolutional neural network (CNN) technique with automatically learned features from electromyography (EMG) signals for a non-human primate (NHP) model. A comparison between the proposed classification system and a classical classification method (k-nearest neighbors, kNN) is also presented. Developing such an NHP model with a suitable assessment tool (i.e., classifier) is a crucial step in detecting the effect of TSCI using EMG, which is expected to be essential in the evaluation of the efficacy of new TSCI treatments. Intramuscular EMG data were collected from an agonist/antagonist tail muscle pair for the pre- and post-spinal cord lesi

View Publication

(8)

(9)

Publication Date

Wed Feb 06 2013

Journal Name

Eng. & Tech. Journal

A proposal to detect computer worms (malicious codes) using data mining classification algorithms

Inas Ali

Soukaina

...Show More Authors

Malicious software (malware) performs a malicious function that compromising a computer system’s security. Many methods have been developed to improve the security of the computer system resources, among them the use of firewall, encryption, and Intrusion Detection System (IDS). IDS can detect newly unrecognized attack attempt and raising an early alarm to inform the system about this suspicious intrusion attempt. This paper proposed a hybrid IDS for detection intrusion, especially malware, with considering network packet and host features. The hybrid IDS designed using Data Mining (DM) classification methods that for its ability to detect new, previously unseen intrusions accurately and automatically. It uses both anomaly and misuse dete

Publication Date

Wed Mar 10 2021

Journal Name

Baghdad Science Journal

Detecting Textual Propaganda Using Machine Learning Techniques

Social Networks

Disinformation

Propaganda

Term Frequency

Bag of Words.

Akib Mohi Ud Din

Qamar Rayees

Syed Tanzeel

...Show More Authors

Social Networking has dominated the whole world by providing a platform of information dissemination. Usually people share information without knowing its truthfulness. Nowadays Social Networks are used for gaining influence in many fields like in elections, advertisements etc. It is not surprising that social media has become a weapon for manipulating sentiments by spreading disinformation. Propaganda is one of the systematic and deliberate attempts used for influencing people for the political, religious gains. In this research paper, efforts were made to classify Propagandist text from Non-Propagandist text using supervised machine learning algorithms. Data was collected from the news sources from July 2018-August 2018. After annota

View Publication Preview PDF

(21)

(11)

Publication Date

Tue Dec 01 2020

Journal Name

Baghdad Science Journal

Detection of Suicidal Ideation on Twitter using Machine Learning & Ensemble Approaches

Ensemble Learning

Machine learning

Suicidal Ideation

Text classification

Twitter

Weka.

Syed Tanzeel

Qamar Rayees

Akib Mohi Ud Din

...Show More Authors

Suicidal ideation is one of the most severe mental health issues faced by people all over the world. There are various risk factors involved that can lead to suicide. The most common & critical risk factors among them are depression, anxiety, social isolation and hopelessness. Early detection of these risk factors can help in preventing or reducing the number of suicides. Online social networking platforms like Twitter, Redditt and Facebook are becoming a new way for the people to express themselves freely without worrying about social stigma. This paper presents a methodology and experimentation using social media as a tool to analyse the suicidal ideation in a better way, thus helping in preventing the chances of being the victim o

View Publication Preview PDF

(41)

(28)

Publication Date

Fri Jan 01 2021

Journal Name

Artificial Intelligence For Covid-19

An Efficient Mixture of Deep and Machine Learning Models for COVID-19 and Tuberculosis Detection Using X-Ray Images in Resource Limited Settings

Ali H.

Rami N.

Zahraa M.

Javier

...Show More Authors

View Publication

(29)

(25)

Publication Date

Mon Jan 02 2017

Journal Name

European Journal Of Scientific Research

Fast approach for arabic text encryption using genetic algorithm

Encryption

Decryption

Genetic Algorithm

Population

Crossover

Riyadh Bassil

...Show More Authors

As s widely use of exchanging private information in various communication applications, the issue to secure it became top urgent. In this research, a new approach to encrypt text message based on genetic algorithm operators has been proposed. The proposed approach follows a new algorithm of generating 8 bit chromosome to encrypt plain text after selecting randomly crossover point. The resulted child code is flipped by one bit using mutation operation. Two simulations are conducted to evaluate the performance of the proposed approach including execution time of encryption/decryption and throughput computations. Simulations results prove the robustness of the proposed approach to produce better performance for all evaluation metrics with res

Publication Date

Sat Apr 30 2022

Journal Name

Revue D'intelligence Artificielle

Performance Evaluation of SDN DDoS Attack Detection and Mitigation Based Random Forest and K-Nearest Neighbors Machine Learning Algorithms

Mayadah A.

Ali H.

...Show More Authors

Software-defined networks (SDN) have a centralized control architecture that makes them a tempting target for cyber attackers. One of the major threats is distributed denial of service (DDoS) attacks. It aims to exhaust network resources to make its services unavailable to legitimate users. DDoS attack detection based on machine learning algorithms is considered one of the most used techniques in SDN security. In this paper, four machine learning techniques (Random Forest, K-nearest neighbors, Naive Bayes, and Logistic Regression) have been tested to detect DDoS attacks. Also, a mitigation technique has been used to eliminate the attack effect on SDN. RF and KNN were selected because of their high accuracy results. Three types of ne

View Publication

(17)

(6)

Publication Date

Wed Dec 01 2021

Journal Name

Computers & Electrical Engineering

Utilizing different types of deep learning models for classification of series arc in photovoltaics systems

Alaa Hamza

Dalila Mat

Siti Maherah

Sadiq H.

Haidar

...Show More Authors

View Publication

(12)

1 2 ... 4 5 6 7 ... 2144 2145