Research on Emotion Classification Based on Multi-modal Fusion

zhihua Xiang; Nor Haizan Mohamed Radzi; Haslina Hashim

doi:10.21123/bsj.2024.9454

Details

Publication Date

Sun Feb 25 2024

Journal Name

Baghdad Science Journal

Volume

21

Issue Number

2(SI)

DOI

10.21123/bsj.2024.9454

Choose Citation Style

Statistics

View publication

8

Statistics

(2)

Research on Emotion Classification Based on Multi-modal Fusion

Dynamic correlation

Feature matching

Multi-modal emotion classification

Match fusion

Temporal attention

zhihua Xiang

Nor Haizan Mohamed Radzi

Haslina Hashim

...Show More Authors

Nowadays, people's expression on the Internet is no longer limited to text, especially with the rise of the short video boom, leading to the emergence of a large number of modal data such as text, pictures, audio, and video. Compared to single mode data ,the multi-modal data always contains massive information. The mining process of multi-modal information can help computers to better understand human emotional characteristics. However, because the multi-modal data show obvious dynamic time series features, it is necessary to solve the dynamic correlation problem within a single mode and between different modes in the same application scene during the fusion process. To solve this problem, in this paper, a feature extraction framework of the three-dimensional dynamic expansion is established based on the common multi-modal data, for example video , sound ,text.Based on the framework, a multi-modal fusion-matched framework based on spatial and temporal feature enhancement, respectively to solve the dynamic correlation within and between modes, and then model the short and long term dynamic correlation information between different modes based on the proposed framework. Multiple group experiments performed on MOSI datasets show that the emotion recognition model constructed based on the framework proposed here in this paper can better utilize the more complex complementary information between different modal data. Compared with other multi-modal data fusion models, the spatial-temporal attention-based multimodal data fusion framework proposed in this paper significantly improves the emotion recognition rate and accuracy when applied to multi-modal emotion analysis, so it is more feasible and effective.

View Publication Preview PDF

Quick Preview PDF

Publication Date

Thu Oct 01 2020

Journal Name

Defence Technology

A novel facial emotion recognition scheme based on graph mining

Emotion recognition

Facial landmarks

Graph mining

gSpan algorithm

Binary cat swarm optimization (BCSO)

Neural network

Suhaila N.

...Show More Authors

Recent years have seen an explosion in graph data from a variety of scientific, social and technological fields. From these fields, emotion recognition is an interesting research area because it finds many applications in real life such as in effective social robotics to increase the interactivity of the robot with human, driver safety during driving, pain monitoring during surgery etc. A novel facial emotion recognition based on graph mining has been proposed in this paper to make a paradigm shift in the way of representing the face region, where the face region is represented as a graph of nodes and edges and the gSpan frequent sub-graphs mining algorithm is used to find the frequent sub-structures in the graph database of each emotion. T

View Publication Preview PDF

(48)

(39)

Publication Date

Tue Jun 23 2020

Journal Name

Baghdad Science Journal

Anomaly Detection Approach Based on Deep Neural Network and Dropout

Deep Learning

Dropout

Feature Selection

Network Security

NIDS

Zaid

...Show More Authors

Regarding to the computer system security, the intrusion detection systems are fundamental components for discriminating attacks at the early stage. They monitor and analyze network traffics, looking for abnormal behaviors or attack signatures to detect intrusions in early time. However, many challenges arise while developing flexible and efficient network intrusion detection system (NIDS) for unforeseen attacks with high detection rate. In this paper, deep neural network (DNN) approach was proposed for anomaly detection NIDS. Dropout is the regularized technique used with DNN model to reduce the overfitting. The experimental results applied on NSL_KDD dataset. SoftMax output layer has been used with cross entropy loss funct

View Publication Preview PDF

(27)

(12)

Publication Date

Sat Oct 01 2022

Journal Name

Baghdad Science Journal

A Crime Data Analysis of Prediction Based on Classification Approaches

Crime

Crime Prediction

Decision Tree

Logistic Regression

Naïve Bayes

Fatima Shaker

Abbas Fadhil

...Show More Authors

Crime is considered as an unlawful activity of all kinds and it is punished by law. Crimes have an impact on a society's quality of life and economic development. With a large rise in crime globally, there is a necessity to analyze crime data to bring down the rate of crime. This encourages the police and people to occupy the required measures and more effectively restricting the crimes. The purpose of this research is to develop predictive models that can aid in crime pattern analysis and thus support the Boston department's crime prevention efforts. The geographical location factor has been adopted in our model, and this is due to its being an influential factor in several situations, whether it is traveling to a specific area or livin

View Publication Preview PDF

(10)

(6)

Publication Date

Sun Dec 01 2019

Journal Name

Journal Of Accounting And Financial Studies ( Jafs )

Cost technique based on ABCII specifications and its effect in the Implementation of contracting contracts: Applied research in Al-Mansour general company for construction contracting

سهام عبد علي

ثائر صبري

...Show More Authors

The problem of research was to identify after the use of cost technology based on specifications in the validity of determining and measuring the costs of the implementation of contracting, by applying to al-Mansour General Construction Contracting Company as an appropriate alternative to the traditional costing system currently adopted, which is characterized by many shortcomings and weaknesses Which has been reflected in the validity and integrity of the calculations. To solve this problem, the research was based on the premise that: (The application of cost technology based on specifications will result in calculating the cost of the product according to the specification required by the customer, to meet his wishes properly and witho

View Publication Preview PDF

Publication Date

Mon Dec 05 2022

Journal Name

Baghdad Science Journal

Short Text Semantic Similarity Measurement Approach Based on Semantic Network

Naamah Hussein

Adel M.

Ahmed T.

...Show More Authors

Estimating the semantic similarity between short texts plays an increasingly prominent role in many fields related to text mining and natural language processing applications, especially with the large increase in the volume of textual data that is produced daily. Traditional approaches for calculating the degree of similarity between two texts, based on the words they share, do not perform well with short texts because two similar texts may be written in different terms by employing synonyms. As a result, short texts should be semantically compared. In this paper, a semantic similarity measurement method between texts is presented which combines knowledge-based and corpus-based semantic information to build a semantic network that repre

View Publication Preview PDF

(3)

Publication Date

Mon Sep 21 2020

Journal Name

Iraqi Journal For Electrical And Electronic Engineering

Emotion Recognition Based on Mining Sub-Graphs of Facial Components

Suhaila N.

...Show More Authors

Facial emotion recognition finds many real applications in the daily life like human robot interaction, eLearning, healthcare, customer services etc. The task of facial emotion recognition is not easy due to the difficulty in determining the effective feature set that can recognize the emotion conveyed within the facial expression accurately. Graph mining techniques are exploited in this paper to solve facial emotion recognition problem. After determining positions of facial landmarks in face region, twelve different graphs are constructed using four facial components to serve as a source for sub-graphs mining stage using gSpan algorithm. In each group, the discriminative set of sub-graphs are selected and fed to Deep Belief Network (DBN) f

View Publication Preview PDF

(1)

Publication Date

Tue Dec 07 2021

Journal Name

2021 14th International Conference On Developments In Esystems Engineering (dese)

Content Based Image Retrieval Based on Feature Fusion and Support Vector Machine

Ibtihaal M.

Sadiq H.

Basheera M.

Abir

...Show More Authors

View Publication

(7)

Publication Date

Tue Sep 01 2020

Journal Name

Al-khwarizmi Engineering Journal

Two-Stage Classification of Breast Tumor Biomarkers for Iraqi Women

Iyden Kamil

Ali Hussein

Javier

...Show More Authors

Objective: Breast cancer is regarded as a deadly disease in women causing lots of mortalities. Early diagnosis of breast cancer with appropriate tumor biomarkers may facilitate early treatment of the disease, thus reducing the mortality rate. The purpose of the current study is to improve early diagnosis of breast by proposing a two-stage classification of breast tumor biomarkers fora sample of Iraqi women.

Methods: In this study, a two-stage classification system is proposed and tested with four machine learning classifiers. In the first stage, breast features (demographic, blood and salivary-based attributes) are classified into normal or abnormal cases, while in the second stage the abnormal breast cases are

View Publication Preview PDF

Publication Date

Sat Jun 01 2024

Journal Name

Iaes International Journal Of Artificial Intelligence (ij-ai)

A novel fusion-based approach for the classification of packets in wireless body area networks

Hanaa

KS

Baydaa

...Show More Authors

This abstract focuses on the significance of wireless body area networks (WBANs) as a cutting-edge and self-governing technology, which has garnered substantial attention from researchers. The central challenge faced by WBANs revolves around upholding quality of service (QoS) within rapidly evolving sectors like healthcare. The intricate task of managing diverse traffic types with limited resources further compounds this challenge. Particularly in medical WBANs, the prioritization of vital data is crucial to ensure prompt delivery of critical information. Given the stringent requirements of these systems, any data loss or delays are untenable, necessitating the implementation of intelligent algorithms. These algorithms play a pivota

View Publication

Publication Date

Sun Jun 20 2021

Journal Name

Baghdad Science Journal

PDCNN: FRAMEWORK for Potato Diseases Classification Based on Feed Foreword Neural Network

K-means

Gray Level Run Length Matrix

First Order Histogram Features

Scaled Conjugate Gradient Backpropagation

Israa Mohammed

Samar Amil

Musaab

...Show More Authors

The economy is exceptionally reliant on agricultural productivity. Therefore, in domain of agriculture, plant infection discovery is a vital job because it gives promising advance towards the development of agricultural production. In this work, a framework for potato diseases classification based on feed foreword neural network is proposed. The objective of this work is presenting a system that can detect and classify four kinds of potato tubers diseases; black dot, common scab, potato virus Y and early blight based on their images. The presented PDCNN framework comprises three levels: the pre-processing is first level, which is based on K-means clustering algorithm to detect the infected area from potato image. The s

View Publication Preview PDF

(9)

(1)

1 2 3 4 ... 980 981 982 983