Speech Emotion Recognition Using MELBP Variants of Spectrogram Image

Suhaila N. Mohammed

doi:10.22266/ijies2020.1031.23

Details

Publication Date

Sat Oct 31 2020

Journal Name

International Journal Of Intelligent Engineering And Systems

Volume

13

Issue Number

5

DOI

10.22266/ijies2020.1031.23

Choose Citation Style

Statistics

View publication

11

Statistics

(7)

(4)

Speech Emotion Recognition Using MELBP Variants of Spectrogram Image

Speech emotion

Spectrogram image

Multi-block extended local binary pattern (MELBP)

Deep beliefnetwork (DBN)

Short term fourier transform (STFT)

Suhaila N. Mohammed

...Show More Authors

View Publication Preview PDF

Quick Preview PDF

Publication Date

Tue Aug 31 2021

Journal Name

Inmateh Agricultural Engineering

DETERMINING THE EFFICIENCY OF A SMART SPRAYING ROBOT FOR CROP PROTECTION USING IMAGE PROCESSING TECHNOLOGY

machine learning

image processing

agricultural robot

forward speed

Mustafa Ahmed Jalal

Noor Ahmed

...Show More Authors

A system was used to detect injuries in plant leaves by combining machine learning and the principles of image processing. A small agricultural robot was implemented for fine spraying by identifying infected leaves using image processing technology with four different forward speeds (35, 46, 63 and 80 cm/s). The results revealed that increasing the speed of the agricultural robot led to a decrease in the mount of supplements spraying and a detection percentage of infected plants. They also revealed a decrease in the percentage of supplements spraying by 46.89, 52.94, 63.07 and 76% with different forward speeds compared to the traditional method.

View Publication Preview PDF

(6)

(4)

Publication Date

Tue Jun 09 2020

Journal Name

Article In Journal Of Engineering Science And Technology

English Numbers Recognition Based on Sign Language Using Line-Slope Features and PSO-DBN Optimization Method

Suhaila

Huda

...Show More Authors

View Publication

(3)

Publication Date

Mon Mar 01 2021

Journal Name

Iop Conference Series: Materials Science And Engineering

Speech Enhancement Algorithm Based on a Hybrid Estimator

Basheera M.

Sadiq H.

Marwah A.

Muntadher

Jamila

...Show More Authors

Abstract<p>Speech is the essential way to interact between humans or between human and machine. However, it is always contaminated with different types of environment noise. Therefore, speech enhancement algorithms (SEA) have appeared as a significant approach in speech processing filed to suppress background noise and return back the original speech signal. In this paper, a new efficient two-stage SEA with low distortion is proposed based on minimum mean square error sense. The estimation of clean signal is performed by taking the advantages of Laplacian speech and noise modeling based on orthogonal transform (Discrete Krawtchouk-Tchebichef transform) coefficients distribution. The Discrete Kra</p> ... Show More

View Publication

(11)

Publication Date

Wed Dec 18 2019

Journal Name

Baghdad Science Journal

Detecting Keratoconus by Using SVM and Decision Tree Classifiers with the Aid of Image Processing

Decision Tree

Image processing

Keratoconus (KCN)

Pentacam

SVM

Topographic Maps

Mosa

...Show More Authors

Researchers used different methods such as image processing and machine learning techniques in addition to medical instruments such as Placido disc, Keratoscopy, Pentacam;to help diagnosing variety of diseases that affect the eye. Our paper aims to detect one of these diseases that affect the cornea, which is Keratoconus. This is done by using image processing techniques and pattern classification methods. Pentacam is the device that is used to detect the cornea’s health; it provides four maps that can distinguish the changes on the surface of the cornea which can be used for Keratoconus detection. In this study, sixteen features were extracted from the four refractive maps along with five readings from the Pentacam software. The

View Publication Preview PDF

(12)

(4)

Publication Date

Sun Feb 25 2024

Journal Name

Baghdad Science Journal

An Adaptive Harmony Search Part-of-Speech tagger for Square Hmong Corpus

Harmony Search Algorithm

Low-resource language

Optimization

Part-of-Speech tagging

Unknown words

Di-Wen

Shao-Qiang

Sharifah Zarith Rahmah

Li-Ping

Feng

Pan

...Show More Authors

Data-driven models perform poorly on part-of-speech tagging problems with the square Hmong language, a low-resource corpus. This paper designs a weight evaluation function to reduce the influence of unknown words. It proposes an improved harmony search algorithm utilizing the roulette and local evaluation strategies for handling the square Hmong part-of-speech tagging problem. The experiment shows that the average accuracy of the proposed model is 6%, 8% more than HMM and BiLSTM-CRF models, respectively. Meanwhile, the average F1 of the proposed model is also 6%, 3% more than HMM and BiLSTM-CRF models, respectively.

View Publication Preview PDF

(4)

(2)

Publication Date

Sun Jun 01 2014

Journal Name

International Journal Of Advanced Research In Computer Science And Software Engineering

Medical Image Compression using Wavelet Quadrants of Polynomial Prediction Coding & Bit Plane Slicing

Ghadah

...Show More Authors

Publication Date

Sat Jun 06 2020

Journal Name

Journal Of The College Of Education For Women

Image classification with Deep Convolutional Neural Network Using Tensorflow and Transfer of Learning

Convolutional Neural Network (CNN)

Synthetic Aperture Radar (SAR)

TensorFlow

Transfer learning

Visual Geometry Group (VGG16)

Aseel Sami

MatheelEmaduldin

...Show More Authors

The deep learning algorithm has recently achieved a lot of success, especially in the field of computer vision. This research aims to describe the classification method applied to the dataset of multiple types of images (Synthetic Aperture Radar (SAR) images and non-SAR images). In such a classification, transfer learning was used followed by fine-tuning methods. Besides, pre-trained architectures were used on the known image database ImageNet. The model VGG16 was indeed used as a feature extractor and a new classifier was trained based on extracted features.The input data mainly focused on the dataset consist of five classes including the SAR images class (houses) and the non-SAR images classes (Cats, Dogs, Horses, and Humans). The Conv

View Publication Preview PDF

(1)

Publication Date

Mon Feb 04 2019

Journal Name

Iraqi Journal Of Physics

Studying the contribution of components and type of spiral galaxy NGC 6946 using digital image processing

Image classification

classification techniques

Spiral Galaxy

NGC 6946.

A. K.

...Show More Authors

NGC 6946 have been observed with BVRI filters, on October 15-18,
2012, with the Newtonian focus of the 1.88m telescope, Kottamia
observatory, of the National Research Institute of Astronomy and
Geophysics, Egypt (NRIAG), then we combine the BVRI filters to
obtain an astronomical image to the spiral galaxy NGC 6946 which
is regarded main source of information to discover the components of
this galaxy, where galaxies are considered the essential element of
the universe. To know the components of NGC 6946, we studied it
with the Variable Precision Rough Sets technique to determine the
contribution of the Bulge, disk, and arms of NGC 6946 according to
different color in the image. From image we can determined th

View Publication Preview PDF

Publication Date

Sat Oct 01 2022

Journal Name

Baghdad Science Journal

Human Face Recognition Based on Local Ternary Pattern and Singular Value Decomposition

Face Recognition

Image Processing

Local Ternary Pattern

Neural Network

Singular Values Decomposition

Ali Nadhim

Rozaida

Nidhal K.

Hussein Ali Hussein

...Show More Authors

There is various human biometrics used nowadays, one of the most important of these biometrics is the face. Many techniques have been suggested for face recognition, but they still face a variety of challenges for recognizing faces in images captured in the uncontrolled environment, and for real-life applications. Some of these challenges are pose variation, occlusion, facial expression, illumination, bad lighting, and image quality. New techniques are updating continuously. In this paper, the singular value decomposition is used to extract the features matrix for face recognition and classification. The input color image is converted into a grayscale image and then transformed into a local ternary pattern before splitting the image into

View Publication Preview PDF

(6)

(1)

Publication Date

Mon Jan 01 2024

Journal Name

Jordanian Journal Of Computers And Information Technology

BEYOND WORDS: HARNESSING SPEECH SOUND FOR SPEAKER AGE AND GENDER DETECTION USING 1D CNN ARCHITECTURE WITH SELF-ATTENTION MECHANISM

Umniah

...Show More Authors

Beyond the immediate content of speech, the voice can provide rich information about a speaker's demographics, including age and gender. Estimating a speaker's age and gender offers a wide range of applications, spanning from voice forensic analysis to personalized advertising, healthcare monitoring, and human-computer interaction. However, pinpointing precise age remains intricate due to age ambiguity. Specifically, utterances from individuals at adjacent ages are frequently indistinguishable. Addressing this, we propose a novel, end-to-end approach that deploys Mozilla's Common Voice dataset to transform raw audio into high-quality feature representations using Wav2Vec2.0 embeddings. These are then channeled into our self-attentio

View Publication

1 2 ... 17 18 19 20 ... 2464 2465