Human recognition by utilizing voice recognition and visual recognition

Sukaina Sh Altyar; Samera Shams Hussein; Mahir Jasem Mohammed

Details

Publication Date

Sat Jan 01 2022

Journal Name

International Journal Of Nonlinear Analysis And Applications

Volume

13

Issue Number

1

Deep learning; Convolutional Neural Networks; Human Recognition; voice recognition; visual recognition

Audio-visual detection and recognition systems are considered among the most promising methods for many applications, including surveillance, speech recognition, eavesdropping devices, and intelligence operations. In the field of human recognition, most current research focuses either on re-identifying body images taken by several cameras or on audio-only recognition. However, in some cases these traditional methods are not useful when used alone. Indoor surveillance cameras, for example, are installed close to the ceiling and capture images from directly above; people do not always look straight at the cameras; and cameras cannot be installed in some areas, such as toilets or bedrooms. It is therefore often difficult to identify movement or a break-in. Conversely, when a suspect must be pursued after entering a building or a party, it may be necessary to identify his location, or to listen to his speech alone and isolate it from other voices and noise. Hence, a hybrid combination technique is very effective. In this work, we propose a multimodal human recognition approach that uses both face and audio and is based on deep convolutional neural networks (CNNs). Mainly, to address the challenge of part of the body not being captured, the final results of recognition by separate CNNs, VGG Face16 and ResNet50, are combined by score-level fusion using the weighted sum rule to enhance recognition performance. The results show that the proposed system succeeds in recognising each person from his voice and/or his captured face. In addition, the system can separate a person's voice, isolate it from a noisy environment, and determine whether the desired person is present.
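
The score-level combination by weighted sum rule mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the min-max normalisation step, and the `face_weight` parameter are assumptions made for illustration, and the scores stand in for per-identity outputs of a face model and a voice model.

```python
def min_max_normalize(scores):
    """Rescale scores to [0, 1] so the two modalities are comparable (assumed step)."""
    lo, hi = min(scores), max(scores)
    if hi == lo:
        return [0.0 for _ in scores]
    return [(s - lo) / (hi - lo) for s in scores]


def weighted_sum_fusion(face_scores, voice_scores, face_weight=0.5):
    """Fuse per-identity scores from two classifiers with the weighted sum rule.

    face_weight is a hypothetical tuning parameter; the voice modality
    receives the remaining weight (1 - face_weight).
    Returns the fused scores and the index of the best-scoring identity.
    """
    if len(face_scores) != len(voice_scores):
        raise ValueError("both modalities must score the same identities")
    if not 0.0 <= face_weight <= 1.0:
        raise ValueError("face_weight must lie in [0, 1]")
    face = min_max_normalize(face_scores)
    voice = min_max_normalize(voice_scores)
    fused = [face_weight * f + (1.0 - face_weight) * v
             for f, v in zip(face, voice)]
    best = max(range(len(fused)), key=fused.__getitem__)
    return fused, best
```

Setting `face_weight` to 0 or 1 reduces the sketch to a single modality, which is one way to picture the case where the face is not captured and only the voice is available.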

Publication Date

Sun Jan 02 2022

Journal Name

Advances In Science And Technology Research Journal

Vein Biometric Recognition Methods and Systems: A Review

biometric technology

finger vein recognition

pre-processing

feature extraction

matching

Ruaa

Mohammed

Publication Date

Mon Aug 01 2022

Journal Name

Telkomnika (Telecommunication Computing Electronics And Control)

Dorsal hand veins features extraction and recognition by correlation coefficient

Maha A.

Kadhim M.

Publication Date

Mon Oct 30 2023

Journal Name

Traitement Du Signal

A Comprehensive Review on Machine Learning Approaches for Enhancing Human Speech Recognition

Maha

Husam

Publication Date

Wed Jun 01 2016

Journal Name

International Educational Scientific Research Journal

Efficient Methods of Iris Recognition

Zinah Rajab

Identification by biological features has gained tremendous importance with the spread of security systems in society. Among the various biometrics, such as face, finger, iris, retina, voice, palm print, ear, and hand geometry, iris recognition is gaining attention because every person's iris is unique, never changes during a human lifetime, and is highly protected against damage. These properties make the iris a good security measure. Iris recognition is regarded as a high-confidence biometric identification approach; it is mostly divided into four steps: acquisition, localization, segmentation, and normalization. This work reviews various iris recognition systems used by different researchers for each recognit

Publication Date

Thu Nov 01 2018

Journal Name

2018 1st Annual International Conference On Information And Sciences (aicis)

Speech Emotion Recognition Using Minimum Extracted Features

Speech emotion recognition

Minimum feature extraction

ZCR

12 MFCC

Random forest

Wisal Hashim

Rafah Shihab

Mohammed Najm

Recognizing speech emotions is an important subject in pattern recognition. This work studies the effect of extracting the minimum possible number of features on a speech emotion recognition (SER) system. Three experiments were performed to find the approach that gives good accuracy. The first extracts only three features from emotional speech samples: zero crossing rate (ZCR), mean, and standard deviation (SD). The second extracts only the first 12 Mel frequency cepstral coefficient (MFCC) features, and the last applies feature fusion between the mentioned features. In all experiments, the features are classified using five types of classification techniques, which are the Random Forest (RF),
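
The three features of the first experiment (ZCR, mean, and SD) can be sketched as follows. This is an illustrative sketch only, not the authors' code: the function names are hypothetical, the signal is assumed to be a plain list of amplitude samples, the features are computed over the whole signal rather than per frame, and the population form of the standard deviation is assumed.

```python
import math


def zero_crossing_rate(samples):
    """Fraction of consecutive sample pairs whose signs differ."""
    if len(samples) < 2:
        return 0.0
    crossings = sum(
        1 for a, b in zip(samples, samples[1:]) if (a >= 0) != (b >= 0)
    )
    return crossings / (len(samples) - 1)


def basic_features(samples):
    """Return ZCR, mean, and population standard deviation of a signal."""
    if not samples:
        raise ValueError("samples must not be empty")
    n = len(samples)
    mean = sum(samples) / n
    variance = sum((s - mean) ** 2 for s in samples) / n
    return {
        "zcr": zero_crossing_rate(samples),
        "mean": mean,
        "sd": math.sqrt(variance),
    }
```

A three-value feature vector of this kind is what a classifier such as Random Forest would receive for each speech sample in the first experiment.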

Publication Date

Wed Jul 17 2019

Journal Name

Advances In Intelligent Systems And Computing

A New Arabic Dataset for Emotion Recognition

emotions recognition

text categorization

machine learning

PPM

WEKA

Arabic corpus

Amer J.

William J.

In this study, we created a new Arabic dataset annotated according to Ekman’s basic emotions (Anger, Disgust, Fear, Happiness, Sadness and Surprise). The dataset is composed of Facebook posts written in the Iraqi dialect. We evaluated its quality using four external judges, which resulted in an average inter-annotator agreement of 0.751. We then explored six supervised machine learning methods to test the new dataset. We used the standard Weka classifiers ZeroR, J48, Naïve Bayes, Multinomial Naïve Bayes for Text, and SMO, as well as a compression-based classifier called PPM that is not included in Weka. Our study reveals that the PPM classifier significantly outperforms other classifiers such as SVM and N

Publication Date

Fri Jul 18 2014

Journal Name

International Journal Of Computer Applications

3-Level Techniques Comparison based Image Recognition

3-level Techniques

image recognition

stationary wavelet transform

wavelet transform

feature extraction.

Zainab

Ahlam

Image recognition is one of the most important applications of information processing. This paper compares 3-level techniques for image recognition built from the discrete wavelet transform (DWT) and the stationary wavelet transform (SWT): stationary-stationary-stationary (sss), stationary-stationary-wavelet (ssw), stationary-wavelet-stationary (sws), stationary-wavelet-wavelet (sww), wavelet-stationary-stationary (wss), wavelet-stationary-wavelet (wsw), wavelet-wavelet-stationary (wws) and wavelet-wavelet-wavelet (www). The techniques are compared according to the peak signal-to-noise ratio (PSNR), root mean square error (RMSE), compression ratio (CR) and the coding noise e(n) of each third
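
Two of the comparison metrics named above, RMSE and PSNR, follow standard definitions and can be sketched as below. The sketch assumes images flattened into equal-length lists of pixel values and an 8-bit peak value of 255; the function names and the `peak` parameter are illustrative and not taken from the paper.

```python
import math


def rmse(original, reconstructed):
    """Root mean square error between two equal-length pixel sequences."""
    if len(original) != len(reconstructed) or not original:
        raise ValueError("inputs must be non-empty and of equal length")
    squared = sum((a - b) ** 2 for a, b in zip(original, reconstructed))
    return math.sqrt(squared / len(original))


def psnr(original, reconstructed, peak=255.0):
    """Peak signal-to-noise ratio in decibels: 20 * log10(peak / RMSE)."""
    err = rmse(original, reconstructed)
    if err == 0:
        return math.inf  # identical images have no noise
    return 20.0 * math.log10(peak / err)
```

A lower RMSE and a higher PSNR both indicate that the reconstructed image is closer to the original.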

Publication Date

Mon Jan 02 2012

Journal Name

Journal Of Engineering

3-D Object Recognition using Multi-Wavelet and Neural Network

Object recognition

feature extraction

patches

multi-wavelet

neural network.

Zainab

Tariq

This research introduces multi-wavelet transform and neural network techniques for recognizing 3-D objects from 2-D images using patches. The proposed techniques were tested on a database of different patch features and the high-energy subband of the discrete multi-wavelet transform (DMWT) (gp) of the patches. The test set has two groups: group (1) contains images, their (gp) patches, and patch features of images that are partly the same as those in the data set, alongside other images, (gp) patches, and features; group (2) contains (gp) patches and patch features that are partly the same as those in the database but after modifications such as rotation, scaling, and translation. Recognition by back propagation (BP) neural network as com

Publication Date

Wed May 10 2023

Journal Name

Journal Of Engineering

3-D OBJECT RECOGNITION USING MULTI-WAVELET AND NEURAL NETWORK

Object recognition

feature extraction

patches

multi-wavelet

neural network.

Dr. Tariq Zeyad

Zainab Ibrahim

This research introduces multi-wavelet transform and neural network techniques for recognizing 3-D objects from 2-D images using patches. The proposed techniques were tested on a database of different patch features and the high-energy subband of the discrete multi-wavelet transform (DMWT) (gp) of the patches. The test set has two groups: group (1) contains images, their (gp) patches, and patch features of images that are partly the same as those in the data set, alongside other images, (gp) patches, and features; group (2) contains (gp) patches and patch features that are partly the same as those in the database but after modifications such as rotation, scaling, and translation. Recognition by back propagation (BP) neural network as

Publication Date

Tue Jan 01 2019

Journal Name

International Journal Of Machine Learning And Computing

Facial Emotion Recognition from Videos Using Deep Convolutional Neural Networks

Facial emotion recognition

deep convolutional neural network

TensorFlow

ADFES-BIV

WSEFEP

Wisal Hashim

Rafah Shihab

Mohammed Najm

It is well known that understanding human facial expressions is a key component of understanding emotions. It has broad applications in the field of human-computer interaction (HCI) and has been a long-standing problem. In this paper, we shed light on the use of a deep convolutional neural network (DCNN) for facial emotion recognition from videos using the TensorFlow machine-learning library from Google. This work was applied to ten emotions from the Amsterdam Dynamic Facial Expression Set-Bath Intensity Variations (ADFES-BIV) dataset and tested using two datasets.
