Human recognition by utilizing voice recognition and visual recognition

Sukaina Sh Altyar; Samera Shams Hussein; Mahir Jasem Mohammed

Details

Publication Date

Sat Jan 01 2022

Journal Name

International Journal Of Nonlinear Analysis And Applications

Volume

13

Issue Number

1

Choose Citation Style

Statistics

View publication

13

Statistics

Human recognition by utilizing voice recognition and visual recognition

Deep learning Convolutional Neural Networks Human Recognition voice recognition visual recognition

Sukaina Sh Altyar

Samera Shams Hussein

Mahir Jasem Mohammed

...Show More Authors

Audio-visual detection and recognition system is thought to become the most promising methods for many applications includes surveillance, speech recognition, eavesdropping devices, intelligence operations, etc. In the recent field of human recognition, the majority of the research be- coming performed presently is focused on the reidentification of various body images taken by several cameras or its focuses on recognized audio-only. However, in some cases these traditional methods can- not be useful when used alone such as in indoor surveillance systems, that are installed close to the ceiling and capture images right from above in a downwards direction and in some cases people don't look straight the cameras or it cannot be added in some area such as W.C. or sleeping room. Thus, its commonly difficult to identify any movement or breakthrough process, on the other hand when need to pursue suspect when enter a building or party to identify his location and/or listen to his speech only and isolate it from other voices or noises, the other. Hence, the use of the hybrid combination technique is very effective. In this work, we proposed a multimodal human recognition approach that utilizes both the face and audio and is based upon a deep convolutional neural network (CNN). Mainly, to solve the challenge of not capturing part of the body, final results of recognizing via separate CNNs of VGG Face16 and ResNet50 are joined together depending on the score-level combination by Weighted Sum rule to enhance recognition performance. The results show that the proposed system success to recognise each person from his voice and/or his face captured. In addition, the system can separate the person voice and isolate it from noisy environment and determine the existence of desired person.

View Publication Preview PDF

Quick Preview PDF

Publication Date

Wed Aug 01 2018

Journal Name

Engineering And Technology Journal

A Proposed Method for the Sound Recognition Process

Mustafa

...Show More Authors

View Publication

Publication Date

Sat Nov 02 2019

Journal Name

Advances In Intelligent Systems And Computing

Spin-Image Descriptors for Text-Independent Speaker Recognition

Suhaila N.

...Show More Authors

Building a system to identify individuals through their speech recording can find its application in diverse areas, such as telephone shopping, voice mail and security control. However, building such systems is a tricky task because of the vast range of differences in the human voice. Thus, selecting strong features becomes very crucial for the recognition system. Therefore, a speaker recognition system based on new spin-image descriptors (SISR) is proposed in this paper. In the proposed system, circular windows (spins) are extracted from the frequency domain of the spectrogram image of the sound, and then a run length matrix is built for each spin, to work as a base for feature extraction tasks. Five different descriptors are generated fro

View Publication

(7)

(2)

Publication Date

Wed Dec 01 2021

Journal Name

Journal Of Physics: Conference Series

Disc damage likelihood scale recognition for Glaucoma detection

Mohammed S.G.

...Show More Authors

Abstract<p>Glaucoma is a visual disorder, which is one of the significant driving reason for visual impairment. Glaucoma leads to frustrate the visual information transmission to the brain. Dissimilar to other eye illness such as myopia and cataracts. The impact of glaucoma can’t be cured; The Disc Damage Likelihood Scale (DDLS) can be used to assess the Glaucoma. The proposed methodology suggested simple method to extract Neuroretinal rim (NRM) region then dividing the region into four sectors after that calculate the width for each sector and select the minimum value to use it in DDLS factor. The feature was fed to the SVM classification algorithm, the DDLS successfully classified Glaucoma d</p> ... Show More

View Publication

(7)

(2)

Publication Date

Sat Oct 31 2020

Journal Name

International Journal Of Intelligent Engineering And Systems

Speech Emotion Recognition Using MELBP Variants of Spectrogram Image

Speech emotion

Spectrogram image

Multi-block extended local binary pattern (MELBP)

Deep beliefnetwork (DBN)

Short term fourier transform (STFT)

Suhaila N.

...Show More Authors

View Publication Preview PDF

(7)

(4)

Publication Date

Fri May 16 2014

Journal Name

International Journal Of Computer Applications

Design and Implementation of Real Time Face Recognition System (RTFRS)

Zahraa

Mohammed

...Show More Authors

View Publication

(6)

Publication Date

Sat Jan 01 2022

Journal Name

Ieee Access

Hand Gesture Recognition With Acoustic Myography and Wavelet Scattering Transform

Ali H.

Youssef

Rami N.

Slim

Kosai

...Show More Authors

View Publication Preview PDF

(15)

(16)

Publication Date

Tue Sep 01 2020

Journal Name

Baghdad Science Journal

Developing Arabic License Plate Recognition System Using Artificial Neural Network and Canny Edge Detection

Artificial Neural Network

Canny Edge

License Plate

Recognition System

Bydaa Ali

Mohammed Sadoon

...Show More Authors

In recent years, there has been expanding development in the vehicular part and the number of vehicles moving on the roads in all the sections of the country. Arabic vehicle number plate identification based on image processing is a dynamic area of this work; this technique is used for security purposes such as tracking of stolen cars and access control to restricted areas. The License Plate Recognition System (LPRS) exploits a digital camera to capture vehicle plate numbers is used as input to the proposed recognition system. Basically, the proposed system consists of three phases, vehicle license plate localization, character segmentation, and character recognition, the

View Publication Preview PDF

(10)

(4)

Publication Date

Sun Feb 25 2024

Journal Name

Baghdad Science Journal

Facial Emotion Images Recognition Based On Binarized Genetic Algorithm-Random Forest

Binarized genetic algorithm

Facial recognition

Facial emotion

Histograms of oriented gradients

Random forest

Yale face dataset

Murad Ibrahim Husin

Yusliza

Razana

Zuriahati Mohd

Mohamad Shukor

Haswadi

Fahad Taha

Musatafa Abbas Abbood

Majid Razaq Mohamed

Sharifah Zarith Rahmah Syed

...Show More Authors

Most recognition system of human facial emotions are assessed solely on accuracy, even if other performance criteria are also thought to be important in the evaluation process such as sensitivity, precision, F-measure, and G-mean. Moreover, the most common problem that must be resolved in face emotion recognition systems is the feature extraction methods, which is comparable to traditional manual feature extraction methods. This traditional method is not able to extract features efficiently. In other words, there are redundant amount of features which are considered not significant, which affect the classification performance. In this work, a new system to recognize human facial emotions from images is proposed. The HOG (Histograms of Or

View Publication Preview PDF

(12)

(11)

Publication Date

Tue Feb 20 2024

Journal Name

Baghdad Science Journal

Hetero-associative Memory Based New Iraqi License Plate Recognition

Rusul Hussein

Inaam Salman

Rasha Majid

Ali saif aldeen Aubaid

...Show More Authors

As a result of recent developments in highway research as well as the increased use of vehicles, there has been a significant interest paid to the most current, effective, and precise Intelligent Transportation System (ITS). In the field of computer vision or digital image processing, the identification of specific objects in an image plays a crucial role in the creation of a comprehensive image. There is a challenge associated with Vehicle License Plate Recognition (VLPR) because of the variation in viewpoints, multiple formats, and non-uniform lighting conditions at the time of acquisition of the image, shape, and color, in addition, the difficulties like poor image resolution, blurry image, poor lighting, and low contrast, these

View Publication

Publication Date

Thu Feb 07 2019

Journal Name

Journal Of The College Of Education For Women

SPEECH RECOGNITION OF ARABIC WORDS USING ARTIFICIAL NEURAL NETWORKS

Dr. Sadiq jassim

...Show More Authors

The speech recognition system has been widely used by many researchers using different
methods to fulfill a fast and accurate system. Speech signal recognition is a typical
classification problem, which generally includes two main parts: feature extraction and
classification. In this paper, a new approach to achieve speech recognition task is proposed by
using transformation techniques for feature extraction methods; namely, slantlet transform
(SLT), discrete wavelet transforms (DWT) type Daubechies Db1 and Db4. Furthermore, a
modified artificial neural network (ANN) with dynamic time warping (DTW) algorithm is
developed to train a speech recognition system to be used for classification and recognition
purposes. T

View Publication Preview PDF

1 2 ... 5 6 7 8 ... 1651 1652