Human recognition by utilizing voice recognition and visual recognition

Sukaina Sh Altyar; Samera Shams Hussein; Mahir Jasem Mohammed

Details

Publication Date

Sat Jan 01 2022

Journal Name

International Journal Of Nonlinear Analysis And Applications

Volume

13

Issue Number

1

Choose Citation Style

Statistics

View publication

13

Statistics

Human recognition by utilizing voice recognition and visual recognition

Deep learning Convolutional Neural Networks Human Recognition voice recognition visual recognition

Sukaina Sh Altyar

Samera Shams Hussein

Mahir Jasem Mohammed

...Show More Authors

Audio-visual detection and recognition system is thought to become the most promising methods for many applications includes surveillance, speech recognition, eavesdropping devices, intelligence operations, etc. In the recent field of human recognition, the majority of the research be- coming performed presently is focused on the reidentification of various body images taken by several cameras or its focuses on recognized audio-only. However, in some cases these traditional methods can- not be useful when used alone such as in indoor surveillance systems, that are installed close to the ceiling and capture images right from above in a downwards direction and in some cases people don't look straight the cameras or it cannot be added in some area such as W.C. or sleeping room. Thus, its commonly difficult to identify any movement or breakthrough process, on the other hand when need to pursue suspect when enter a building or party to identify his location and/or listen to his speech only and isolate it from other voices or noises, the other. Hence, the use of the hybrid combination technique is very effective. In this work, we proposed a multimodal human recognition approach that utilizes both the face and audio and is based upon a deep convolutional neural network (CNN). Mainly, to solve the challenge of not capturing part of the body, final results of recognizing via separate CNNs of VGG Face16 and ResNet50 are joined together depending on the score-level combination by Weighted Sum rule to enhance recognition performance. The results show that the proposed system success to recognise each person from his voice and/or his face captured. In addition, the system can separate the person voice and isolate it from noisy environment and determine the existence of desired person.

View Publication Preview PDF

Quick Preview PDF

Publication Date

Mon Sep 03 2018

Journal Name

Al-academy

Perspective Cognitive Warm-up Voice

Lubna

...Show More Authors

The aim of this research is to find out about the methods used by the teachers of the subjects (choir, voice training, singing groups) used to warm up in voice training. In the Department of Music of the Faculty of Fine Arts University of Baghdad. The limits of this research were for the academic year (2017-2018). Explanation in the theoretical framework of warm-up types The first part of the body warms the body in terms of relaxation, body moderation, head rotation, tongue exercises, mouth opening, facial mask movements, yawning.The second course will warm up the sound exercises warm up the sound through different ladders (diatonic and chromate), and ladder accordions.And the third topic warm up the impris

View Publication Preview PDF

Publication Date

Thu Feb 01 2018

Journal Name

Journal Of Computational And Theoretical Nanoscience

Controlling of Robot Hand by Using Microcontroller with Visual Basic

Robot Hand (EDARM ED-7100)

Controlling and Monitoring

AT89S52 Microcontroller

Visual Basic.

Huda Hatam

Faieza Abdul

Wan Zuha Wan

Mohd Khairol Anuar Mohd

...Show More Authors

The robot arm is the most popular robotic form used in industry. Thus, it is crucial to make a system programming which could controlled the movement of each part in the industrial robot to make it works properly. One of the simplest models of the robot arm is EDARM ED-7100 which has a controller to control the movement of the robot arm manually. In this study, the robot controller has been redesigned in order to improve this robot's function. The new controller system used AT89S52 microcontroller which has wire connected to the robot hand. A function has been added with this controller to improve the system of controlling and becomes better than the previous system (only manually). The functions of the new system include three mo

View Publication

Publication Date

Mon Sep 30 2024

Journal Name

Iraqi Journal Of Chemical And Petroleum Engineering

Elimination of phenol by sonoelctrochemical process utilizing graphite, stainless steel, and titanium anodes: optimization by taguchi approach

organic pollutant

indirect oxidation

wastewater

ultrasonic

removal.

Hind Jabbar

Najwa Saber

Rasha H.

Khalid M.

...Show More Authors

Phenol is one of the worst-damaging organic pollutants, and it produces a variety of very poisonous organic intermediates, thus it is important to find efficient ways to eliminate it. One of the promising techniques is sonoelectrochemical processing. However, the type of electrodes, removal efficiency, and process cost are the biggest challenges. The main goal of the present study is to investigate the removal of phenol by a sonoelectrochemical process with different anodes, such as graphite, stainless steel, and titanium. The best anode performance was optimized by using the Taguchi approach with an L16 orthogonal array. the degradation of phenol sonoelectrochemically was investigated with three process parameters: current de

View Publication Preview PDF

Publication Date

Sat Jul 27 2019

Journal Name

Sensors

Neurophysiological Characterization of a Non-Human Primate Model of Traumatic Spinal Cord Injury Utilizing Fine-Wire EMG Electrodes

Masood F.

...Show More Authors

This study aims to characterize traumatic spinal cord injury (TSCI) neurophysiologically using an intramuscular fine-wire electromyography (EMG) electrode pair. EMG data were collected from an agonist-antagonist pair of tail muscles of Macaca fasicularis, pre- and post-lesion, and for a treatment and control group. The EMG signals were decomposed into multi-resolution subsets using wavelet transforms (WT), then the relative power (RP) was calculated for each individual reconstructed EMG sub-band. Linear mixed models were developed to test three hypotheses: (i) asymmetrical volitional activity of left and right side tail muscles (ii) the effect of the experimental TSCI on the frequency content of the EMG signal, (iii) and the effect

View Publication

(5)

(4)

Publication Date

Tue Dec 01 2020

Journal Name

Journal Of Engineering

Performance of 2- Link Robot by utilizing Adaptive Sliding Mode Controller

Classical Sliding Mode Controller

Adaptive Sliding Mode Controller

signum function

saturation function

chattering

Dena Hameed

Ahmed Khalaf

...Show More Authors

The Sliding Mode Control (SMC) has been among powerful control techniques increasingly. Much attention is paid to both theoretical and practical aspects of disciplines due to their distinctive characteristics such as insensitivity to bounded matched uncertainties, reduction of the order of sliding equations of motion, decoupling mechanical systems design. In the current study, two-link robot performance in the Classical SMC is enhanced via Adaptive Sliding Mode Controller (ASMC) despite uncertainty, external disturbance, and coulomb friction. The key idea is abstracted as follows: switching gains are depressed to the low allowable values, resulting in decreased chattering motion and control's efforts of the two-link robo

View Publication Preview PDF

Publication Date

Thu Jul 28 2016

Journal Name

Computer And Information Science

Refinement for Ocular Ultrasound Images Quality by Utilizing Combination of Enhancement Techniques

Zinah Rajab

Zaid Rajab

...Show More Authors

Ultrasound has been used as a diagnostic modality for many intraocular diseases, due its safety, low cost, real time and wide availability. Unfortunately, ultrasound images suffer from speckle artifact that are tissue dependent. In this work, we will offer a method to reduce speckle noise and improve ultrasound image to raise the human diagnostic performance. This method combined undecimated wavelet transform with a wavelet coefficient mapping function: where UDWT used to eliminate the noise and a wavelet coefficient mapping function used to enhance the contrast of denoised images obtained from the first component. This methods can be used not only as a means for improving visual quality of medical images but also as a preprocessing

View Publication Preview PDF

Publication Date

Sun Apr 02 2023

Journal Name

Mathematical Modelling Of Engineering Problems

Traffic Classification of IoT Devices by Utilizing Spike Neural Network Learning Approach

Ahmed R.

Nadia Adnan Shiltagh

Ibtesam R.K.

...Show More Authors

Whenever, the Internet of Things (IoT) applications and devices increased, the capability of the its access frequently stressed. That can lead a significant bottleneck problem for network performance in different layers of an end point to end point (P2P) communication route. So, an appropriate characteristic (i.e., classification) of the time changing traffic prediction has been used to solve this issue. Nevertheless, stills remain at great an open defy. Due to of the most of the presenting solutions depend on machine learning (ML) methods, that though give high calculation cost, where they are not taking into account the fine-accurately flow classification of the IoT devices is needed. Therefore, this paper presents a new model bas

View Publication

(9)

(7)

Publication Date

Sun Mar 01 2020

Journal Name

Baghdad Science Journal

Eyewitnesses’ Visual Recollection in Suspect Identification by using Facial Appearance Model

Active Appearance Model

Facial Morphology

Suspect Identification

Horkaew

...Show More Authors

Facial recognition has been an active field of imaging science. With the recent progresses in computer vision development, it is extensively applied in various areas, especially in law enforcement and security. Human face is a viable biometric that could be effectively used in both identification and verification. Thus far, regardless of a facial model and relevant metrics employed, its main shortcoming is that it requires a facial image, against which comparison is made. Therefore, closed circuit televisions and a facial database are always needed in an operational system. For the last few decades, unfortunately, we have experienced an emergence of asymmetric warfare, where acts of terrorism are often committed in secluded area with no

View Publication Preview PDF

(3)

Publication Date

Thu Nov 01 2012

Journal Name

Journal Of Computer Science

VOICE ACTIVATION VISUALIZATION FOR ECHOCARDIOGRAPH AND 3D ANGIOGRAPHIC IMAGES IN SURGERY

Zinah R.

...Show More Authors

In some cases, surgeons need to navigate through the computer system for reconfirmation patients’ details and unfortunately surgeons unable to manage both computer system and operation at the same time. In this paper we propose a solution for this problem especially designed for heart surgeon, by introducing voice activation system with 3D visualization of Angiographic images, 2D visualization of Echocardiography processed video and selected patient’s details. In this study, the processing, approximation of the 3D angiography and the visualization of the 2D echocardiography video with voice recognition control are the most challenging work. The work involve with predicting 3D coronary three from 2D angiography image and also image enhan

View Publication Preview PDF

(1)

Publication Date

Tue Dec 21 2021

Journal Name

Mendel

Hybrid Deep Learning Model for Singing Voice Separation

R.

A.

...Show More Authors

Monaural source separation is a challenging issue due to the fact that there is only a single channel available; however, there is an unlimited range of possible solutions. In this paper, a monaural source separation model based hybrid deep learning model, which consists of convolution neural network (CNN), dense neural network (DNN) and recurrent neural network (RNN), will be presented. A trial and error method will be used to optimize the number of layers in the proposed model. Moreover, the effects of the learning rate, optimization algorithms, and the number of epochs on the separation performance will be explored. Our model was evaluated using the MIR-1K dataset for singing voice separation. Moreover, the proposed approach achi

View Publication

(4)

1 2 ... 12 13 14 15 ... 1637 1638