Audio-visual detection and recognition systems are considered among the most promising methods for many applications, including surveillance, speech recognition, eavesdropping devices, and intelligence operations. In human recognition, most current research focuses either on re-identifying body images taken by several cameras or on audio-only recognition. However, in some cases these traditional methods are not useful when used alone. Indoor surveillance cameras, for example, are installed close to the ceiling and capture images from directly above; people do not always look straight at the cameras; and cameras cannot be installed in some areas, such as toilets or bedrooms. It is therefore often difficult to identify any movement or break-in. Likewise, when pursuing a suspect who enters a building or a party, it may be necessary to identify his location and/or listen to his speech alone, isolating it from other voices and noise. Hence, a hybrid combination of techniques is very effective. In this work, we propose a multimodal human recognition approach that uses both face and audio and is based on deep convolutional neural networks (CNNs). Mainly, to address the problem of part of the body not being captured, the final recognition results of separate CNNs, VGG Face16 and ResNet50, are combined at the score level by the weighted sum rule to enhance recognition performance. The results show that the proposed system succeeds in recognizing each person from his voice and/or his captured face. In addition, the system can separate a person's voice, isolate it from a noisy environment, and determine whether the desired person is present.
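The score-level combination by the weighted sum rule mentioned in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the min-max normalization step, the weight of 0.6, and the example scores are all assumptions made for illustration, and the abstract does not say how the weights were chosen or which network handles which modality.

```python
def min_max_normalize(scores):
    """Rescale scores to [0, 1]; a constant list maps to all zeros."""
    lo, hi = min(scores), max(scores)
    if hi == lo:
        return [0.0 for _ in scores]
    return [(s - lo) / (hi - lo) for s in scores]


def weighted_sum_fusion(face_scores, voice_scores, face_weight=0.5):
    """Fuse per-identity scores from two modalities with a weighted sum.

    face_weight is a hypothetical tuning parameter; the voice modality
    receives the complementary weight (1 - face_weight).
    """
    if len(face_scores) != len(voice_scores):
        raise ValueError("both modalities must score the same identities")
    if not 0.0 <= face_weight <= 1.0:
        raise ValueError("face_weight must lie in [0, 1]")
    face = min_max_normalize(face_scores)
    voice = min_max_normalize(voice_scores)
    return [face_weight * f + (1.0 - face_weight) * v
            for f, v in zip(face, voice)]


# Hypothetical classifier outputs for three enrolled identities.
face_scores = [0.20, 0.70, 0.10]
voice_scores = [0.60, 0.30, 0.10]

fused = weighted_sum_fusion(face_scores, voice_scores, face_weight=0.6)
best_identity = max(range(len(fused)), key=fused.__getitem__)
print(fused, best_identity)
```

In this made-up example the voice classifier alone would pick identity 0, while the fused score favors identity 1 because the face classifier is weighted more heavily. Normalizing each modality before summing keeps one classifier's score range from dominating the other.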
Facial recognition has been an active field of imaging science. With recent progress in computer vision, it is extensively applied in various areas, especially law enforcement and security. The human face is a viable biometric that can be used effectively in both identification and verification. Thus far, regardless of the facial model and metrics employed, its main shortcoming is that it requires a facial image against which a comparison is made. Therefore, closed-circuit television and a facial database are always needed in an operational system. Over the last few decades, unfortunately, we have experienced the emergence of asymmetric warfare, where acts of terrorism are often committed in secluded areas with no ...
In some cases, surgeons need to navigate through a computer system to reconfirm patients' details, but they are unable to manage both the computer system and the operation at the same time. In this paper, we propose a solution to this problem, designed especially for heart surgeons, by introducing a voice activation system with 3D visualization of angiographic images, 2D visualization of processed echocardiography video, and selected patient details. In this study, the processing and approximation of the 3D angiography and the visualization of the 2D echocardiography video with voice recognition control are the most challenging tasks. The work involves predicting the 3D coronary tree from 2D angiography images and also image enhan ...
Monaural source separation is a challenging problem because only a single channel is available, yet there is an unlimited range of possible solutions. In this paper, a monaural source separation model based on a hybrid deep learning model, which consists of a convolutional neural network (CNN), a dense neural network (DNN), and a recurrent neural network (RNN), is presented. A trial-and-error method is used to optimize the number of layers in the proposed model. Moreover, the effects of the learning rate, the optimization algorithm, and the number of epochs on separation performance are explored. Our model was evaluated using the MIR-1K dataset for singing voice separation. Moreover, the proposed approach achi ...
Advertising technology is one of the elements of visual attraction in the urban scape. It has made its way into the process of transmitting messages between the source of information (sender) and the destination of information (receiver), the final recipient of the message. It serves as a social marker and a means of cultural expression; it is an inalienable part of creating identity and determining spatial relationships, and it is also a reflection of the community's urban culture. This technology has become an increasingly prominent feature of the present era, characterized as the era of three revolutions (the information revolution, the technology revolution, and the media revolution), where it became an integral part of the visual ...
Estimating an individual's age from a photograph of their face is critical in many applications, including intelligence and defense, border security, human-machine interaction, and soft biometric recognition. Recent progress in this discipline has focused on deep learning. These solutions require creating and training deep neural networks for the sole purpose of solving this problem. In addition, deep neural networks pre-trained for facial recognition are used in research and fine-tuned to obtain accurate results. The purpose of this study was to offer a method for estimating human age from the frontal view of the face that is as accurate as possible and takes ...
The Qur'an is an inexhaustible source for researchers, all of whom find in it rich material for their research, and no wonder, for it is the greatest book in Arabic. Qur'anic research has been an attempt to uncover the secret of the miracle of the Qur'an. The Qur'anic miracle is not limited to the word and its meaning; it extends to every sound, whether in motion or silent. The vocal performance of the Qur'anic text adds beauty to the meaning and gives the word its effect on hearts and souls; this may be due to the beauty of the voice in performance, the harmony between sounds and words, and the harmony between the exits and descriptions, or the tides of the tides,
Based on the above and to show the miraculous aspects of the Qu ...
Recently, many materials have been shown to be usable as alternatives to chemical additives for improving the properties of drilling fluids. Among them are banana peels and corn cobs, both of which are considered environmentally friendly. X-ray diffraction results showed that the main components of these materials are cellulose and hemicellulose, which contribute greatly to their effectiveness. Owing to their distinct composition, these two materials improved the rheological properties (plastic viscosity and yield point) and greatly reduced the filtration of the drilling fluids. The addition rates used for each o ...
Voice studies are one of the sensory studies of their first association with auditory taste, which broadcasts its connection to the most sensory point contained in the physical corners, and in order to satisfy our conscience from these fundamental voices, we searched for in modern contexts, high proportions related to the words of Muhammadiyah, so we are looking for (events In the hadiths of the Prophet (peace be upon him), to settle our journey when the true adhkaar received from the Prophet, peace be upon him, and to control us start a journey through which we repeat between the cities of audio repetitions, sometimes we find ourselves have stood at the entrance of what corresponds to the sounds between the title of the Hadith and its t ...
Three-dimensional (3D) reconstruction from images is a highly beneficial, photo-realistic method of object reconstruction that can be used in many fields. In industry, it can be used to visualize cracks within alloys or walls. In medicine, it has been used as a 3D scanner to reconstruct human organs, such as the inside of the nose for plastic surgery or the ear canal for fabricating a hearing aid. These applications need highly accurate detail and measurement, which is the main issue to be taken into consideration; other issues are cost, portability, and ease of use. This work has presented an approach for design and construc ...
The chemical oxygen demand (COD) of refinery wastewater was successfully reduced by a sono-Fenton process, which combines ultrasonic irradiation with the Fenton process, using response surface methodology (RSM) with a central composite design (CCD). The impact of the two main influential operational parameters (iron dosage and reaction time) on COD removal from wastewater generated by an Iraqi petroleum refinery was explored. A removal of 85.81% was attained under the optimal conditions of 21 minutes and an iron concentration of 0.289 mM. Additionally, the results revealed that the iron concentration has the greatest effect on COD elimination, followed by reaction time. The high R² value (96.40%) validated the strong fit of the mo ...