Human recognition by utilizing voice recognition and visual recognition

Sukaina Sh Altyar; Samera Shams Hussein; Mahir Jasem Mohammed

Details

Publication Date

Sat Jan 01 2022

Journal Name

International Journal Of Nonlinear Analysis And Applications

Volume

13

Issue Number

1

Deep learning; Convolutional Neural Networks; Human recognition; Voice recognition; Visual recognition

Audio-visual detection and recognition systems are considered among the most promising methods for many applications, including surveillance, speech recognition, eavesdropping devices, and intelligence operations. In the field of human recognition, most current research focuses either on re-identifying body images taken by several cameras or on audio-only recognition. However, in some cases these traditional methods are not useful when used alone. Indoor surveillance cameras, for example, are installed close to the ceiling and capture images from directly above; people do not always look straight at the cameras; and cameras cannot be installed in some areas, such as toilets or bedrooms. It is therefore often difficult to identify a movement or a break-in. Likewise, when pursuing a suspect who enters a building or a party, it may be necessary to identify his location and/or to listen to his speech alone, isolating it from other voices and noise. Hence, a hybrid combination technique is very effective. In this work, we propose a multimodal human recognition approach that uses both face and audio and is based on deep convolutional neural networks (CNNs). Mainly, to address the challenge of part of the body not being captured, the final results of recognition by separate VGG Face16 and ResNet50 CNNs are joined by score-level combination using the weighted sum rule to enhance recognition performance. The results show that the proposed system succeeds in recognizing each person from his voice and/or his captured face. In addition, the system can separate a person's voice, isolate it from a noisy environment, and determine whether the desired person is present.
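The score-level combination by weighted sum rule mentioned in the abstract can be illustrated with a minimal sketch. Everything below is an assumption made for illustration, not the authors' implementation: the per-identity score dictionaries stand in for the outputs of the two CNNs, and the min-max normalization, the `face_weight` value of 0.6, and the function names are hypothetical.

```python
from typing import Dict, Optional

Scores = Dict[str, float]  # identity -> match score from one modality


def min_max_normalize(scores: Scores) -> Scores:
    """Rescale one modality's scores to [0, 1] so the modalities are comparable."""
    low, high = min(scores.values()), max(scores.values())
    if high == low:  # all scores equal: no identity is preferred
        return {identity: 0.0 for identity in scores}
    return {identity: (s - low) / (high - low) for identity, s in scores.items()}


def weighted_sum_fusion(
    face: Optional[Scores],
    voice: Optional[Scores],
    face_weight: float = 0.6,  # hypothetical weight, not taken from the paper
) -> Scores:
    """Fuse two score sets with the weighted sum rule.

    If one modality is missing (for example, the face was not captured),
    fall back to the other modality alone.
    """
    if face is None and voice is None:
        raise ValueError("at least one modality is required")
    if face is None:
        return min_max_normalize(voice)
    if voice is None:
        return min_max_normalize(face)
    face_n, voice_n = min_max_normalize(face), min_max_normalize(voice)
    shared = face_n.keys() & voice_n.keys()  # identities scored by both modalities
    return {
        identity: face_weight * face_n[identity]
        + (1.0 - face_weight) * voice_n[identity]
        for identity in shared
    }


def identify(fused: Scores) -> str:
    """Return the identity with the highest fused score."""
    return max(fused, key=fused.get)


# Made-up scores for three enrolled identities.
face_scores = {"alice": 0.9, "bob": 0.4, "carol": 0.1}
voice_scores = {"alice": 0.2, "bob": 0.8, "carol": 0.3}
best = identify(weighted_sum_fusion(face_scores, voice_scores))
```

With these made-up scores, a strong voice match can outweigh a weaker face match, and raising `face_weight` shifts the decision back toward the face modality. In practice the weight would be tuned on validation data.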

Publication Date

Tue Oct 01 2019

Journal Name

2019 IEEE 9th International Conference on System Engineering and Technology (ICSET)

A Digital Signature System Based on Real Time Face Recognition

Digital signature

Face recognition

SHA256

Asraa

Taha

Firas A.

Mustafa S.

Mohd Shafry Mohd

This study proposes a biometric-based digital signature scheme built on facial recognition. The scheme is designed and built to verify a person’s identity during the registration process and to retrieve their public and private keys stored in the database. The RSA algorithm is used as the asymmetric encryption method to encrypt the hashes generated for digital documents. It uses the hash function (SHA-256) to generate digital signatures. In this study, local binary patterns histograms (LBPH) were used for facial recognition. The facial recognition method was evaluated on ORL faces retrieved from the database of Cambridge University. From the analysis, the LBPH algorithm achieved 97.5% accuracy; the real-time testing was done on thirty subj
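The hash-then-sign step described in this abstract (SHA-256 digest, then RSA applied to the digest) can be sketched as follows. This is a toy illustration under loud assumptions: the key is built from two small Mersenne primes chosen only so the example runs with the standard library, there is no padding scheme, and the digest is reduced modulo `N`, so it is not secure. The face recognition (LBPH) and key database steps are omitted, and all names are hypothetical.

```python
import hashlib

# Toy RSA key (assumption for illustration only; far too small for real use).
P = 2**61 - 1          # Mersenne prime
Q = 2**89 - 1          # Mersenne prime
N = P * Q              # public modulus
E = 65537              # public exponent
D = pow(E, -1, (P - 1) * (Q - 1))  # private exponent (requires Python 3.8+)


def digest_as_int(document: bytes) -> int:
    """SHA-256 digest of the document, reduced modulo N to fit the toy key."""
    return int.from_bytes(hashlib.sha256(document).digest(), "big") % N


def sign(document: bytes) -> int:
    """Sign by applying the private exponent to the document's digest."""
    return pow(digest_as_int(document), D, N)


def verify(document: bytes, signature: int) -> bool:
    """Recover the digest with the public exponent and compare it to a fresh digest."""
    return pow(signature, E, N) == digest_as_int(document)


signature = sign(b"contract v1")
```

A real system would use a vetted cryptographic library with a standard padding scheme rather than textbook RSA; the sketch only shows how the hash and the key pair interact.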

Publication Date

Sun Oct 01 2023

Journal Name

Baghdad Science Journal

Using VGG Models with Intermediate Layer Feature Maps for Static Hand Gesture Recognition

Convolutional Neural Networks

Deep Learning

Hand Gesture Recognition

VGG-16

VGG-19.

Osamah Y.

Bashar S

Ayad R.

A hand gesture recognition system provides a robust and innovative solution to nonverbal communication through human–computer interaction. Deep learning models have excellent potential for usage in recognition applications. To overcome related issues, most previous studies have proposed new model architectures or have fine-tuned pre-trained models. Furthermore, these studies relied on one standard dataset for both training and testing. Thus, the accuracy of these studies is reasonable. Unlike these works, the current study investigates two deep learning models with intermediate layers to recognize static hand gesture images. Both models were tested on different datasets, adjusted to suit the dataset, and then trained under different m

Publication Date

Thu Dec 01 2022

Journal Name

Journal Of Education For Pure Science- University Of Thi-qar

Dorsal Hand Vein Image Recognition: A Review

Maha A.

Subcutaneous vascularization has become a new solution for identification management over the past few years. Systems based on dorsal hand veins are particularly promising for high-security settings. The dorsal hand vein recognition system comprises the following steps: acquiring images from the database and preprocessing them, locating the region of interest, and extracting and recognizing information from the dorsal hand vein pattern. This paper reviewed several techniques for obtaining the dorsal hand vein area and identifying a person. Therefore, this study just provides a comprehensive review of existing previous theories. This model aims to offer the improvement in the accuracy rate of the system that was shown in previous studies and

Publication Date

Mon Jun 05 2023

Journal Name

Journal Of Engineering

Isolated Word Speech Recognition Using Mixed Transform

Mixed Transform

Radon Transform

Discrete Wavelet Transform

Discrete Multicircularlet Transform

Dynamic Time Warping

Sadiq Jassim

Shahad Mujeeb

Methods of speech recognition have been the subject of several studies over the past decade. Speech recognition has been one of the most exciting areas of signal processing. The mixed transform is a useful tool for speech signal processing; it was developed for its ability to improve feature extraction. Speech recognition includes three important stages: preprocessing, feature extraction, and classification. Recognition accuracy is strongly affected by the feature extraction stage; therefore, different models of mixed transform for feature extraction were proposed. The recorded isolated word is a 1-D signal, and each 1-D word is converted into a 2-D form. The second step of the word recognizer requires, the

Publication Date

Thu Jun 29 2023

Journal Name

Iraqi Journal Of Computer, Communication, Control And System Engineering

Recognition of Upper Limb Movements Based on Hybrid EEG and EMG Signals for Human-Robot Interaction

Huda

Alia

Ali H.

Upper limb amputation is a condition that severely limits the amputee’s movement. Patients who have lost the use of one or more of their upper extremities have difficulty performing activities of daily living. To help improve the control of upper limb prostheses with pattern recognition, this paper proposes non-invasive approaches (EEG and EMG signals) integrated with machine learning techniques to recognize the upper-limb motions of subjects. EMG and EEG signals are combined, and five features are used to classify seven hand movements: wrist flexion (WF), outward part of the wrist (WE), hand open (HO), hand close (HC), pronation (PRO), supination (SUP), and rest (RST). Experiments demonstrate that usin

Publication Date

Thu Oct 01 2020

Journal Name

Defence Technology

A novel facial emotion recognition scheme based on graph mining

Emotion recognition

Facial landmarks

Graph mining

gSpan algorithm

Binary cat swarm optimization (BCSO)

Neural network

Suhaila N.

Recent years have seen an explosion in graph data from a variety of scientific, social and technological fields. Among these fields, emotion recognition is an interesting research area because it has many real-life applications, such as effective social robotics that increases the interactivity of the robot with humans, driver safety, and pain monitoring during surgery. This paper proposes a novel facial emotion recognition scheme based on graph mining that changes the way the face region is represented: the face region is represented as a graph of nodes and edges, and the gSpan frequent sub-graph mining algorithm is used to find the frequent sub-structures in the graph database of each emotion. T

Publication Date

Thu Mar 21 2019

Journal Name

J. Eng. Appl. Sci

Developing an Arabic handwritten recognition system by means of artificial neural network

Ali

Mohammed

Handwritten text recognition remains a major challenge for researchers. Several approaches to this challenge have been attempted in recent years, mostly concentrating on English pre-printed or handwritten characters. Consequently, research on Arabic handwritten text recognition is needed. Arabic handwriting presents unique technical difficulties because it is cursive, written from right to left, and its letters change shape and structure depending on whether they appear at the beginning, in the middle, or at the end of a word, or in isolation. In this study, an Arabic text recognition system is designed and developed to recognize images of Arabic text/characters. The proposed model gets a single l

Publication Date

Tue Dec 05 2023

Journal Name

Baghdad Science Journal

An improved neurogenetic model for recognition of 3D kinetic data of human extracted from the Vicon Robot system

Breaking-up process

Combining process

Crossover

Feedforward ANN

Mutation

Neuro-Genetic model

Optimization

Recognition

Vicon Robot

3D data

Ivan V.

Safa A.

These days, it is crucial to discern between different types of human behavior, and artificial intelligence techniques play a big part in that. The characteristics of the feedforward artificial neural network (FANN) algorithm and the genetic algorithm have been combined to create an important working mechanism that aids in this field. The proposed system can be used for essential tasks in life, such as analysis, automation, control, recognition, and other tasks. Crossover and mutation are the two primary mechanisms used by the genetic algorithm in the proposed system to replace the back propagation process in ANN. While the feedforward artificial neural network technique is focused on input processing, this should be based on the proce

Publication Date

Wed Aug 01 2012

Journal Name

i-manager's Journal on Information Technology

A MODULE FOR ENHANCING RECOGNITION SYSTEM FOR QR CODE SCANNED IMAGE

QR Code

Finder Pattern

Recognition System

Furat N.

Yasmine

A QR code is a type of barcode that can hold more information than the familiar kind scanned at checkouts around the world. The “QR” stands for “Quick Response”, a reference to the speed at which scanners can decode the large amounts of information these codes contain. QR codes are widely used for advertising campaigns, links to company websites, contest sign-up pages, and online menus. In this paper, we propose an efficient module that extracts the QR code from the background and solves the rotation problem that arises when an image is taken inaccurately with a mobile camera.

Publication Date

Sat Apr 01 2023

Journal Name

Bulletin Of Electrical Engineering And Informatics

Accurate license plate recognition system for different styles of Iraqi license plates

Automatic license plate Recognition; Arabic license plate; Pattern recognition; ALPR; Plate Recognition

Sukaina

Samera

Lubab

Automatic license plate recognition (ALPR) is used in many applications, especially security applications such as border control. However, more accurate and language-independent techniques are still needed. This work provides a new approach to identifying Arabic license plates in different formats and colors, including plates with English characters. Numbers, characters, and both 1-line and 2-line layouts are covered. For the test, we intend to use Iraqi license plates because there is a wide range of license plate styles written in Arabic, Kurdish, and English/Arabic, each different in style and color. This variety makes it difficult for recent traditional license plate recognition systems and algorithms to recogn
