An automatic lip reading for short sentences using deep learning nets

Maha Abd Rajab; Kadhim Hashim

doi:10.26555/ijain.v9i1.920

Details

Publication Date

Wed Mar 15 2023

Journal Name

International Journal Of Advances In Intelligent Informatics

Volume

9

DOI

10.26555/ijain.v9i1.920

Choose Citation Style

Statistics

View publication

7

Statistics

(6)

(3)

An automatic lip reading for short sentences using deep learning nets

Maha Abd Rajab

Kadhim Hashim

...Show More Authors

One study whose importance has significantly grown in recent years is lip-reading, particularly with the widespread of using deep learning techniques. Lip reading is essential for speech recognition in noisy environments or for those with hearing impairments. It refers to recognizing spoken sentences using visual information acquired from lip movements. Also, the lip area, especially for males, suffers from several problems, such as the mouth area containing the mustache and beard, which may cover the lip area. This paper proposes an automatic lip-reading system to recognize and classify short English sentences spoken by speakers using deep learning networks. The input video extracts frames and each frame is passed to the Viola-Jones to detect the face area. Then 68 landmarks of the facial area are determined, and the landmarks from 48 to 68 represent the lip area extracted based on building a binary mask. Then, the contrast is enhanced to improve the quality of the lip image by applying contrast adjustment. Finally, sentences are classified using two deep learning models, the first is AlexNet, and the second is VGG-16 Net. The database consists of 39 participants (32 males and 7 females). Each participant repeats the short sentences five times. The outcomes demonstrate the accuracy rate of AlexNet is 90.00%, whereas the accuracy rate for VGG-16 Net is 82.34%. We concluded that AlexNet performs better for classifying short sentences than VGG-16 Net.

View Publication

Publication Date

Mon Apr 01 2024

Journal Name

Telkomnika (telecommunication Computing Electronics And Control)

Classification of grapevine leaves images using VGG-16 and VGG-19 deep learning nets

Maha

Firas A.

Tole

...Show More Authors

The successful implementation of deep learning nets opens up possibilities for various applications in viticulture, including disease detection, plant health monitoring, and grapevine variety identification. With the progressive advancements in the domain of deep learning, further advancements and refinements in the models and datasets can be expected, potentially leading to even more accurate and efficient classification systems for grapevine leaves and beyond. Overall, this research provides valuable insights into the potential of deep learning for agricultural applications and paves the way for future studies in this domain. This work employs a convolutional neural network (CNN)-based architecture to perform grapevine leaf image classifi

View Publication

(10)

Publication Date

Fri Dec 01 2023

Journal Name

Al-khwarizmi Engineering Journal

An Overview of Audio-Visual Source Separation Using Deep Learning

Noorulhuda Mudhafar

Ahmed

Mohammed Najah

...Show More Authors

In this article, the research presents a general overview of deep learning-based AVSS (audio-visual source separation) systems. AVSS has achieved exceptional results in a number of areas, including decreasing noise levels, boosting speech recognition, and improving audio quality. The advantages and disadvantages of each deep learning model are discussed throughout the research as it reviews various current experiments on AVSS. The TCD TIMIT dataset (which contains top-notch audio and video recordings created especially for speech recognition tasks) and the Voxceleb dataset (a sizable collection of brief audio-visual clips with human speech) are just a couple of the useful datasets summarized in the paper that can be used to test A

View Publication Preview PDF

(1)

Publication Date

Thu Jun 01 2023

Journal Name

International Journal Of Electrical And Computer Engineering (ijece)

An optimized deep learning model for optical character recognition applications

Salih S.Q.

NUHA SAMI

...Show More Authors

The convolutional neural networks (CNN) are among the most utilized neural networks in various applications, including deep learning. In recent years, the continuing extension of CNN into increasingly complicated domains has made its training process more difficult. Thus, researchers adopted optimized hybrid algorithms to address this problem. In this work, a novel chaotic black hole algorithm-based approach was created for the training of CNN to optimize its performance via avoidance of entrapment in the local minima. The logistic chaotic map was used to initialize the population instead of using the uniform distribution. The proposed training algorithm was developed based on a specific benchmark problem for optical character recog

View Publication

(1)

Publication Date

Wed Jun 16 2021

Journal Name

Cognitive Computation

Deep Transfer Learning for Improved Detection of Keratoconus using Corneal Topographic Maps

Ali H.

Nebras H.

Zahraa M.

Javier

...Show More Authors

Abstract <p>Clinical keratoconus (KCN) detection is a challenging and time-consuming task. In the diagnosis process, ophthalmologists must revise demographic and clinical ophthalmic examinations. The latter include slit-lamb, corneal topographic maps, and Pentacam indices (PI). We propose an Ensemble of Deep Transfer Learning (EDTL) based on corneal topographic maps. We consider four pretrained networks, SqueezeNet (SqN), AlexNet (AN), ShuffleNet (SfN), and MobileNet-v2 (MN), and fine-tune them on a dataset of KCN and normal cases, each including four topographic maps. We also consider a PI classifier. Then, our EDTL method combines the output probabilities of each of the five classifiers to obtain a decision b</p> ... Show More

View Publication

(34)

(30)

Publication Date

Fri Sep 01 2023

Journal Name

Journal Of Engineering

Iraqi Sentiment and Emotion Analysis Using Deep Learning

Emotion analysis

Sentiment analysis

CNN

GRU

Iraqi dialect

Anwar Abdul-Razzaq

Nada A. Z.

...Show More Authors

Analyzing sentiment and emotions in Arabic texts on social networking sites has gained wide interest from researchers. It has been an active research topic in recent years due to its importance in analyzing reviewers' opinions. The Iraqi dialect is one of the Arabic dialects used in social networking sites, characterized by its complexity and, therefore, the difficulty of analyzing sentiment. This work presents a hybrid deep learning model consisting of a Convolution Neural Network (CNN) and the Gated Recurrent Units (GRU) to analyze sentiment and emotions in Iraqi texts. Three Iraqi datasets (Iraqi Arab Emotions Data Set (IAEDS), Annotated Corpus of Mesopotamian-Iraqi Dialect (ACMID), and Iraqi Arabic Dataset (IAD)) col

View Publication Preview PDF

(4)

Publication Date

Mon Jan 01 2024

Journal Name

Journal Of Engineering

Face-based Gender Classification Using Deep Learning Model

Alex-Net

CLAHE

Deep learning

Gender Classification

Buraq Abed Ruda

Faten Abed Ali

...Show More Authors

Gender classification is a critical task in computer vision. This task holds substantial importance in various domains, including surveillance, marketing, and human-computer interaction. In this work, the face gender classification model proposed consists of three main phases: the first phase involves applying the Viola-Jones algorithm to detect facial images, which includes four steps: 1) Haar-like features, 2) Integral Image, 3) Adaboost Learning, and 4) Cascade Classifier. In the second phase, four pre-processing operations are employed, namely cropping, resizing, converting the image from(RGB) Color Space to (LAB) color space, and enhancing the images using (HE, CLAHE). The final phase involves utilizing Transfer lea

View Publication Preview PDF

(2)

Publication Date

Mon Jun 01 2020

Journal Name

Journal Of Engineering

Arabic Sentiment Analysis (ASA) Using Deep Learning Approach

Deep Learning (DL)

Machine Learning (ML)

Arabic Sentiment Analysis (ASA)

word embedding

Long-Short Term Memory (LSTM)

features

Abdulhakeem Qusay

Ahmed S.

Saman Hameed

...Show More Authors

Sentiment analysis is one of the major fields in natural language processing whose main task is to extract sentiments, opinions, attitudes, and emotions from a subjective text. And for its importance in decision making and in people's trust with reviews on web sites, there are many academic researches to address sentiment analysis problems. Deep Learning (DL) is a powerful Machine Learning (ML) technique that has emerged with its ability of feature representation and differentiating data, leading to state-of-the-art prediction results. In recent years, DL has been widely used in sentiment analysis, however, there is scarce in its implementation in the Arabic language field. Most of the previous researches address other l

View Publication Preview PDF

(23)

Publication Date

Sat Jan 19 2019

Journal Name

Artificial Intelligence Review

Survey on supervised machine learning techniques for automatic text classification

Kadhim A.I.

...Show More Authors

View Publication

(300)

(271)

Publication Date

Mon Jan 09 2023

Journal Name

2023 15th International Conference On Developments In Esystems Engineering (dese)

Deep Learning-Based Speech Enhancement Algorithm Using Charlier Transform

Sally Antoin

Hala Jassim

Hayder Saadi Radeaf

Basheera M.

Sadiq H.

...Show More Authors

View Publication

(7)

(5)

Publication Date

Sun Nov 01 2020

Journal Name

Iop Conference Series: Materials Science And Engineering

3D scenes semantic segmentation using deep learning based Survey

Noori A.Y.

Shaimaa Hameed

Azeez R.A.

...Show More Authors

Abstract<p>Semantic segmentation realization and understanding is a stringent task not just for computer vision but also in the researches of the sciences of earth, semantic segmentation decompose compound architectures in one elements, the most mutual object in a civil outside or inside senses must classified then reinforced with information meaning of all object, it’s a method for labeling and clustering point cloud automatically. Three dimensions natural scenes classification need a point cloud dataset to representation data format as input, many challenge appeared with working of 3d data like: little number, resolution and accurate of three Dimensional dataset . Deep learning now is the po</p> ... Show More

View Publication

(1)

1 2 3 4 ... 976 977 978 979