An optimized deep learning model for optical character recognition applications

Salih S.Q. Salih S.Q.; NUHA SAMI MOHSIN

doi:10.11591/ijece.v13i3.pp3010-3018

Details

Publication Date

Thu Jun 01 2023

Journal Name

International Journal Of Electrical And Computer Engineering (ijece)

Volume

13

DOI

10.11591/ijece.v13i3.pp3010-3018

Choose Citation Style

Statistics

View publication

11

Statistics

(1)

An optimized deep learning model for optical character recognition applications

Salih S.Q. Salih S.Q.

NUHA SAMI MOHSIN

...Show More Authors

The convolutional neural networks (CNN) are among the most utilized neural networks in various applications, including deep learning. In recent years, the continuing extension of CNN into increasingly complicated domains has made its training process more difficult. Thus, researchers adopted optimized hybrid algorithms to address this problem. In this work, a novel chaotic black hole algorithm-based approach was created for the training of CNN to optimize its performance via avoidance of entrapment in the local minima. The logistic chaotic map was used to initialize the population instead of using the uniform distribution. The proposed training algorithm was developed based on a specific benchmark problem for optical character recognition applications; the proposed method was evaluated for performance in terms of computational accuracy, convergence analysis, and cost.

View Publication

Publication Date

Tue Dec 05 2023

Journal Name

Baghdad Science Journal

Indoor/Outdoor Deep Learning Based Image Classification for Object Recognition Applications

Deep learning

GoogleNet

Image classification

Indoor/outdoor

Transfer learning.

Omar Abdullatif

Mohammed Jawad

Zenah Hadi

...Show More Authors

With the rapid development of smart devices, people's lives have become easier, especially for visually disabled or special-needs people. The new achievements in the fields of machine learning and deep learning let people identify and recognise the surrounding environment. In this study, the efficiency and high performance of deep learning architecture are used to build an image classification system in both indoor and outdoor environments. The proposed methodology starts with collecting two datasets (indoor and outdoor) from different separate datasets. In the second step, the collected dataset is split into training, validation, and test sets. The pre-trained GoogleNet and MobileNet-V2 models are trained using the indoor and outdoor se

View Publication Preview PDF

(5)

Publication Date

Sun Jan 14 2018

Journal Name

Journal Of Engineering

Optical Character Recognition Using Active Contour Segmentation

OCR

active contour

Segmentation

Automatic Character Recognition

Pattern Recognition

Tesseract OCR Engine.

Nabeel

Maher Faik

Estabraq

...Show More Authors

Document analysis of images snapped by camera is a growing challenge. These photos are often poor-quality compound images, composed of various objects and text; this makes automatic analysis complicated. OCR is one of the image processing techniques which is used to perform automatic identification of texts. Existing image processing techniques need to manage many parameters in order to clearly recognize the text in such pictures. Segmentation is regarded one of these essential parameters. This paper discusses the accuracy of segmentation process and its effect over the recognition process. According to the proposed method, the images were firstly filtered using the wiener filter then the active contour algorithm could b

View Publication Preview PDF

Publication Date

Tue Dec 21 2021

Journal Name

Mendel

Hybrid Deep Learning Model for Singing Voice Separation

R.

A.

...Show More Authors

Monaural source separation is a challenging issue due to the fact that there is only a single channel available; however, there is an unlimited range of possible solutions. In this paper, a monaural source separation model based hybrid deep learning model, which consists of convolution neural network (CNN), dense neural network (DNN) and recurrent neural network (RNN), will be presented. A trial and error method will be used to optimize the number of layers in the proposed model. Moreover, the effects of the learning rate, optimization algorithms, and the number of epochs on the separation performance will be explored. Our model was evaluated using the MIR-1K dataset for singing voice separation. Moreover, the proposed approach achi

View Publication

(4)

Publication Date

Sun Nov 01 2020

Journal Name

Iop Conference Series: Materials Science And Engineering

Face Recognition and Emotion Recognition from Facial Expression Using Deep Learning Neural Network

Ali

Zubaidah

Zainab

...Show More Authors

Abstract<p>Face recognition, emotion recognition represent the important bases for the human machine interaction. To recognize the person’s emotion and face, different algorithms are developed and tested. In this paper, an enhancement face and emotion recognition algorithm is implemented based on deep learning neural networks. Universal database and personal image had been used to test the proposed algorithm. Python language programming had been used to implement the proposed algorithm.</p>

View Publication

(8)

(2)

Publication Date

Mon Nov 21 2022

Journal Name

Sensors

Deep Learning-Based Computer-Aided Diagnosis (CAD): Applications for Medical Image Datasets

deep learning

CNN

auto-encoder

ant colony optimization

COVID-19

brain tumor

Yezi Ali

...Show More Authors

Computer-aided diagnosis (CAD) has proved to be an effective and accurate method for diagnostic prediction over the years. This article focuses on the development of an automated CAD system with the intent to perform diagnosis as accurately as possible. Deep learning methods have been able to produce impressive results on medical image datasets. This study employs deep learning methods in conjunction with meta-heuristic algorithms and supervised machine-learning algorithms to perform an accurate diagnosis. Pre-trained convolutional neural networks (CNNs) or auto-encoder are used for feature extraction, whereas feature selection is performed using an ant colony optimization (ACO) algorithm. Ant colony optimization helps to search for the bes

View Publication

(30)

(25)

Publication Date

Fri Mar 01 2024

Journal Name

Baghdad Science Journal

Deep Learning Techniques in the Cancer-Related Medical Domain: A Transfer Deep Learning Ensemble Model for Lung Cancer Prediction

Breast cancer

Cancer prediction

Deep learning

Ensemble learning

Lung cancer

Machine learning

Medical engineering.

Omar Abdullatif

Mohammed Jawad

Zenah Hadi Saied

...Show More Authors

Problem: Cancer is regarded as one of the world's deadliest diseases. Machine learning and its new branch (deep learning) algorithms can facilitate the way of dealing with cancer, especially in the field of cancer prevention and detection. Traditional ways of analyzing cancer data have their limits, and cancer data is growing quickly. This makes it possible for deep learning to move forward with its powerful abilities to analyze and process cancer data. Aims: In the current study, a deep-learning medical support system for the prediction of lung cancer is presented. Methods: The study uses three different deep learning models (EfficientNetB3, ResNet50 and ResNet101) with the transfer learning concept. The three models are trained using a

View Publication Preview PDF

(6)

(4)

Publication Date

Sat Jun 01 2024

Journal Name

Al-rafidain Journal Of Computer Sciences And Mathematics

Braille Character Recognition System: Review

Braille Recognition System

Image acquisition

Image pre-processing

Character Recognition

neural network

Rusul

Inaam

Rasha

...Show More Authors

The Braille Recognition System is the process of capturing a Braille document image and turning its content into its equivalent natural language characters. The Braille Recognition System's cell transcription and Braille cell recognition are the two basic phases that follow one another. The Braille Recognition System is a technique for locating and recognizing a Braille document stored as an image, such as a jpeg, jpg, tiff, or gif image, and converting the text into a machine-readable format, such as a text file. BCR translates an image's pixel representation into its character representation. As workers at visually impaired schools and institutes, we profit from Braille recognition in a variety of ways. The Braille Recognition S

View Publication Preview PDF

Publication Date

Mon Jan 01 2024

Journal Name

Journal Of Engineering

Face-based Gender Classification Using Deep Learning Model

Alex-Net

CLAHE

Deep learning

Gender Classification

Buraq Abed Ruda

Faten Abed Ali

...Show More Authors

Gender classification is a critical task in computer vision. This task holds substantial importance in various domains, including surveillance, marketing, and human-computer interaction. In this work, the face gender classification model proposed consists of three main phases: the first phase involves applying the Viola-Jones algorithm to detect facial images, which includes four steps: 1) Haar-like features, 2) Integral Image, 3) Adaboost Learning, and 4) Cascade Classifier. In the second phase, four pre-processing operations are employed, namely cropping, resizing, converting the image from(RGB) Color Space to (LAB) color space, and enhancing the images using (HE, CLAHE). The final phase involves utilizing Transfer lea

View Publication Preview PDF

(2)

Publication Date

Wed Mar 15 2023

Journal Name

International Journal Of Advances In Intelligent Informatics

An automatic lip reading for short sentences using deep learning nets

Maha Abd

Kadhim

...Show More Authors

One study whose importance has significantly grown in recent years is lip-reading, particularly with the widespread of using deep learning techniques. Lip reading is essential for speech recognition in noisy environments or for those with hearing impairments. It refers to recognizing spoken sentences using visual information acquired from lip movements. Also, the lip area, especially for males, suffers from several problems, such as the mouth area containing the mustache and beard, which may cover the lip area. This paper proposes an automatic lip-reading system to recognize and classify short English sentences spoken by speakers using deep learning networks. The input video extracts frames and each frame is passed to the Viola-Jone

View Publication

(6)

(3)

Publication Date

Thu Nov 17 2022

Journal Name

Journal Of Information And Optimization Sciences

Hybrid deep learning model for Arabic text classification based on mutual information

Farah A.

Nada A. Z.

...Show More Authors

View Publication

(1)

1 2 3 4 ... 831 832 833 834