Improving Pre-trained CNN-LSTM Models for Image Captioning with Hyper-Parameter Optimization

Nuha M. Khassaf; Nada Hussein M. Ali

doi:10.48084/etasr.8455

Details

Publication Date

Wed Oct 09 2024

Journal Name

Engineering, Technology & Applied Science Research

Volume

14

Issue Number

5

DOI

10.48084/etasr.8455

Choose Citation Style

Statistics

View publication

10

Statistics

(9)

(5)

Improving Pre-trained CNN-LSTM Models for Image Captioning with Hyper-Parameter Optimization

CNN pre-trained models

LSTM

activation function

hyper-parameters

overfitting

Nuha M. Khassaf

Nada Hussein M. Ali

...Show More Authors

The issue of image captioning, which comprises automatic text generation to understand an image’s visual information, has become feasible with the developments in object recognition and image classification. Deep learning has received much interest from the scientific community and can be very useful in real-world applications. The proposed image captioning approach involves the use of Convolution Neural Network (CNN) pre-trained models combined with Long Short Term Memory (LSTM) to generate image captions. The process includes two stages. The first stage entails training the CNN-LSTM models using baseline hyper-parameters and the second stage encompasses training CNN-LSTM models by optimizing and adjusting the hyper-parameters of the previous stage. Improvements include the use of a new activation function, regular parameter tuning, and an improved learning rate in the later stages of training. The experimental results on the flickr8k dataset showed a noticeable and satisfactory improvement in the second stage, where a clear increment was achieved in the evaluation metrics Bleu1-4, Meteor, and Rouge-L. This increment confirmed the effectiveness of the alterations and highlighted the importance of hyper-parameter tuning in improving the performance of CNN-LSTM models in image caption tasks.

View Publication

Publication Date

Tue Dec 05 2023

Journal Name

Baghdad Science Journal

Indoor/Outdoor Deep Learning Based Image Classification for Object Recognition Applications

Deep learning

GoogleNet

Image classification

Indoor/outdoor

Transfer learning.

Omar Abdullatif

Mohammed Jawad

Zenah Hadi

...Show More Authors

With the rapid development of smart devices, people's lives have become easier, especially for visually disabled or special-needs people. The new achievements in the fields of machine learning and deep learning let people identify and recognise the surrounding environment. In this study, the efficiency and high performance of deep learning architecture are used to build an image classification system in both indoor and outdoor environments. The proposed methodology starts with collecting two datasets (indoor and outdoor) from different separate datasets. In the second step, the collected dataset is split into training, validation, and test sets. The pre-trained GoogleNet and MobileNet-V2 models are trained using the indoor and outdoor se

View Publication Preview PDF

(7)

(1)

Publication Date

Sat Oct 30 2021

Journal Name

Iraqi Journal Of Science

Small Binary Codebook Design for Image Compression Depending on Rotating Blocks

Rafah Rasheed

Saif B.

Rafah Rasheed

...Show More Authors

The searching process using a binary codebook of combined Block Truncation Coding (BTC) method and Vector Quantization (VQ), i.e. a full codebook search for each input image vector to find the best matched code word in the codebook, requires a long time. Therefore, in this paper, after designing a small binary codebook, we adopted a new method by rotating each binary code word in this codebook into 900 to 2700 step 900 directions. Then, we systematized each code word depending on its angle to involve four types of binary code books (i.e. Pour when , Flat when , Vertical when, or Zigzag). The proposed scheme was used for decreasing the time of the coding procedure, with very small distortion per block, by designing s

(6)

(2)

Publication Date

Mon Dec 05 2022

Journal Name

Baghdad Science Journal

MSRD-Unet: Multiscale Residual Dilated U-Net for Medical Image Segmentation

Attention

Deep Learning

Dilated Convolution

Medical Image Segmentation

U-Net

Muna

Ban N.

...Show More Authors

Semantic segmentation is an exciting research topic in medical image analysis because it aims to detect objects in medical images. In recent years, approaches based on deep learning have shown a more reliable performance than traditional approaches in medical image segmentation. The U-Net network is one of the most successful end-to-end convolutional neural networks (CNNs) presented for medical image segmentation. This paper proposes a multiscale Residual Dilated convolution neural network (MSRD-UNet) based on U-Net. MSRD-UNet replaced the traditional convolution block with a novel deeper block that fuses multi-layer features using dilated and residual convolution. In addition, the squeeze and execution attention mechanism (SE) and the s

View Publication Preview PDF

(16)

(9)

Publication Date

Tue Apr 04 2023

Journal Name

Results In Nonlinear Analysis

The fractional integrodifferential operator and its univalence and boundedness features according to Pre-Schwarzian derivative structure

Regular function

Locally univalent

Pre-Schwarzian derivative

Fractional calculus

Hiba Fawzi

...Show More Authors

Complex-valued regular functions that are normalized in the open unit disk are vastly studied. The current study introduces a new fractional integrodifferential (non-linear) operator. Based on the pre-Schwarzian derivative, certain appropriate stipulations on the parameters included in this con-structed operator to be univalent and bounded are investigated and determined.

View Publication Preview PDF

(1)

Publication Date

Sun Dec 01 2024

Journal Name

Journal Of The College Of Basic Education

Efficiency of SCL Via Google Classroom on Female Pre-service Teachers' Teaching Readiness

efficiency

Google classroom

perceptions

pre-service teachers

student-centered learning

teaching readiness

Hanan

...Show More Authors

This study intends to examine the efficiency of student-centered learning (SCL) through Google classroom in enhancing the readiness of fourth stage females’ pre-service teachers. The research employs a quasi-experimental design with a control and experimental group to compare the teaching readiness of participants before and after the intervention. The participants were 30 of fourth stage students at the University of Baghdad - College of Education for Women/the department of English and data were collected through observation checklist to assess their teaching experience and questionnaires to assess their perceptions towards using Google Classroom. Two sections were selected, C as a control group and D as the experimental one each with (

Preview PDF

Publication Date

Mon Sep 02 2024

Journal Name

Palestine Journal Of Mathematics

Class of Holomorphic Functions Considering Seven-Parameter Mittag-Leffler Function

Maryam K.

Abdulrahman H.

...Show More Authors

Preview PDF

(1)

Publication Date

Sun Jun 01 2014

Journal Name

Baghdad Science Journal

Effect of Silver Oxide Film Thickness on Some Optical Parameter

spray pyrolysis: Ag2O thin films

optical properties

Israa H.

Waffaa K.

Zahrra H.

...Show More Authors

Films of silver oxide of different thickness have been prepared by the chemical spray paralysis. Transmission and absorption spectra have recorded in order to study the effect of increasing thickness on some optical parameter such as reflectance, refractive index , and dielectric constant in its two parts . This study reveals that all these paramters affect by increasing the thickness .

View Publication Preview PDF

Publication Date

Mon Oct 01 2018

Journal Name

Al–bahith Al–a'alami

President Trump›s media discourse in the US election Study in electronic news sites - CNN ARABIC Model

President Trump›s’ media discourse’ US election’ discourse

Layth Bader

...Show More Authors

The letter is defined as a message directed by the sender to another party, the future. The aim is to convey, clarify or explain a particular point or subject, and in the form of direct oral communication through speech that contains a set of words and words, The future can discuss the sender directly to exchange ideas with each other, or it may be written and in this case does not require direct interaction between the matchmaker and the recipient. As a result of the different sources and topics of the discourse, and the different types of categories addressed to the speech, and the number, it has been divided into several types.
And schools of discourse analysis emerged in the early eighties of the last century and has spread and ha

View Publication Preview PDF

(2)

Publication Date

Fri Dec 01 2023

Journal Name

Iop Conference Series: Earth And Environmental Science

Improving the Growth and Production of Beets by Fertilizing with Fish Water and Spraying Lentil Extract

W. A.

...Show More Authors

Abstract<p>The experiment was conducted in the fields belonging to the Department of Horticulture, College of Agricultural Engineering Sciences, University of Baghdad, at Al-Jadriya Complex / Station A, for the autumn season of 2022-2023. The aim was to study the effect of water fish irrigation and water lens plant extract foliar application on the growth and productivity of beetroot. The experiment included two factors: the first factor was water fish irrigation with five concentrations (A) Control treatment (irrigation with river water and recommended fertilization), (B) Water fish irrigation at 25% concentration, (C) water Fish irrigation at 50% concentration, (D) Water Fish irrigation at 75%</p> ... Show More

View Publication

Publication Date

Sat Jul 01 2023

Journal Name

Industrial Laboratory. Materials Diagnostics

OPTIMIZATION OF PLASMA-ASSISTED DESORPTION/IONIZATIONMASS SPECTROMETRY FOR ANALYSIS OF IBUPROFEN

Plasma-assisted desorption ionization-mass spectrometry

optimization

ibuprofen

Jasim M. S.

...Show More Authors

In medical practice, nonsteroidal anti-inflammatory drugs (NSAIDs) are often used to treat osteoarthritis and rheumatoid arthritis. Ibuprofen is a well-known NSAID, analgesic, and antipyretic medication. This chemical is an active ingredient of several oral medications that are offered in tablet, gel pellet, and syrup forms and has higher efficacy, tolerance, and side effect rates than other compounds, including pyrazolone derivatives. We present a unique plasma-assisted desorption/ionization mass spectrometry (PADI-MS) approach for improving pharmaceutically important solids using an ibuprofen tablet as a model solid sample. The goal of the study is to create an innovative mass spectrometric method that could be used for quick and accur

Preview PDF

1 2 ... 31 32 33 34 ... 1078 1079