Improving Pre-trained CNN-LSTM Models for Image Captioning with Hyper-Parameter Optimization

Nuha M. Khassaf; Nada Hussein M. Ali

doi:10.48084/etasr.8455

Details

Publication Date

Wed Oct 09 2024

Journal Name

Engineering, Technology & Applied Science Research

Volume

14

Issue Number

5

DOI

10.48084/etasr.8455

Choose Citation Style

Statistics

View publication

10

Statistics

(9)

(5)

Improving Pre-trained CNN-LSTM Models for Image Captioning with Hyper-Parameter Optimization

CNN pre-trained models

LSTM

activation function

hyper-parameters

overfitting

Nuha M. Khassaf

Nada Hussein M. Ali

...Show More Authors

The issue of image captioning, which comprises automatic text generation to understand an image’s visual information, has become feasible with the developments in object recognition and image classification. Deep learning has received much interest from the scientific community and can be very useful in real-world applications. The proposed image captioning approach involves the use of Convolution Neural Network (CNN) pre-trained models combined with Long Short Term Memory (LSTM) to generate image captions. The process includes two stages. The first stage entails training the CNN-LSTM models using baseline hyper-parameters and the second stage encompasses training CNN-LSTM models by optimizing and adjusting the hyper-parameters of the previous stage. Improvements include the use of a new activation function, regular parameter tuning, and an improved learning rate in the later stages of training. The experimental results on the flickr8k dataset showed a noticeable and satisfactory improvement in the second stage, where a clear increment was achieved in the evaluation metrics Bleu1-4, Meteor, and Rouge-L. This increment confirmed the effectiveness of the alterations and highlighted the importance of hyper-parameter tuning in improving the performance of CNN-LSTM models in image caption tasks.

View Publication

Publication Date

Sat Jan 01 2011

Journal Name

Journal Of Engineering

RADAR PARAMETER GENERATION TO IDENTIFY THE TARGET

Identification

Target

Electronic Warefare

Radar Pulse

Graphical User Interface

W. A.

A. K.

F. D.

...Show More Authors

Due to the popularity of radar, receivers often “hear” a great number of other transmitters in
addition to their own return merely in noise. The dealing with the problem of identifying and/or
separating a sum of tens of such pulse trains from a number of different sources are often received on
the one communication channel. It is then of interest to identify which pulses are from which source,
based on the assumption that the different sources have different characteristics. This search deals with a
graphical user interface (GUI) to generate the radar pulse in order to use the required radar signal in any
specified location.

View Publication Preview PDF

(1)

Publication Date

Sun Apr 01 2018

Journal Name

Journal Of Economics And Administrative Sciences

Bayes Estimators for the Parameter of the Inverted Exponential Distribution Under different Double informative priors

Inverted exponential distribution

Bayes method

Prior distributions (Chi-squared distribution

Gamma distribution

Erlang distribution)

mean squared errors (MSE).

جنان عباس

...Show More Authors

In this paper, we present a comparison of double informative priors which are assumed for the parameter of inverted exponential distribution.To estimate the parameter of inverted exponential distribution by using Bayes estimation ,will be used two different kind of information in the Bayes estimation; two different priors have been selected for the parameter of inverted exponential distribution. Also assumed Chi-squared - Gamma distribution, Chi-squared - Erlang distribution, and- Gamma- Erlang distribution as double priors. The results are the derivations of these estimators under the squared error loss function with three different double priors.

Additionally Maximum likelihood estimation method

View Publication Preview PDF

Publication Date

Sat Nov 02 2019

Journal Name

Advances In Intelligent Systems And Computing

Spin-Image Descriptors for Text-Independent Speaker Recognition

Suhaila N.

...Show More Authors

Building a system to identify individuals through their speech recording can find its application in diverse areas, such as telephone shopping, voice mail and security control. However, building such systems is a tricky task because of the vast range of differences in the human voice. Thus, selecting strong features becomes very crucial for the recognition system. Therefore, a speaker recognition system based on new spin-image descriptors (SISR) is proposed in this paper. In the proposed system, circular windows (spins) are extracted from the frequency domain of the spectrogram image of the sound, and then a run length matrix is built for each spin, to work as a base for feature extraction tasks. Five different descriptors are generated fro

View Publication

(7)

(2)

Publication Date

Mon Sep 30 2024

Journal Name

Al-mustansiriyah Journal Of Science

A Transfer Learning Approach for Arabic Image Captions

Haneen serag

Narjis

Abdul Rahman A.

...Show More Authors

Publication Date

Sat Dec 30 2017

Journal Name

International Journal Of Science And Research (ijsr)

Color-based for tree yield fruits image counting

Image Segmentation

Object Labeling

Color Space

contrast stretching

morphological operations

Faisel G. Mohammed

Wejdan A. Amer

...Show More Authors

Identifying the total number of fruits on trees has long been of interest in agricultural crop estimation work. Yield prediction of fruits in practical environment is one of the hard and significant tasks to obtain better results in crop management system to achieve more productivity with regard to moderate cost. Utilized color vision in machine vision system to identify citrus fruits, and estimated yield information of the citrus grove in-real time. Fruit recognition algorithms based on color features to estimate the number of fruit. In the current research work, some low complexity and efficient image analysis approach was proposed to count yield fruits image in the natural scene. Semi automatic segmentation and yield calculation of fruit

View Publication

Publication Date

Fri Jul 01 2016

Journal Name

International Journal Of Computer Science And Mobile Computing

. Interpolative Absolute Block Truncation Coding for Image Compression

Ghadah

...Show More Authors

Publication Date

Mon Apr 17 2023

Journal Name

Wireless Communications And Mobile Computing

A Double Clustering Approach for Color Image Segmentation

Asma Khazaal Abdulsahib

Siti Sakira Kamaruddin

and Mustafa Musa Jabar

...Show More Authors

One of the significant stages in computer vision is image segmentation which is fundamental for different applications, for example, robot control and military target recognition, as well as image analysis of remote sensing applications. Studies have dealt with the process of improving the classification of all types of data, whether text or audio or images, one of the latest studies in which researchers have worked to build a simple, effective, and high-accuracy model capable of classifying emotions from speech data, while several studies dealt with improving textual grouping. In this study, we seek to improve the classification of image division using a novel approach depending on two methods used to segment the images. The first

View Publication

(4)

(3)

Publication Date

Mon Oct 30 2023

Journal Name

Iraqi Journal Of Science

Machine Learning Approach for Facial Image Detection System

Hind Moutaz

...Show More Authors

HM Al-Dabbas, RA Azeez, AE Ali, Iraqi Journal of Science, 2023

View Publication

(8)

Publication Date

Mon Jan 01 2024

Journal Name

Aip Conference Proceedings

Non-linear support vector machine classification models using kernel tricks with applications

Classification

Logistic regression

Naïve Bayes

Slack variable

Support vector machine

Ghadeer Jasim Mohammed

Seror Faeq

...Show More Authors

The support vector machine, also known as SVM, is a type of supervised learning model that can be used for classification or regression depending on the datasets. SVM is used to classify data points by determining the best hyperplane between two or more groups. Working with enormous datasets, on the other hand, might result in a variety of issues, including inefficient accuracy and time-consuming. SVM was updated in this research by applying some non-linear kernel transformations, which are: linear, polynomial, radial basis, and multi-layer kernels. The non-linear SVM classification model was illustrated and summarized in an algorithm using kernel tricks. The proposed method was examined using three simulation datasets with different sample

View Publication Preview PDF

(1)

(2)

Publication Date

Wed Aug 11 2021

Journal Name

International Journal Of Interactive Mobile Technologies (ijim)

Image Denoising Using Multiwavelet Transform with Different Filters and Rules

Muna Majeed

...Show More Authors

<p class="0abstract">Image denoising is a technique for removing unwanted signals called the noise, which coupling with the original signal when transmitting them; to remove the noise from the original signal, many denoising methods are used. In this paper, the Multiwavelet Transform (MWT) is used to denoise the corrupted image by Choosing the HH coefficient for processing based on two different filters Tri-State Median filter and Switching Median filter. With each filter, various rules are used, such as Normal Shrink, Sure Shrink, Visu Shrink, and Bivariate Shrink. The proposed algorithm is applied Salt& pepper noise with different levels for grayscale test images. The quality of the denoised image is evaluated by usi

View Publication

(5)

(3)

1 2 ... 20 21 22 23 ... 1078 1079