Improving Pre-trained CNN-LSTM Models for Image Captioning with Hyper-Parameter Optimization

Nuha M. Khassaf; Nada Hussein M. Ali

doi:10.48084/etasr.8455

Details

Publication Date

Wed Oct 09 2024

Journal Name

Engineering, Technology & Applied Science Research

Volume

14

Issue Number

5

DOI

10.48084/etasr.8455

CNN pre-trained models

LSTM

activation function

hyper-parameters

overfitting

Image captioning, the automatic generation of text that describes an image’s visual content, has become feasible with advances in object recognition and image classification. Deep learning has received much interest from the scientific community and can be very useful in real-world applications. The proposed image captioning approach combines pre-trained Convolutional Neural Network (CNN) models with Long Short-Term Memory (LSTM) to generate image captions. The process includes two stages. The first stage trains the CNN-LSTM models using baseline hyper-parameters, and the second stage trains them again after optimizing and adjusting the hyper-parameters of the first stage. Improvements include the use of a new activation function, regular parameter tuning, and an improved learning rate in the later stages of training. The experimental results on the Flickr8k dataset showed a noticeable improvement in the second stage, with clear increases in the evaluation metrics BLEU-1 to BLEU-4, METEOR, and ROUGE-L. These increases confirmed the effectiveness of the alterations and highlighted the importance of hyper-parameter tuning in improving the performance of CNN-LSTM models in image captioning tasks.
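As an illustration of the BLEU family of metrics named in the abstract, the following minimal sketch computes the clipped (modified) n-gram precision that BLEU-n is built on. It is not the authors' evaluation code: it assumes a single reference caption and whitespace tokenization, and it omits the geometric mean over n-gram orders and the brevity penalty used in full BLEU. The function names and the example captions are hypothetical.

```python
from collections import Counter


def ngrams(tokens, n):
    """Return the list of n-grams (as tuples) in a token list."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def modified_precision(candidate, reference, n):
    """Clipped n-gram precision of a candidate caption against one reference."""
    cand_counts = Counter(ngrams(candidate.lower().split(), n))
    ref_counts = Counter(ngrams(reference.lower().split(), n))
    total = sum(cand_counts.values())
    if total == 0:
        return 0.0
    # Each candidate n-gram is credited at most as often as it occurs in the reference.
    clipped = sum(min(count, ref_counts[gram]) for gram, count in cand_counts.items())
    return clipped / total


# Hypothetical captions, used only to exercise the function.
candidate = "a dog runs on the grass"
reference = "a dog is running on the grass"
p1 = modified_precision(candidate, reference, 1)  # unigram precision
p2 = modified_precision(candidate, reference, 2)  # bigram precision
```

Comparing such precisions between the baseline stage and the tuned stage is one way to see how a change in hyper-parameters affects caption quality.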

Publication Date

Fri Mar 01 2019

Journal Name

Al-khwarizmi Engineering Journal

Reverse Engineering Representation Using an Image Processing Modification

Bezier

reverse engineering

tool path generation.

Ahmed A. A.

Nareen Hafidh

Safaa Kadhim

In the reverse engineering approach, a massive amount of point data is gathered during data acquisition, which leads to larger file sizes and longer data handling times. In addition, fitting surfaces to these data points is time-consuming and demands particular skills. In the present work, a method for obtaining the control points of any profile is presented. Several image modification steps are explained using the SolidWorks program, and a parametric equation of the proposed profile is derived using the Bezier technique with the adopted control points. Finally, the proposed profile was machined using a 3-axis CNC milling machine, and a comparison of dimensions was carried out betwe
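The Bezier technique mentioned in this abstract can be sketched as follows. This is a generic evaluation of a Bezier curve from its control points using Bernstein polynomials, not the authors' derivation; the control points below are hypothetical and the curve is 2-D for simplicity.

```python
from math import comb


def bezier_point(control_points, t):
    """Evaluate a 2-D Bezier curve at parameter t in [0, 1].

    The point is the sum of the control points weighted by the
    Bernstein polynomials B(i, n)(t) = C(n, i) * (1 - t)^(n - i) * t^i.
    """
    n = len(control_points) - 1
    x = y = 0.0
    for i, (px, py) in enumerate(control_points):
        weight = comb(n, i) * (1 - t) ** (n - i) * t ** i
        x += weight * px
        y += weight * py
    return x, y


# Hypothetical control points for a quadratic profile.
control_points = [(0.0, 0.0), (1.0, 2.0), (2.0, 0.0)]
# Sample the profile at 11 evenly spaced parameter values.
profile = [bezier_point(control_points, k / 10) for k in range(11)]
```

Sampled points of this kind could then serve as input to tool path generation, although the abstract does not state how the machining path was produced.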

Publication Date

Thu Feb 28 2019

Journal Name

Journal Of Engineering

Digital Color Image Watermarking Using Encoded Frequent Mark

watermarking

security

robustness.

Abdulkareem Mohammed

Salih Hassan

With the increased development in digital media and communication, the need for protection and security methods has become very important, as exchanging and transmitting data over communication channels has prompted efforts to protect these data from unauthorized access.

This paper presents a new method to protect color images from unauthorized access using watermarking. The watermarking algorithm hides the encoded mark image in the frequency domain using the Discrete Cosine Transform. The main principle of the algorithm is to encode a frequent mark in the cover color image. The watermark image bits are spread by repeating the mark and arranging it in an encoded manner, which gives the algorithm more robustness and security. The propos

Publication Date

Mon Jun 05 2023

Journal Name

Journal Of Engineering

Image Compression Using 3-D Two-Level Technique

Image compression

3-D two level wavelet transform

3-D two level multi-wavelet transform

3-D two level hybrid technique

Image data properties

Zainab Ibraheem

In this paper, three techniques for image compression are implemented. The proposed techniques consist of a three-dimensional (3-D) two-level discrete wavelet transform (DWT), a 3-D two-level discrete multi-wavelet transform (DMWT), and a 3-D two-level hybrid (wavelet-multiwavelet transform) technique. Daubechies and Haar wavelets are used in the discrete wavelet transform, and critically sampled preprocessing is used in the discrete multi-wavelet transform. The aim is to increase the compression ratio (CR) as the level of the 3-D transformation increases, so the compression ratio is measured for each level. To get good compression, image data properties were measured, such as image entropy (He), percent r
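Two of the quantities named in this abstract can be sketched briefly. The abstract does not give its formulas, so the code assumes the usual definitions: Shannon entropy of the pixel-value histogram in bits per pixel, and compression ratio as original size divided by compressed size. The function names are hypothetical.

```python
from collections import Counter
from math import log2


def image_entropy(pixels):
    """Shannon entropy (bits per pixel) of a flat sequence of pixel values."""
    counts = Counter(pixels)
    total = len(pixels)
    return -sum((c / total) * log2(c / total) for c in counts.values())


def compression_ratio(original_bytes, compressed_bytes):
    """Assumed definition: size before compression over size after."""
    return original_bytes / compressed_bytes
```

For example, an image whose pixels take only two equally likely values has an entropy of 1 bit per pixel, and a constant image has an entropy of 0.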

Publication Date

Sun Feb 24 2019

Journal Name

Iraqi Journal Of Physics

Adaptive inter frame compression using image segmented technique

Video compression

Image segmented

motion estimation

Ban Sabah

The computer vision branch of the artificial intelligence field is concerned with developing algorithms for analyzing video image content. Extracting edge information is the essential process in most pictorial pattern recognition problems. A new edge detection technique is introduced in this research for detecting boundaries.

A selection of typical lossy techniques for encoding edge video images is also discussed in this research. The discussion concentrates on the Block-Truncation coding technique and the Discrete Cosine Transform (DCT) coding technique. In order to reduce the volume of pictorial data which one may need to store or transmit,

Publication Date

Fri Sep 09 2022

Journal Name

Research Anthology On Improving Medical Imaging Techniques For Analysis And Intervention

Groupwise Non-Rigid Image Alignment Using Few Parameters

Ahmad Hashim

Bernard

Paul

Reyer

Groupwise non-rigid image alignment is a difficult non-linear optimization problem involving many parameters and often large datasets. Previous methods have explored various metrics and optimization strategies. Good results have been previously achieved with simple metrics, requiring complex optimization, often with many unintuitive parameters that require careful tuning for each dataset. In this chapter, the problem is restructured to use a simpler, iterative optimization algorithm, with very few free parameters. The warps are refined using an iterative Levenberg-Marquardt minimization to the mean, based on updating the locations of a small number of points and incorporating a stiffness constraint. This optimization approach is eff

Publication Date

Sat Sep 30 2023

Journal Name

حوليات أداب عين شمس

Ethnology and globalization of image in cinematographic discourse

Shurooq Malik Hasan

Publication Date

Tue Jun 01 2021

Journal Name

Al-nahrain Journal Of Science

Medical Image Denoising Via Matrix Norm Minimization Problems

Basad

This paper presents the matrix completion problem for image denoising. Three problems based on matrix norms are considered: the spectral norm minimization problem (SNP), the nuclear norm minimization problem (NNP), and the weighted nuclear norm minimization problem (WNNP). In general, an image is represented by a matrix that contains the information of the image. Some of this information is irrelevant or unfavorable, so matrix completion is used to compress the matrix and remove the unwanted information. The unwanted information is handled by defining a {0,1}-operator under some threshold. Applying this operator on a given ma

Publication Date

Sat Nov 01 2014

Journal Name

Journal Of Next Generation Information Technology

The effect of the smoothing filter on an image encrypted by the blowfish algorithm then hiding it in a BMP image

Smoothing Filter

Gaussian Filter

Cryptography

Blowfish

Steganography.

Nada Abdul Aziz Mustafa

order to increase the level of security, as this system encrypts the secret image (by the Blowfish method) before sending it through the internet to the recipient. The Blowfish method is known for its efficient security; nevertheless, its encryption time is long. In this research, we apply a smoothing filter to the secret image, which decreases its size and consequently the encryption and decryption times. After encryption, the secret image is hidden in another image, called the cover image, using one of two methods: "Two-LSB" or "Hiding most bits in blue pixels". Finally, we compare the results of the two methods to determine which one is better to use according to the PSNR measurs
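The hiding and comparison steps described in this abstract can be illustrated with a minimal sketch. It assumes that "Two-LSB" means overwriting the two least significant bits of each 8-bit cover value, and it uses the standard PSNR definition, 10 * log10(MAX^2 / MSE), which the abstract does not spell out. Blowfish encryption and the smoothing filter are omitted, and the pixel values and function names are hypothetical.

```python
from math import log10


def embed_two_lsb(cover, bit_pairs):
    """Overwrite the two least significant bits of each cover value with a 2-bit value (0..3)."""
    stego = list(cover)
    for i, bits in enumerate(bit_pairs):
        stego[i] = (stego[i] & 0b11111100) | (bits & 0b11)
    return stego


def extract_two_lsb(stego, count):
    """Read back the first `count` hidden 2-bit values."""
    return [value & 0b11 for value in stego[:count]]


def psnr(original, modified, max_value=255):
    """Peak signal-to-noise ratio in dB between two equal-length pixel sequences."""
    mse = sum((a - b) ** 2 for a, b in zip(original, modified)) / len(original)
    if mse == 0:
        return float("inf")  # identical images
    return 10 * log10(max_value ** 2 / mse)


# Hypothetical cover pixels and secret 2-bit values.
cover = [120, 121, 122, 123]
secret = [3, 0, 1, 2]
stego = embed_two_lsb(cover, secret)
quality = psnr(cover, stego)
```

A higher PSNR between the cover and stego images indicates that the hidden data distorts the cover less, which is the basis on which two hiding methods could be compared.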

Publication Date

Fri Dec 31 2021

Journal Name

Iraqi Journal Of Market Research And Consumer Protection

GEOMETRY OPTIMIZATION OF COUPLING ALLIN -METFORMIN USING DFT/B3LYP MOLECULAR MODELLING TECHNIQUE

Alliin

metformin

computational chemistry

molecular modeling technique

Lekaa H.

This research paper includes the incorporation of Alliin at various energy levels and angles with Metformin using Gaussian 09 and GaussView 06. Two computers were used in this work. Samples were generated to draw, integrate, simulate, and measure the value of the potential energy surface, by means of which the lowest energy value was found to be -1227.408 au. The best correlation compound was achieved between Alliin and Metformin through the low energy values where the best place for metformin to b

Publication Date

Sat Jun 04 2022

Journal Name

Journal Of Inorganic And Organometallic Polymers And Materials

Improving the Mechanical Properties, Roughness, Thermal Stability, and Contact Angle of the Acrylic Polymer by Graphene and Carbon Fiber Doping for Waterproof Coatings

Tabarak M.

Seenaa I.
