Improving Pre-trained CNN-LSTM Models for Image Captioning with Hyper-Parameter Optimization

Nuha M. Khassaf; Nada Hussein M. Ali

doi:10.48084/etasr.8455

Details

Publication Date

Wed Oct 09 2024

Journal Name

Engineering, Technology & Applied Science Research

Volume

14

Issue Number

5

DOI

10.48084/etasr.8455

Choose Citation Style

Statistics

View publication

10

Statistics

(9)

(5)

Improving Pre-trained CNN-LSTM Models for Image Captioning with Hyper-Parameter Optimization

CNN pre-trained models

LSTM

activation function

hyper-parameters

overfitting

Nuha M. Khassaf

Nada Hussein M. Ali

...Show More Authors

The issue of image captioning, which comprises automatic text generation to understand an image’s visual information, has become feasible with the developments in object recognition and image classification. Deep learning has received much interest from the scientific community and can be very useful in real-world applications. The proposed image captioning approach involves the use of Convolution Neural Network (CNN) pre-trained models combined with Long Short Term Memory (LSTM) to generate image captions. The process includes two stages. The first stage entails training the CNN-LSTM models using baseline hyper-parameters and the second stage encompasses training CNN-LSTM models by optimizing and adjusting the hyper-parameters of the previous stage. Improvements include the use of a new activation function, regular parameter tuning, and an improved learning rate in the later stages of training. The experimental results on the flickr8k dataset showed a noticeable and satisfactory improvement in the second stage, where a clear increment was achieved in the evaluation metrics Bleu1-4, Meteor, and Rouge-L. This increment confirmed the effectiveness of the alterations and highlighted the importance of hyper-parameter tuning in improving the performance of CNN-LSTM models in image caption tasks.

View Publication

Publication Date

Thu Feb 01 2024

Journal Name

Baghdad Science Journal

Improving the efficiency and security of passport control processes at airports by using the R-CNN object detection model

Elhoucine

Yassine

Nabil

Belaid

...Show More Authors

The use of real-time machine learning to optimize passport control procedures at airports can greatly improve both the efficiency and security of the processes. To automate and optimize these procedures, AI algorithms such as character recognition, facial recognition, predictive algorithms and automatic data processing can be implemented. The proposed method is to use the R-CNN object detection model to detect passport objects in real-time images collected by passport control cameras. This paper describes the step-by-step process of the proposed approach, which includes pre-processing, training and testing the R-CNN model, integrating it into the passport control system, and evaluating its accuracy and speed for efficient passenger flow

View Publication Preview PDF

(3)

(1)

Publication Date

Mon Feb 01 2016

Journal Name

Swarm And Evolutionary Computation

Improving the performance of evolutionary multi-objective co-clustering models for community detection in complex social networks

Bara׳a A.

Wisam A.

Mayyadah F.

...Show More Authors

(34)

(29)

Publication Date

Sat Dec 02 2023

Journal Name

Journal Of Engineering

Performance Assessment of Pile Models Chemically Grouted by Low-Pressure Injection Laboratory Device for Improving Loose Sand

Jet grouting

Grouted piles

Silica Fume

Chemical additive

W/C ratio

Mohammed Saleh

Mahmood D.

...Show More Authors

The complexity and partially defined nature of jet grouting make it hard to predict the performance of grouted piles. So the trials of cement injection at a location with similar soil properties as the erecting site are necessary to assess the performance of the grouted piles. Nevertheless, instead of executing trial-injected piles at the pilot site, which wastes money, time, and effort, the laboratory cement injection devices are essential alternatives for evaluating soil injection ability. This study assesses the performance of a low-pressure laboratory grouting device by improving loose sandy soil injected using binders formed of Silica Fume (SF) as a chemical admixture (10% of Ordinary Portland Cement OPC mass) to di

View Publication Preview PDF

(1)

Publication Date

Wed Oct 17 2018

Journal Name

Journal Of Economics And Administrative Sciences

A Comparison of Bayes Estimators for the parameter of Rayleigh Distribution with Simulation

Rayleigh distribution

Bayes method double informative and non- informative priors

the posterior distribution

the squared error loss function

the weighted squared error loss function.

جنان عباس

...Show More Authors

A comparison of double informative and non- informative priors assumed for the parameter of Rayleigh distribution is considered. Three different sets of double priors are included, for a single unknown parameter of Rayleigh distribution. We have assumed three double priors: the square root inverted gamma (SRIG) - the natural conjugate family of priors distribution, the square root inverted gamma – the non-informative distribution, and the natural conjugate family of priors - the non-informative distribution as double priors .The data is generating form three cases from Rayleigh distribution for different samples sizes (small, medium, and large). And Bayes estimators for the parameter is derived under a squared erro

View Publication Preview PDF

Publication Date

Sun Feb 25 2024

Journal Name

Baghdad Science Journal

Hybrid CNN-based Recommendation System

CNN

deep learning

Recommendation systems

Social networks

Social recommendation

Muhammad

Roliana

Ali

...Show More Authors

Recommendation systems are now being used to address the problem of excess information in several sectors such as entertainment, social networking, and e-commerce. Although conventional methods to recommendation systems have achieved significant success in providing item suggestions, they still face many challenges, including the cold start problem and data sparsity. Numerous recommendation models have been created in order to address these difficulties. Nevertheless, including user or item-specific information has the potential to enhance the performance of recommendations. The ConvFM model is a novel convolutional neural network architecture that combines the capabilities of deep learning for feature extraction with the effectiveness o

View Publication Preview PDF

(9)

(7)

Publication Date

Thu Jun 01 2023

Journal Name

Baghdad Science Journal

Comparison of Faster R-CNN and YOLOv5 for Overlapping Objects Recognition

Computer vision

Convolutional neural network

Faster r-cnn

Kitchen utensils

Overlapping object recognition

Yolo

Muhamad Munawar

Rozniza

Muhammad Suzuri

...Show More Authors

Classifying an overlapping object is one of the main challenges faced by researchers who work in object detection and recognition. Most of the available algorithms that have been developed are only able to classify or recognize objects which are either individually separated from each other or a single object in a scene(s), but not overlapping kitchen utensil objects. In this project, Faster R-CNN and YOLOv5 algorithms were proposed to detect and classify an overlapping object in a kitchen area. The YOLOv5 and Faster R-CNN were applied to overlapping objects where the filter or kernel that are expected to be able to separate the overlapping object in the dedicated layer of applying models. A kitchen utensil benchmark image database and

View Publication Preview PDF

(28)

(22)

Publication Date

Sat Jun 01 2024

Journal Name

International Journal Of Advanced And Applied Sciences

High-accuracy models for iris recognition with merging features

Iris recognition

Biometric security

Artificial intelligence

Feature selection

Impersonation prevention

Hind Moutaz

...Show More Authors

Due to advancements in computer science and technology, impersonation has become more common. Today, biometrics technology is widely used in various aspects of people's lives. Iris recognition, known for its high accuracy and speed, is a significant and challenging field of study. As a result, iris recognition technology and biometric systems are utilized for security in numerous applications, including human-computer interaction and surveillance systems. It is crucial to develop advanced models to combat impersonation crimes. This study proposes sophisticated artificial intelligence models with high accuracy and speed to eliminate these crimes. The models use linear discriminant analysis (LDA) for feature extraction and mutual info

View Publication

(4)

Publication Date

Thu Jun 16 2022

Journal Name

Periodicals Of Engineering And Natural Sciences (pen)

Optimization algorithms for transportation problems with stochastic demand

Marwan Abdul Hameed

Alyaa Abdulameer

Ammar Sh.

...Show More Authors

The purpose of this paper is to solve the stochastic demand for the unbalanced transport problem using heuristic algorithms to obtain the optimum solution, by minimizing the costs of transporting the gasoline product for the Oil Products Distribution Company of the Iraqi Ministry of Oil. The most important conclusions that were reached are the results prove the possibility of solving the random transportation problem when the demand is uncertain by the stochastic programming model. The most obvious finding to emerge from this work is that the genetic algorithm was able to address the problems of unbalanced transport, And the possibility of applying the model approved by the oil products distribution company in the Iraqi Ministry of Oil to m

View Publication

(15)

(7)

Publication Date

Sat Apr 05 2025

Journal Name

2025 Ieee 4th International Conference On Computing And Machine Intelligence (icmi)

From Pixels to Diagnosis: AI-Powered CNN for Pneumonia Detection

Bilal Hameed

Ahmed Hadi

Anfal Sabeeh

...Show More Authors

View Publication Preview PDF

(1)

Publication Date

Sat May 01 2021

Journal Name

Journal Of Physics: Conference Series

Interval value fuzzy hyper AT-algebras

Saba Hussein

Fatema F.

Areej

...Show More Authors

Abstract<p>The aim of this work is to a connection between two concepts which are an interval value fuzzy set and a hyper AT-algebra. Also, some properties of these concepts are found. The notions of IVF hyper AT-subalgebras, IVF hyper ideals and IVF hyper AT-ideals are defined. Then IVF (weak, strong) hyper ideals and IVF (weak, strong) hyper AT-ideals are discussed. After that, some relations among these ideals are presented and some interesting theorems are proved.</p>

View Publication

(2)

(1)

1 2 3 4 ... 1076 1077 1078 1079