A Survey on Image Caption Generation in Various Languages

haneen siraj ibrahim

doi:10.24996/ijs.2024.65.7.38

Details

Publication Date

Tue Jul 30 2024

Journal Name

Iraqi Journal Of Science

DOI

10.24996/ijs.2024.65.7.38

Choose Citation Style

Statistics

View publication

24

Statistics

(4)

A Survey on Image Caption Generation in Various Languages

haneen siraj ibrahim

...Show More Authors

The image caption is the process of adding an explicit, coherent description to the contents of the image. This is done by using the latest deep learning techniques, which include computer vision and natural language processing, to understand the contents of the image and give it an appropriate caption. Multiple datasets suitable for many applications have been proposed. The biggest challenge for researchers with natural language processing is that the datasets are incompatible with all languages. The researchers worked on translating the most famous English data sets with Google Translate to understand the content of the images in their mother tongue. In this paper, the proposed review aims to enhance the understanding of image captioning strategies and to survey previous research related to image captioning while examining the most popular databases in different languages, mostly English, translating into other languages using the latest models for describing images, summarizing evaluation measures, and comparing them.

View Publication

Publication Date

Wed Jan 28 2015

Journal Name

Al-khwarizmi Engineering Journal

Thermal Field Analysis of Oblique Machining Process with Infrared Image for AA6063-T6

Infrared

Deform-3D

tool obliquity

turning.

Osamah F.

Lara A.

...Show More Authors

Abstract

Metal cutting processes still represent the largest class of manufacturing operations. Turning is the most commonly employed material removal process. This research focuses on analysis of the thermal field of the oblique machining process. Finite element method (FEM) software DEFORM 3D V10.2 was used together with experimental work carried out using infrared image equipment, which include both hardware and software simulations. The thermal experiments are conducted with AA6063-T6, using different tool obliquity, cutting speeds and feed rates. The results show that the temperature relatively decreased when tool obliquity increases at different cutting speeds and feed rates, also it

View Publication Preview PDF

Publication Date

Tue Feb 01 2022

Journal Name

Baghdad Science Journal

An Enhanced Approach of Image Steganographic Using Discrete Shearlet Transform and Secret Sharing

Discrete Shearlet Transform

Image Steganography

Stego Image

Secret Sharing.

Yasir Ahmed

Nada Elya

Mohammed Qasim

...Show More Authors

Recently, the internet has made the users able to transmit the digital media in the easiest manner. In spite of this facility of the internet, this may lead to several threats that are concerned with confidentiality of transferred media contents such as media authentication and integrity verification. For these reasons, data hiding methods and cryptography are used to protect the contents of digital media. In this paper, an enhanced method of image steganography combined with visual cryptography has been proposed. A secret logo (binary image) of size (128x128) is encrypted by applying (2 out 2 share) visual cryptography on it to generate two secret share. During the embedding process, a cover red, green, and blue (RGB) image of size (512

View Publication Preview PDF

(15)

(9)

Publication Date

Mon Mar 01 2021

Journal Name

Iraqi Journal Of Physics

Enhancement CT Scan Image and Study Electronic, Structural and Vibrational Properties of Iobenguane

CT scan image

Iobenguane Properties

Enhancement Contrast

Lifting wavelet Transform (LWT)Edges detection.

Ahlam Majead

Huda Muhamed

Shaimaa H.

...Show More Authors

This work is divided into two parts first part study electronic structure and vibration properties of the Iobenguane material that is used in CT scan imaging. Iobenguane, or MIBG, is an aralkylguanidine analog of the adrenergic neurotransmitter norepinephrine and a radiopharmaceutical. It acts as a blocking agent for adrenergic neurons. When radiolabeled, it can be used in nuclear medicinal diagnostic techniques as well as in neuroendocrine antineoplastic treatments. The aim of this work is to provide general information about Iobenguane that can be used to obtain results to diagnose the diseases. The second part study image processing techniques, the CT scan image is transformed to frequency domain using the LWT. Two methods of contrast

View Publication Preview PDF

Publication Date

Sun Jun 01 2014

Journal Name

International Journal Of Advanced Research In Computer Science And Software Engineering

Medical Image Compression using Wavelet Quadrants of Polynomial Prediction Coding & Bit Plane Slicing

Ghadah

...Show More Authors

Publication Date

Tue Dec 03 2013

Journal Name

Ibn Al-haitham Journal For Pure And Applied Science

New adaptive satellite image classification technique for al Habbinya region west of Iraq

Taghreed

...Show More Authors

Publication Date

Fri Feb 08 2019

Journal Name

Journal Of The College Of Education For Women

COMPARATIVE STUDY FOR EDGE DETECTION OF NOISY IMAGE USING SOBEL AND LAPLACE OPERATORS

Sobel Operator

Laplace Operator

Noise Reduction

Mean filter

Image Thresholding

Instructor Sameera A. Abdul-Kader

...Show More Authors

Many approaches of different complexity already exist to edge detection in
color images. Nevertheless, the question remains of how different are the results
when employing computational costly techniques instead of simple ones. This
paper presents a comparative study on two approaches to color edge detection to
reduce noise in image. The approaches are based on the Sobel operator and the
Laplace operator. Furthermore, an efficient algorithm for implementing the two
operators is presented. The operators have been applied to real images. The results
are presented in this paper. It is shown that the quality of the results increases by
using second derivative operator (Laplace operator). And noise reduced in a good

View Publication Preview PDF

Publication Date

Sat Jun 06 2020

Journal Name

Journal Of The College Of Education For Women

Image classification with Deep Convolutional Neural Network Using Tensorflow and Transfer of Learning

Convolutional Neural Network (CNN)

Synthetic Aperture Radar (SAR)

TensorFlow

Transfer learning

Visual Geometry Group (VGG16)

Aseel Sami

MatheelEmaduldin

...Show More Authors

The deep learning algorithm has recently achieved a lot of success, especially in the field of computer vision. This research aims to describe the classification method applied to the dataset of multiple types of images (Synthetic Aperture Radar (SAR) images and non-SAR images). In such a classification, transfer learning was used followed by fine-tuning methods. Besides, pre-trained architectures were used on the known image database ImageNet. The model VGG16 was indeed used as a feature extractor and a new classifier was trained based on extracted features.The input data mainly focused on the dataset consist of five classes including the SAR images class (houses) and the non-SAR images classes (Cats, Dogs, Horses, and Humans). The Conv

View Publication Preview PDF

(1)

Publication Date

Wed Oct 09 2024

Journal Name

Engineering, Technology & Applied Science Research

Improving Pre-trained CNN-LSTM Models for Image Captioning with Hyper-Parameter Optimization

CNN pre-trained models

LSTM

activation function

hyper-parameters

overfitting

Nuha M.

Nada

...Show More Authors

The issue of image captioning, which comprises automatic text generation to understand an image’s visual information, has become feasible with the developments in object recognition and image classification. Deep learning has received much interest from the scientific community and can be very useful in real-world applications. The proposed image captioning approach involves the use of Convolution Neural Network (CNN) pre-trained models combined with Long Short Term Memory (LSTM) to generate image captions. The process includes two stages. The first stage entails training the CNN-LSTM models using baseline hyper-parameters and the second stage encompasses training CNN-LSTM models by optimizing and adjusting the hyper-parameters of

View Publication

(5)

(4)

Publication Date

Tue Oct 15 2019

Journal Name

International Journal Of Electrical And Computer Engineering (ijece)

Combining Convolutional Neural Networks and Slantlet Transform For An Effective Image Retrieval Scheme

Content base image retrieval wavelet transforms Convolutional neural networks Deep learning Information retrieval Slanlet transform

Mohammed S. H.

...Show More Authors

In the latest years there has been a profound evolution in computer science and technology, which incorporated several fields. Under this evolution, Content Base Image Retrieval (CBIR) is among the image processing field. There are several image retrieval methods that can easily extract feature as a result of the image retrieval methods’ progresses. To the researchers, finding resourceful image retrieval devices has therefore become an extensive area of concern. Image retrieval technique refers to a system used to search and retrieve images from digital images’ huge database. In this paper, the author focuses on recommendation of a fresh method for retrieving image. For multi presentation of image in Convolutional Neural Network (CNN),

(11)

(5)

Publication Date

Thu Mar 03 2022

Journal Name

Multimedia Tools And Applications

Boosting Marine Predators Algorithm by Salp Swarm Algorithm for Multilevel Thresholding Image Segmentation

Laith

Nada Khalil

Mohamed Abd

Essam H.

...Show More Authors

View Publication

(51)

(47)

1 2 ... 75 76 77 78 ... 2118 2119