A Survey on Image Caption Generation in Various Languages

Haneen Siraj Ibrahim

doi:10.24996/ijs.2024.65.7.38

Details

Publication Date

Tue Jul 30 2024

Journal Name

Iraqi Journal Of Science

DOI

10.24996/ijs.2024.65.7.38

Image captioning is the process of adding an explicit, coherent description of the contents of an image. It relies on recent deep learning techniques that combine computer vision and natural language processing to understand the contents of the image and produce an appropriate caption. Multiple datasets suitable for many applications have been proposed. The biggest challenge for researchers in natural language processing is that these datasets are not available in all languages. Researchers have therefore translated the most widely used English datasets with Google Translate so that image content can be described in their mother tongue. This review aims to enhance the understanding of image captioning strategies and to survey previous research on image captioning. It examines the most popular datasets in different languages (mostly English datasets translated into other languages), covers the latest models for describing images, and summarizes and compares evaluation measures.

Publication Date

Fri Jan 01 2016

Journal Name

Engineering And Technology Journal

Face Retrieval Using Image Moments and Genetic Algorithm

Wathiq N.

Publication Date

Fri Sep 09 2022

Journal Name

Research Anthology On Improving Medical Imaging Techniques For Analysis And Intervention

Groupwise Non-Rigid Image Alignment Using Few Parameters

Ahmad Hashim

Bernard

Paul

Reyer

Groupwise non-rigid image alignment is a difficult non-linear optimization problem involving many parameters and often large datasets. Previous methods have explored various metrics and optimization strategies. Good results have been previously achieved with simple metrics, requiring complex optimization, often with many unintuitive parameters that require careful tuning for each dataset. In this chapter, the problem is restructured to use a simpler, iterative optimization algorithm, with very few free parameters. The warps are refined using an iterative Levenberg-Marquardt minimization to the mean, based on updating the locations of a small number of points and incorporating a stiffness constraint. This optimization approach is eff

Publication Date

Sat Dec 30 2017

Journal Name

International Journal Of Science And Research (ijsr)

Color-based for tree yield fruits image counting

Image Segmentation

Object Labeling

Color Space

contrast stretching

morphological operations

Faisel G. Mohammed

Wejdan A. Amer

Identifying the total number of fruits on trees has long been of interest in agricultural crop estimation work. Predicting fruit yield in a practical environment is a hard but significant task for crop management systems that aim at higher productivity at moderate cost. Color vision has been used in machine vision systems to identify citrus fruits and to estimate the yield of a citrus grove in real time, with fruit recognition algorithms that rely on color features to estimate the number of fruit. In the current research work, a low-complexity and efficient image analysis approach is proposed to count fruits in images of natural scenes. Semi automatic segmentation and yield calculation of fruit

Publication Date

Fri Jul 01 2016

Journal Name

International Journal Of Computer Science And Mobile Computing

Interpolative Absolute Block Truncation Coding for Image Compression

Ghadah

Publication Date

Sun Feb 24 2019

Journal Name

Iraqi Journal Of Physics

Adaptive inter frame compression using image segmented technique

Video compression

Image segmented

motion estimation

Ban Sabah

The computer vision branch of artificial intelligence is concerned with developing algorithms for analyzing video image content. Extracting edge information is the essential process in most pictorial pattern recognition problems. A new edge detection technique for detecting boundaries is introduced in this research.

A selection of typical lossy techniques for encoding edge video images is also discussed in this research. The discussion concentrates on the Block-Truncation coding technique and the Discrete Cosine Transform (DCT) coding technique. In order to reduce the volume of pictorial data which one may need to store or transmit,

Publication Date

Thu Feb 28 2019

Journal Name

Journal Of Engineering

Digital Color Image Watermarking Using Encoded Frequent Mark

watermarking

security

robustness

Abdulkareem Mohammed

Salih Hassan

With the rapid development of digital media and communication, methods for protection and security have become very important: exchanging and transmitting data over communication channels has driven efforts to protect these data from unauthorized access.

This paper presents a new method to protect color images from unauthorized access using watermarking. The watermarking algorithm hides the encoded mark image in the frequency domain using the Discrete Cosine Transform. The main principle of the algorithm is to encode a frequent mark in the cover color image. The watermark image bits are spread by repeating the mark and arranging it in an encoded manner, which gives the algorithm more robustness and security. The propos

Publication Date

Sat Feb 09 2019

Journal Name

Journal Of The College Of Education For Women

Medical Image Segmentation using Modified Interactive Thresholding Technique

Asst. instructor Noor Muwafak

Medical image segmentation has been one of the most actively studied fields in the past few decades. With the development of modern imaging modalities such as magnetic resonance imaging (MRI) and computed tomography (CT), physicians and technicians now have to process an increasing number and size of medical images. Efficient and accurate computational segmentation algorithms have therefore become necessary to extract the desired information from these large data sets. Moreover, sophisticated segmentation algorithms can help physicians better delineate the anatomical structures present in the input images, enhance the accuracy of medical diagnosis, and facilitate the best treatment planning. Many of the proposed algorithms could perform w

Publication Date

Sun Feb 25 2024

Journal Name

Baghdad Science Journal

Self-Localization of Guide Robots Through Image Classification

Convolutional Neural Network

Deep Learning

Guide Robot

Image Classification

Self-localization

Muhammad S.

Farhan B.

AKM B.

The field of autonomous robotic systems has advanced tremendously in the last few years, allowing them to perform complicated tasks in various contexts. One of the most important and useful applications of guide robots is the support of the blind. The successful implementation of this study requires a more accurate and powerful self-localization system for guide robots in indoor environments. This paper proposes a self-localization system for guide robots. To successfully implement this study, images were collected from the perspective of a robot inside a room, and a deep learning system such as a convolutional neural network (CNN) was used. An image-based self-localization guide robot image-classification system delivers a more accura

Publication Date

Tue Jun 23 2020

Journal Name

Baghdad Science Journal

Content Based Image Retrieval (CBIR) by Statistical Methods

Content Based Image Retrieval

Histogram statistical characteristics

T-test

Trademark Image Retrieval

Fathala

An image retrieval system is a computer system for browsing, searching, and retrieving images from a large database of digital images. The objective of Content-Based Image Retrieval (CBIR) methods is essentially to extract, from large image databases, a specified number of images similar in visual and semantic content to a so-called query image. The researchers developed a new retrieval mechanism based mainly on two procedures. The first procedure extracts the statistical features of both the original and the traditional image using the histogram and its statistical characteristics (mean, standard deviation). The second procedure relies on the T-
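
As a rough illustration of the first procedure only (histogram mean and standard deviation as image features), the following Python sketch ranks database images by how close their features are to those of a query image. It is not the authors' implementation: the function names `histogram_features` and `retrieve`, the synthetic grey-level images, and the use of Euclidean distance for ranking are all assumptions made for this example.

```python
import numpy as np

def histogram_features(image, bins=256):
    """Return [mean, std] of the grey-level distribution given by the histogram."""
    hist, edges = np.histogram(image, bins=bins, range=(0, 256))
    p = hist / hist.sum()            # normalize counts to probabilities
    levels = edges[:-1]              # left edge of each bin (0..255 when bins=256)
    mean = (levels * p).sum()
    std = np.sqrt((((levels - mean) ** 2) * p).sum())
    return np.array([mean, std])

def retrieve(query, database, top_k=3):
    """Rank database images by Euclidean distance between feature vectors.

    Euclidean distance is an assumption of this sketch, not a detail
    taken from the abstract.
    """
    q = histogram_features(query)
    feats = np.array([histogram_features(img) for img in database])
    dists = np.linalg.norm(feats - q, axis=1)
    return np.argsort(dists)[:top_k]

# Synthetic 32x32 grey-level images: dark, mid-tone, and bright (hypothetical data).
rng = np.random.default_rng(0)
database = [rng.integers(lo, hi, size=(32, 32))
            for lo, hi in [(0, 60), (90, 160), (200, 256)]]
query = rng.integers(195, 256, size=(32, 32))   # a bright query image

ranking = retrieve(query, database, top_k=3)
print(ranking)  # database indices, most similar first
```

Two summary statistics are a very coarse signature, so in practice such features would be one stage of a larger pipeline rather than a complete retrieval system.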

Publication Date

Sat Dec 01 2018

Journal Name

Al-Nahrain Journal Of Science

Image Classification Using Bag of Visual Words (BoVW)

SIFT

Euclidean distance

classification

k-nearest neighbor

Bag of Visual Words

Rafal

This paper presents two main stages for image classification. The training stage consists of collecting images of interest and applying BoVW to these images (feature extraction and description using SIFT, and vocabulary generation), while the testing stage classifies a new unlabeled image by applying a nearest neighbor classifier to its feature descriptors. The supervised bag of visual words gives good results, shown clearly in the experimental part, where unlabeled images are classified even though only a small number of images are used in the training process.
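
The two stages described above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the paper's code: synthetic random vectors stand in for SIFT descriptors, plain k-means is assumed for vocabulary generation (the abstract does not name the clustering method), and the function names `build_vocabulary`, `bovw_histogram`, and `classify_nearest` are hypothetical.

```python
import numpy as np

def build_vocabulary(descriptors, k, iters=20, seed=0):
    """Cluster local descriptors into k visual words with plain k-means."""
    rng = np.random.default_rng(seed)
    centers = descriptors[rng.choice(len(descriptors), size=k, replace=False)]
    for _ in range(iters):
        # Assign every descriptor to its nearest center (Euclidean distance).
        d = np.linalg.norm(descriptors[:, None, :] - centers[None, :, :], axis=2)
        labels = d.argmin(axis=1)
        for j in range(k):
            members = descriptors[labels == j]
            if len(members):          # keep the old center if a cluster is empty
                centers[j] = members.mean(axis=0)
    return centers

def bovw_histogram(descriptors, vocabulary):
    """Normalized histogram of visual-word occurrences for one image."""
    d = np.linalg.norm(descriptors[:, None, :] - vocabulary[None, :, :], axis=2)
    words = d.argmin(axis=1)
    hist = np.bincount(words, minlength=len(vocabulary)).astype(float)
    return hist / hist.sum()

def classify_nearest(query_hist, train_hists, train_labels):
    """1-nearest-neighbor classification on BoVW histograms."""
    dists = np.linalg.norm(train_hists - query_hist, axis=1)
    return train_labels[int(dists.argmin())]

# Synthetic stand-in for SIFT: 40 descriptors of dimension 8 per image,
# drawn around 0 for class "a" and around 6 for class "b" (hypothetical data).
rng = np.random.default_rng(1)
def fake_descriptors(center):
    return rng.normal(loc=center, scale=1.0, size=(40, 8))

train_images = [fake_descriptors(0.0) for _ in range(3)] + \
               [fake_descriptors(6.0) for _ in range(3)]
train_labels = np.array(["a"] * 3 + ["b"] * 3)

# Training stage: vocabulary generation, then one histogram per training image.
vocabulary = build_vocabulary(np.vstack(train_images), k=4)
train_hists = np.array([bovw_histogram(d, vocabulary) for d in train_images])

# Testing stage: classify a new, unlabeled image.
query_hist = bovw_histogram(fake_descriptors(6.0), vocabulary)
predicted = classify_nearest(query_hist, train_hists, train_labels)
print(predicted)
```

With real images, the synthetic descriptors would be replaced by the output of a SIFT extractor, and the vocabulary size would typically be much larger than the four words used here.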
