Analysis Evolution of Image Caption Techniques: Combining Conventional and Modern Methods for Improvement

Nuha M. Khassaf; Nada Hussein M. Ali

doi:10.18178/joig.13.4.406-418

Details

Publication Date

Thu Aug 07 2025

Journal Name

Journal Of Image And Graphics

Volume

13

Issue Number

4

DOI

10.18178/joig.13.4.406-418

Choose Citation Style

Statistics

Analysis Evolution of Image Caption Techniques: Combining Conventional and Modern Methods for Improvement

Convolutional Neural Networks (CNN)

image caption

conventional methods

modern methods

hybrid approach

Nuha M. Khassaf

Nada Hussein M. Ali

...Show More Authors

This study explores the challenges in Artificial Intelligence (AI) systems in generating image captions, a task that requires effective integration of computer vision and natural language processing techniques. A comparative analysis between traditional approaches such as retrieval- based methods and linguistic templates) and modern approaches based on deep learning such as encoder-decoder models, attention mechanisms, and transformers). Theoretical results show that modern models perform better for the accuracy and the ability to generate more complex descriptions, while traditional methods outperform speed and simplicity. The paper proposes a hybrid framework that combines the advantages of both approaches, where conventional methods produce an initial description, which is then contextually, and refined using modern models. Preliminary estimates indicate that this approach could reduce the initial computational cost by up to 20% compared to relying entirely on deep models while maintaining high accuracy. The study recommends further research to develop effective coordination mechanisms between traditional and modern methods and to move to the experimental validation phase of the hybrid model in preparation for its application in environments that require a balance between speed and accuracy, such as real-time computer vision applications.

View Publication Preview PDF

Quick Preview PDF

Publication Date

Sun Mar 15 2020

Journal Name

Al-academy

Techniques of Acting Performance in Fantasy Theatrical Show

Russil

...Show More Authors

View Publication

Publication Date

Sun Apr 23 2017

Journal Name

International Conference Of Reliable Information And Communication Technology

Classification of Arabic Writer Based on Clustering Techniques

Mohammed

...Show More Authors

Arabic text categorization for pattern recognitions is challenging. We propose for the first time a novel holistic method based on clustering for classifying Arabic writer. The categorization is accomplished stage-wise. Firstly, these document images are sectioned into lines, words, and characters. Secondly, their structural and statistical features are obtained from sectioned portions. Thirdly, F-Measure is used to evaluate the performance of the extracted features and their combination in different linkage methods for each distance measures and different numbers of groups. Finally, experiments are conducted on the standard KHATT dataset of Arabic handwritten text comprised of varying samples from 1000 writers. The results in the generatio

(6)

Publication Date

Wed May 31 2023

Journal Name

Iraqi Geological Journal

A Survey of Infill Well Location Optimization Techniques

Optimization

Reservoir simulation

Infill well drilling

Well placement

Zainab

Omar

...Show More Authors

The maximization of the net present value of the investment in oil field improvements is greatly aided by the optimization of well location, which plays a significant role in the production of oil. However, using of optimization methods in well placement developments is exceedingly difficult since the well placement optimization scenario involves a large number of choice variables, objective functions, and restrictions. In addition, a wide variety of computational approaches, both traditional and unconventional, have been applied in order to maximize the efficiency of well installation operations. This research demonstrates how optimization approaches used in well placement have progressed since the last time they were examined. Fol

View Publication Preview PDF

(4)

(2)

Publication Date

Sun Mar 30 2025

Journal Name

Iraqi Journal Of Science

Segmentation of Aerial Images Using Different Clustering Techniques

Maha A.

Firas A.

Tole

...Show More Authors

The segmentation of aerial images using different clustering techniques offers valuable insights into interpreting and analyzing such images. By partitioning the images into meaningful regions, clustering techniques help identify and differentiate various objects and areas of interest, facilitating various applications, including urban planning, environmental monitoring, and disaster management. This paper aims to segment color aerial images to provide a means of organizing and understanding the visual information contained within the image for various applications and research purposes. It is also important to look into and compare the basic workings of three popular clustering algorithms: K-Medoids, Fuzzy C-Mean (FCM), and Gaussia

View Publication

Publication Date

Wed May 24 2023

Journal Name

2023 9th International Conference On Information Technology Trends (itt)

A Comparative Study of Unauthorized Drone Detection Techniques

Charalampos

Piromalis

Izzat

Georgios

Hatem

...Show More Authors

View Publication

(2)

Publication Date

Tue Aug 15 2023

Journal Name

Al-academy

The aesthetic effect of vocal recitation in building the theatrical image

impact

recitation

theatrical image

Kazem

...Show More Authors

Between the duality of sound and image, the completeness of the actor’s personality at the director comes to announce the birth of the appropriate theatrical role for that character as the basic and inherent element of the artwork, within his working system in the pattern of vocal behavior as well as motor/signal behavior as he searches for aesthetic and skill proficiency at the same time.
This is done through the viewer’s relationship with the theatrical event, which the director considers as an area of active creative activity in relation to (the work of the actor) through vocal recitation and the signs it broadcasts in order to fulfill the requirements of the dramatic situation and what it requires of a visual vision drawn in t

View Publication Preview PDF

Publication Date

Mon May 01 2023

Journal Name

Indonesian Journal Of Electrical Engineering And Computer Science

Comparison hybrid techniques-based mixed transform using compression and quality metrics

Zainab

...Show More Authors

Image quality plays a vital role in improving and assessing image compression performance. Image compression represents big image data to a new image with a smaller size suitable for storage and transmission. This paper aims to evaluate the implementation of the hybrid techniques-based tensor product mixed transform. Compression and quality metrics such as compression-ratio (CR), rate-distortion (RD), peak signal-to-noise ratio (PSNR), and Structural Content (SC) are utilized for evaluating the hybrid techniques. Then, a comparison between techniques is achieved according to these metrics to estimate the best technique. The main contribution is to improve the hybrid techniques. The proposed hybrid techniques are consisting of discrete wavel

View Publication

(2)

Publication Date

Mon Dec 14 2020

Journal Name

2020 13th International Conference On Developments In Esystems Engineering (dese)

Anomaly Based Intrusion Detection System Using Hierarchical Classification and Clustering Techniques

H.

Suhaila N.

...Show More Authors

With the rapid development of computers and network technologies, the security of information in the internet becomes compromise and many threats may affect the integrity of such information. Many researches are focused theirs works on providing solution to this threat. Machine learning and data mining are widely used in anomaly-detection schemes to decide whether or not a malicious activity is taking place on a network. In this paper a hierarchical classification for anomaly based intrusion detection system is proposed. Two levels of features selection and classification are used. In the first level, the global feature vector for detection the basic attacks (DoS, U2R, R2L and Probe) is selected. In the second level, four local feature vect

View Publication

(3)

(2)

Publication Date

Thu Oct 12 2017

Journal Name

Iraqi Journal Of Laser

A Comparative Evaluation of Post-Operative Pain and Function after Gingival Depigmentation Using 940 Nm Diode Laser And Conventional Bur Method: 6 Months Study

Mahdi A.

...Show More Authors

The aim of the study was to evaluate the efficacy of diode laser (λ=940 nm) in the management of gingival hyperpigmentation compared to the conventional bur method. Materials and methods: Eighteen patients with gingival hyperpigmentation were selected for the study with an age between 12-37 years old. The site of treatment was the upper gingiva using diode laser for the right half and the conventional method for the left half. All patients were re-evaluated after the following intervals: 3 days, 7 days, 1 month and 6 months post-operation. Pain and functions were re-evaluated in each visit for a period of 1 day, 3 days and 1 week post-operation. Laser parameters included 1.5 W in continuous mode with an initiated tip (400 μm) placed in

View Publication Preview PDF

Publication Date

Wed Mar 08 2023

Journal Name

Sensors

A Critical Review of Remote Sensing Approaches and Deep Learning Techniques in Archaeology

Israa

Fanar M.

...Show More Authors

To date, comprehensive reviews and discussions of the strengths and limitations of Remote Sensing (RS) standalone and combination approaches, and Deep Learning (DL)-based RS datasets in archaeology have been limited. The objective of this paper is, therefore, to review and critically discuss existing studies that have applied these advanced approaches in archaeology, with a specific focus on digital preservation and object detection. RS standalone approaches including range-based and image-based modelling (e.g., laser scanning and SfM photogrammetry) have several disadvantages in terms of spatial resolution, penetrations, textures, colours, and accuracy. These limitations have led some archaeological studies to fuse/integrate multip

View Publication

(10)

(9)

1 2 ... 83 84 85 86 ... 2826 2827