Deep Learning and Fusion Techniques for High-Precision Image Matting:

Liqaa M. Shoohi; Jamila H. Saud

Details

Publication Date

Thu Mar 13 2025

Journal Name

Academia Open

Volume

10

Issue Number

1

General Background: Deep image matting is a fundamental task in computer vision, enabling precise foreground extraction from complex backgrounds, with applications in augmented reality, computer graphics, and video processing.

Specific Background: Despite advancements in deep learning-based methods, preserving fine details such as hair and transparency remains a challenge.

Knowledge Gap: Existing approaches struggle with accuracy and efficiency, necessitating novel techniques to enhance matting precision.

Aims: This study integrates deep learning with fusion techniques to improve alpha matte estimation, proposing a lightweight U-Net model that incorporates color-space fusion and preprocessing.

Results: Experiments on the Adobe Composition-1k dataset demonstrate superior performance compared to traditional methods, with higher accuracy, faster processing speed, and improved boundary preservation.

Novelty: The proposed model effectively combines deep learning with fusion techniques, enhancing matting quality while maintaining robustness across various environmental conditions.

Implications: These findings highlight the potential of integrating fusion techniques with deep learning for image matting, offering valuable insights for future research in automated image processing applications, including augmented reality, gaming, and interactive video technologies.

Highlights:

Better Precision: Fusion techniques enhance fine detail preservation.

Faster Processing: Lightweight U-Net improves speed and accuracy.

Wide Applications: Useful for AR, gaming, and video processing.

Keywords: Deep image matting, computer vision, deep learning, fusion techniques, U-Net
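For readers new to the task, alpha matte estimation is usually framed as inverting the standard compositing equation I = αF + (1 − α)B, where α is the per-pixel opacity of the foreground F over the background B. The sketch below only illustrates that equation and one common matte error measure (sum of absolute differences). It is not the authors' U-Net or fusion pipeline, and the function names `composite` and `sad` are hypothetical, chosen here for illustration.

```python
import numpy as np

def composite(foreground, background, alpha):
    """Blend per pixel: I = alpha * F + (1 - alpha) * B (illustrative helper)."""
    a = alpha[..., np.newaxis]  # broadcast the (H, W) matte over the colour channels
    return a * foreground + (1.0 - a) * background

def sad(pred_alpha, true_alpha):
    """Sum of absolute differences between two alpha mattes (illustrative helper)."""
    return float(np.abs(pred_alpha - true_alpha).sum())

# Toy data (an assumption for the example): white foreground, black background.
fg = np.ones((2, 2, 3))
bg = np.zeros((2, 2, 3))
alpha = np.array([[1.0, 0.5], [0.25, 0.0]])
image = composite(fg, bg, alpha)
```

A matting model predicts `alpha` from the observed image; an error measure such as `sad` then compares the prediction against a ground-truth matte.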

Publication Date

Thu Aug 07 2025

Journal Name

Journal Of Image And Graphics

Analysis Evolution of Image Caption Techniques: Combining Conventional and Modern Methods for Improvement

Convolutional Neural Networks (CNN)

image caption

conventional methods

modern methods

hybrid approach

Nuha M.

Nada

This study explores the challenges that Artificial Intelligence (AI) systems face in generating image captions, a task that requires effective integration of computer vision and natural language processing techniques. It presents a comparative analysis between traditional approaches (such as retrieval-based methods and linguistic templates) and modern approaches based on deep learning (such as encoder-decoder models, attention mechanisms, and transformers). Theoretical results show that modern models perform better in accuracy and in the ability to generate more complex descriptions, while traditional methods are superior in speed and simplicity. The paper proposes a hybrid framework that combines the advantages of both approaches, where conventional methods prod

Publication Date

Fri Mar 18 2022

Journal Name

ARO-The Scientific Journal of Koya University

Detecting Deepfakes with Deep Learning and Gabor Filters

Wildan Jameel

Suhad Malallah

Ayad Rodhan

The proliferation of editing programs based on artificial intelligence techniques has contributed to the emergence of deepfake technology. Deepfakes fabricate and falsify facts by making a person appear to do or say things that he never did or said. Developing an algorithm for deepfake detection is therefore very important for discriminating real from fake media. Convolutional neural networks (CNNs) are among the most complex classifiers, but choosing the nature of the data fed to these networks is extremely important. For this reason, we capture fine texture details of input data frames using 16 Gabor filters in different directions and then feed them to a binary CNN classifier instead of using the red-green-blue

Publication Date

Fri Sep 01 2023

Journal Name

Journal Of Engineering

Iraqi Sentiment and Emotion Analysis Using Deep Learning

Emotion analysis

Sentiment analysis

CNN

GRU

Iraqi dialect

Anwar Abdul-Razzaq

Nada A. Z.

Analyzing sentiment and emotions in Arabic texts on social networking sites has gained wide interest from researchers. It has been an active research topic in recent years due to its importance in analyzing reviewers' opinions. The Iraqi dialect is one of the Arabic dialects used on social networking sites; its complexity makes sentiment analysis difficult. This work presents a hybrid deep learning model consisting of a Convolutional Neural Network (CNN) and Gated Recurrent Units (GRU) to analyze sentiment and emotions in Iraqi texts. Three Iraqi datasets (Iraqi Arab Emotions Data Set (IAEDS), Annotated Corpus of Mesopotamian-Iraqi Dialect (ACMID), and Iraqi Arabic Dataset (IAD)) col

Publication Date

Fri Jul 18 2014

Journal Name

International Journal Of Computer Applications

3-Level Techniques Comparison based Image Recognition

3-level Techniques

image recognition

stationary wavelet transform

wavelet transform

feature extraction

Zainab

Ahlam

Image recognition is one of the most important applications of information processing. In this paper, a comparison between 3-level techniques for image recognition has been carried out using discrete wavelet transforms (DWT) and stationary wavelet transforms (SWT): stationary-stationary-stationary (sss), stationary-stationary-wavelet (ssw), stationary-wavelet-stationary (sws), stationary-wavelet-wavelet (sww), wavelet-stationary-stationary (wss), wavelet-stationary-wavelet (wsw), wavelet-wavelet-stationary (wws) and wavelet-wavelet-wavelet (www). The techniques are compared according to the peak signal-to-noise ratio (PSNR), root mean square error (RMSE), compression ratio (CR) and the coding noise e(n) of each third

Publication Date

Mon Oct 02 2023

Journal Name

Journal Of Engineering

Microgrid Integration Based on Deep Learning NARMA-L2 Controller for Maximum Power Point Tracking

Microgrid

Solar PV

HER

Maximum power point tracking

Deep learning

PO-MPPT

INC-MPPT

Enas Hamid

Nadia Qasim

Hanan Mikhael D.

This paper presents a hybrid energy resources (HER) system consisting of solar PV, storage, and utility grid. It is a challenge in real time to extract maximum power point (MPP) from the PV solar under variations of the irradiance strength. This work addresses challenges in identifying global MPP, dynamic algorithm behavior, tracking speed, adaptability to changing conditions, and accuracy. Shallow Neural Networks using the deep learning NARMA-L2 controller have been proposed. It is modeled to predict the reference voltage under different irradiance. The dynamic PV solar and nonlinearity have been trained to track the maximum power drawn from the PV solar systems in real time.

Moreover, the proposed controller i

Publication Date

Sun Jun 20 2021

Journal Name

Baghdad Science Journal

Arabic Speech Classification Method Based on Padding and Deep Learning Neural Network

Arabic alphabet

deep learning

speech classification

COVID-19

spectrogram

Asroni

Ku Ruhana

Cahya

Hasan Basri

Deep learning convolutional neural networks have been widely used to recognize or classify voice. Various techniques have been used together with convolutional neural networks to prepare voice data before the training process in developing the classification model. However, not all models can produce good classification accuracy, as there are many types of voice or speech. Classification of Arabic alphabet pronunciation is one such type, and accurate pronunciation is required in learning to read the Qur’an. Thus, the technique used to process the pronunciation and to train on the processed data requires a specific approach. To overcome this issue, a method based on padding and a deep learning convolutional neural network is proposed to

Publication Date

Mon Jan 09 2023

Journal Name

2023 15th International Conference on Developments in eSystems Engineering (DeSE)

Deep Learning-Based Skin Cancer Identification

Sandhua M

Abir

Dhiya

Basheera M.

Sadiq H.

Publication Date

Wed Mar 15 2023

Journal Name

International Journal Of Advances In Intelligent Informatics

An automatic lip reading for short sentences using deep learning nets

Maha Abd

Kadhim

Lip-reading is a field of study whose importance has grown significantly in recent years, particularly with the widespread use of deep learning techniques. Lip reading is essential for speech recognition in noisy environments or for those with hearing impairments. It refers to recognizing spoken sentences using visual information acquired from lip movements. The lip area, especially for males, also poses several problems, such as a mustache and beard that may cover it. This paper proposes an automatic lip-reading system to recognize and classify short English sentences spoken by speakers using deep learning networks. Frames are extracted from the input video, and each frame is passed to the Viola-Jone

Publication Date

Thu Dec 16 2021

Journal Name

Translational Vision Science & Technology

A Hybrid Deep Learning Construct for Detecting Keratoconus From Corneal Maps

Ali H.

Zahraa M.

Zaid

Alexandru

Marcelo M.

Rossen M.

Siamak

Publication Date

Sat Jan 19 2019

Journal Name

Artificial Intelligence Review

Survey on supervised machine learning techniques for automatic text classification

Kadhim A.I.
