An Overview of Audio-Visual Source Separation Using Deep Learning

Noorulhuda Mudhafar Sulaiman; Ahmed Al Tmeme; Mohammed Najah  Mahdi

doi:10.22153/kej.2023.06.003

Details

Publication Date

Fri Dec 01 2023

Journal Name

Al-khwarizmi Engineering Journal

Volume

19

Issue Number

4

DOI

10.22153/kej.2023.06.003

Choose Citation Style

Statistics

View publication

12

View original publication

1

Click abstract more

1

Abstract Views

512

Galley Views

446

Statistics

(1)

An Overview of Audio-Visual Source Separation Using Deep Learning

Noorulhuda Mudhafar Sulaiman

Ahmed Al Tmeme

Mohammed Najah Mahdi

...Show More Authors

In this article, the research presents a general overview of deep learning-based AVSS (audio-visual source separation) systems. AVSS has achieved exceptional results in a number of areas, including decreasing noise levels, boosting speech recognition, and improving audio quality. The advantages and disadvantages of each deep learning model are discussed throughout the research as it reviews various current experiments on AVSS. The TCD TIMIT dataset (which contains top-notch audio and video recordings created especially for speech recognition tasks) and the Voxceleb dataset (a sizable collection of brief audio-visual clips with human speech) are just a couple of the useful datasets summarized in the paper that can be used to test AVSS systems. In its basic form, this review aims to highlight the growing importance of AVSS in improving the quality of audio signals.

View Publication Preview PDF

Quick Preview PDF

Publication Date

Sat Aug 30 2025

Journal Name

International Journal Of Social Sciences And English Literature

Critical Discourse Analysis of Online Platforms: An Overview

Nawal Fadhil

Shahad Saad

Alham Fadhl

...Show More Authors

The rise of online platforms has transformed the discourse landscape, enabling users to create and share content actively, thereby shaping public perceptions and societal narratives. Understanding the dynamics of this discourse is essential for comprehending its socio-political implications. This review aims to provide a comprehensive overview of Critical Discourse Analysis (CDA) concerning online platforms, exploring how language is utilized across various digital contexts to influence identity formation and social inequalities. Methodologically, the review systematically searches electronic databases, including Google Scholar and ProQuest, using keywords related to CDA and online platforms. A total of 30 relevant studies are purpo

View Publication

Publication Date

Sun Nov 01 2020

Journal Name

Iop Conference Series: Materials Science And Engineering

Face Recognition and Emotion Recognition from Facial Expression Using Deep Learning Neural Network

Ali

Zubaidah

Zainab

...Show More Authors

Abstract<p>Face recognition, emotion recognition represent the important bases for the human machine interaction. To recognize the person’s emotion and face, different algorithms are developed and tested. In this paper, an enhancement face and emotion recognition algorithm is implemented based on deep learning neural networks. Universal database and personal image had been used to test the proposed algorithm. Python language programming had been used to implement the proposed algorithm.</p>

View Publication

(8)

(2)

Publication Date

Tue Aug 10 2021

Journal Name

Design Engineering

Lossy Image Compression Using Hybrid Deep Learning Autoencoder Based On kmean Clusteri

Image compression

Convolutional Autoencoder (CAE)

k-mean algorithm

PSNR

Compression Rate (CR)

MSE

Clustering

CLIC

Kodak

deep learning

lossy

Mohammed

...Show More Authors

Image compression plays an important role in reducing the size and storage of data while increasing the speed of its transmission through the Internet significantly. Image compression is an important research topic for several decades and recently, with the great successes achieved by deep learning in many areas of image processing, especially image compression, and its use is increasing Gradually in the field of image compression. The deep learning neural network has also achieved great success in the field of processing and compressing various images of different sizes. In this paper, we present a structure for image compression based on the use of a Convolutional AutoEncoder (CAE) for deep learning, inspired by the diversity of human eye

Publication Date

Thu Feb 24 2022

Journal Name

Journal Of Educational And Psychological Researches

The Effect of using Project - Based Learning method in development intensive reading skills at middle school students

Project

Based Learning

Intensive Reading

Middle School Students

Feras Mohammed AL-Madani

...Show More Authors

The purpose of this research is to identify the effect of the use of project-based learning in the development of intensive reading skills at middle school students. The experimental design was chosen from one group to suit the nature of the research and its objectives. The research group consisted of 35 students. For the purpose of the research, the following materials and tools were prepared: (List of intensive reading skills, intensive reading skills test, teacher's guide, student book). The results of the study showed that there were statistically significant differences at (0.05) in favor of the post-test performance of intensive reading skills. The statistical analysis also showed that the project-based learning approach has a high

View Publication Preview PDF

Publication Date

Sun Jun 15 2025

Journal Name

Iraqi Journal Of Laser

Performance Enhancement of Metasurface Grating Polarizer Using Deep Learning for Quantum Key Distribution Systems

metasurface

polarizer

grating

deep learning

neural network and surrogate model

Hayder Sami Jassim

Shelan

...Show More Authors

Metasurface polarizers are essential optical components in modern integrated optics and play a vital role in many optical applications including Quantum Key Distribution systems in quantum cryptography. However, inverse design of metasurface polarizers with high efficiency depends on the proper prediction of structural dimensions based on required optical response. Deep learning neural networks can efficiently help in the inverse design process, minimizing both time and simulation resources requirements, while better results can be achieved compared to traditional optimization methods. Hereby, utilizing the COMSOL Multiphysics Surrogate model and deep neural networks to design a metasurface grating structure with high extinction rat

View Publication Preview PDF

Publication Date

Mon Apr 01 2024

Journal Name

Telkomnika (telecommunication Computing Electronics And Control)

Classification of grapevine leaves images using VGG-16 and VGG-19 deep learning nets

Maha

Firas A.

Tole

...Show More Authors

The successful implementation of deep learning nets opens up possibilities for various applications in viticulture, including disease detection, plant health monitoring, and grapevine variety identification. With the progressive advancements in the domain of deep learning, further advancements and refinements in the models and datasets can be expected, potentially leading to even more accurate and efficient classification systems for grapevine leaves and beyond. Overall, this research provides valuable insights into the potential of deep learning for agricultural applications and paves the way for future studies in this domain. This work employs a convolutional neural network (CNN)-based architecture to perform grapevine leaf image classifi

View Publication

(16)

(13)

Publication Date

Fri Jan 01 2021

Journal Name

Artificial Intelligence For Covid-19

An Efficient Mixture of Deep and Machine Learning Models for COVID-19 and Tuberculosis Detection Using X-Ray Images in Resource Limited Settings

Ali H.

Rami N.

Zahraa M.

Javier

...Show More Authors

View Publication

(33)

(28)

Publication Date

Sun Mar 01 2020

Journal Name

Baghdad Science Journal

Eyewitnesses’ Visual Recollection in Suspect Identification by using Facial Appearance Model

Active Appearance Model

Facial Morphology

Suspect Identification

Horkaew

...Show More Authors

Facial recognition has been an active field of imaging science. With the recent progresses in computer vision development, it is extensively applied in various areas, especially in law enforcement and security. Human face is a viable biometric that could be effectively used in both identification and verification. Thus far, regardless of a facial model and relevant metrics employed, its main shortcoming is that it requires a facial image, against which comparison is made. Therefore, closed circuit televisions and a facial database are always needed in an operational system. For the last few decades, unfortunately, we have experienced an emergence of asymmetric warfare, where acts of terrorism are often committed in secluded area with no

View Publication Preview PDF

(3)

Publication Date

Tue Jun 01 2021

Journal Name

Al-khwarizmi Engineering Journal

Effect of Environmental Factors on the Accuracy of a Quality Inspection System Based on Transfer Learning

Ahmed

Faiz F.

Wisam S.

...Show More Authors

In this research, a study is introduced on the effect of several environmental factors on the performance of an already constructed quality inspection system, which was designed using a transfer learning approach based on convolutional neural networks. The system comprised two sets of layers, transferred layers set from an already trained model (DenseNet121) and a custom classification layers set. It was designed to discriminate between damaged and undamaged helical gears according to the configuration of the gear regardless to its dimensions, and the model showed good performance discriminating between the two products at ideal conditions of high-resolution images.

So, this study aimed at testing the system performance at poor s

View Publication Preview PDF

(1)

Publication Date

Thu Aug 31 2023

Journal Name

Journal Européen Des Systèmes Automatisés

Deep Learning Approach for Oil Pipeline Leakage Detection Using Image-Based Edge Detection Techniques

Muhammad H.

Ali H.

...Show More Authors

Natural gas and oil are one of the mainstays of the global economy. However, many issues surround the pipelines that transport these resources, including aging infrastructure, environmental impacts, and vulnerability to sabotage operations. Such issues can result in leakages in these pipelines, requiring significant effort to detect and pinpoint their locations. The objective of this project is to develop and implement a method for detecting oil spills caused by leaking oil pipelines using aerial images captured by a drone equipped with a Raspberry Pi 4. Using the message queuing telemetry transport Internet of Things (MQTT IoT) protocol, the acquired images and the global positioning system (GPS) coordinates of the images' acquisition are

View Publication

(13)

(5)

1 2 ... 5 6 7 8 ... 2564 2565