An Overview of Audio-Visual Source Separation Using Deep Learning

Noorulhuda Mudhafar Sulaiman; Ahmed Al Tmeme; Mohammed Najah  Mahdi

doi:10.22153/kej.2023.06.003

Details

Publication Date

Fri Dec 01 2023

Journal Name

Al-khwarizmi Engineering Journal

Volume

19

Issue Number

4

DOI

10.22153/kej.2023.06.003

Choose Citation Style

Statistics

View publication

19

View original publication

1

Click abstract more

1

Abstract Views

758

Galley Views

816

Statistics

(5)

(2)

An Overview of Audio-Visual Source Separation Using Deep Learning

Noorulhuda Mudhafar Sulaiman

Ahmed Al Tmeme

Mohammed Najah Mahdi

...Show More Authors

In this article, the research presents a general overview of deep learning-based AVSS (audio-visual source separation) systems. AVSS has achieved exceptional results in a number of areas, including decreasing noise levels, boosting speech recognition, and improving audio quality. The advantages and disadvantages of each deep learning model are discussed throughout the research as it reviews various current experiments on AVSS. The TCD TIMIT dataset (which contains top-notch audio and video recordings created especially for speech recognition tasks) and the Voxceleb dataset (a sizable collection of brief audio-visual clips with human speech) are just a couple of the useful datasets summarized in the paper that can be used to test AVSS systems. In its basic form, this review aims to highlight the growing importance of AVSS in improving the quality of audio signals.

View Publication Preview PDF

Quick Preview PDF

Publication Date

Sun Feb 25 2024

Journal Name

Baghdad Science Journal

Oil spill classification based on satellite image using deep learning techniques

Classification

Marine

Oil spill

satellite images

deep learning

Abubakar Salihu

Noorfa Haszlinna

Siti Zaiton Mohd

Razana

...Show More Authors

An oil spill is a leakage of pipelines, vessels, oil rigs, or tankers that leads to the release of petroleum products into the marine environment or on land that happened naturally or due to human action, which resulted in severe damages and financial loss. Satellite imagery is one of the powerful tools currently utilized for capturing and getting vital information from the Earth's surface. But the complexity and the vast amount of data make it challenging and time-consuming for humans to process. However, with the advancement of deep learning techniques, the processes are now computerized for finding vital information using real-time satellite images. This paper applied three deep-learning algorithms for satellite image classification

View Publication Preview PDF

(11)

(8)

Publication Date

Fri Mar 01 2024

Journal Name

Baghdad Science Journal

Deep Learning Techniques in the Cancer-Related Medical Domain: A Transfer Deep Learning Ensemble Model for Lung Cancer Prediction

Breast cancer

Cancer prediction

Deep learning

Ensemble learning

Lung cancer

Machine learning

Medical engineering.

Omar Abdullatif

Mohammed Jawad

Zenah Hadi Saied

...Show More Authors

Problem: Cancer is regarded as one of the world's deadliest diseases. Machine learning and its new branch (deep learning) algorithms can facilitate the way of dealing with cancer, especially in the field of cancer prevention and detection. Traditional ways of analyzing cancer data have their limits, and cancer data is growing quickly. This makes it possible for deep learning to move forward with its powerful abilities to analyze and process cancer data. Aims: In the current study, a deep-learning medical support system for the prediction of lung cancer is presented. Methods: The study uses three different deep learning models (EfficientNetB3, ResNet50 and ResNet101) with the transfer learning concept. The three models are trained using a

View Publication Preview PDF

(12)

(7)

Publication Date

Tue Dec 05 2023

Journal Name

Baghdad Science Journal

Indoor/Outdoor Deep Learning Based Image Classification for Object Recognition Applications

Deep learning

GoogleNet

Image classification

Indoor/outdoor

Transfer learning.

Omar Abdullatif

Mohammed Jawad

Zenah Hadi

...Show More Authors

With the rapid development of smart devices, people's lives have become easier, especially for visually disabled or special-needs people. The new achievements in the fields of machine learning and deep learning let people identify and recognise the surrounding environment. In this study, the efficiency and high performance of deep learning architecture are used to build an image classification system in both indoor and outdoor environments. The proposed methodology starts with collecting two datasets (indoor and outdoor) from different separate datasets. In the second step, the collected dataset is split into training, validation, and test sets. The pre-trained GoogleNet and MobileNet-V2 models are trained using the indoor and outdoor se

View Publication Preview PDF

(7)

(1)

Publication Date

Mon Jan 08 2024

Journal Name

Al-academy

Aesthetic visual discourse and the audio system in theatrical performance

Talib

...Show More Authors

Technologies of the theatre show elements get great transferring concerning the creative embodiment of its aesthetic elements including its raws, forms, parts and masses in an attempt to achieve the prin cipal expressive progress to embody the main theme of the idea and the intended subject .
This is within the criterion of supporting the way to deal with technologies (décor elements , lighting music tones vocal affects , fashion and makeup) that achieving the emagintional appropriate atmosbheres which the writer and the director of the theatre show aim to make it present and succeed by furming active participation tunches of the ability of the cinogra . phic – element dsigners in order to invlve the theatre space atmospheres in cl

View Publication Preview PDF

Publication Date

Sat Oct 01 2022

Journal Name

Baghdad Science Journal

COVID-19 Diagnosis System using SimpNet Deep Model

COVID-19

Deep Learning

SimpNet

X-ray Images

Tarza Hasan

Fattah

Berivan Hasan

...Show More Authors

After the outbreak of COVID-19, immediately it converted from epidemic to pandemic. Radiologic images of CT and X-ray have been widely used to detect COVID-19 disease through observing infrahilar opacity in the lungs. Deep learning has gained popularity in diagnosing many health diseases including COVID-19 and its rapid spreading necessitates the adoption of deep learning in identifying COVID-19 cases. In this study, a deep learning model, based on some principles has been proposed for automatic detection of COVID-19 from X-ray images. The SimpNet architecture has been adopted in our study and trained with X-ray images. The model was evaluated on both binary (COVID-19 and No-findings) classification and multi-class (COVID-19, No-findings

View Publication Preview PDF

(8)

Publication Date

Sat Jun 06 2020

Journal Name

Journal Of The College Of Education For Women

Image classification with Deep Convolutional Neural Network Using Tensorflow and Transfer of Learning

Convolutional Neural Network (CNN)

Synthetic Aperture Radar (SAR)

TensorFlow

Transfer learning

Visual Geometry Group (VGG16)

Aseel Sami

MatheelEmaduldin

...Show More Authors

The deep learning algorithm has recently achieved a lot of success, especially in the field of computer vision. This research aims to describe the classification method applied to the dataset of multiple types of images (Synthetic Aperture Radar (SAR) images and non-SAR images). In such a classification, transfer learning was used followed by fine-tuning methods. Besides, pre-trained architectures were used on the known image database ImageNet. The model VGG16 was indeed used as a feature extractor and a new classifier was trained based on extracted features.The input data mainly focused on the dataset consist of five classes including the SAR images class (houses) and the non-SAR images classes (Cats, Dogs, Horses, and Humans). The Conv

View Publication Preview PDF

(1)

Publication Date

Sun Jun 20 2021

Journal Name

Baghdad Science Journal

Arabic Speech Classification Method Based on Padding and Deep Learning Neural Network

Arabic alphabet

deep learning

speech classification

COVID-19

spectrogram

Asroni

Ku Ruhana

Cahya

Hasan Basri

...Show More Authors

Deep learning convolution neural network has been widely used to recognize or classify voice. Various techniques have been used together with convolution neural network to prepare voice data before the training process in developing the classification model. However, not all model can produce good classification accuracy as there are many types of voice or speech. Classification of Arabic alphabet pronunciation is a one of the types of voice and accurate pronunciation is required in the learning of the Qur’an reading. Thus, the technique to process the pronunciation and training of the processed data requires specific approach. To overcome this issue, a method based on padding and deep learning convolution neural network is proposed to

View Publication Preview PDF

(25)

(7)

Publication Date

Tue Dec 21 2021

Journal Name

Mendel

Hybrid Deep Learning Model for Singing Voice Separation

R.

A.

...Show More Authors

Monaural source separation is a challenging issue due to the fact that there is only a single channel available; however, there is an unlimited range of possible solutions. In this paper, a monaural source separation model based hybrid deep learning model, which consists of convolution neural network (CNN), dense neural network (DNN) and recurrent neural network (RNN), will be presented. A trial and error method will be used to optimize the number of layers in the proposed model. Moreover, the effects of the learning rate, optimization algorithms, and the number of epochs on the separation performance will be explored. Our model was evaluated using the MIR-1K dataset for singing voice separation. Moreover, the proposed approach achi

View Publication

(4)

Publication Date

Mon Nov 21 2022

Journal Name

College Of Islamic Sciences

Sources of audio images in the poetry of the Islamic

audio felt

Entehe abbes ageborye

...Show More Authors

الحمدُ للهِ رب العالمين ، والصلاة والسلام على نبيه الأمين محمد r وعلى آله الطيبين الطاهرين ، وأصحابه الغر الميامين:

تعد الصورة السمعية مفهوما بيانيا نجده في البلاغة العربية واضحاً مؤثرا، مؤديا دورا جوهريا في إيصال الفكرة التي يروم الأديب إيصالها إلى المتلقي ولا تبدو السمعية واضحة إلاّ إذا نظر إليها في حالة أدبيه تهز كيان الشاعر &nbsp

View Publication Preview PDF

Publication Date

Mon Jan 01 2024

Journal Name

Bio Web Of Conferences

An overview of machine learning classification techniques

Amer F.A.H.

Tasnim Hasan Kadhim

...Show More Authors

Machine learning (ML) is a key component within the broader field of artificial intelligence (AI) that employs statistical methods to empower computers with the ability to learn and make decisions autonomously, without the need for explicit programming. It is founded on the concept that computers can acquire knowledge from data, identify patterns, and draw conclusions with minimal human intervention. The main categories of ML include supervised learning, unsupervised learning, semisupervised learning, and reinforcement learning. Supervised learning involves training models using labelled datasets and comprises two primary forms: classification and regression. Regression is used for continuous output, while classification is employed

View Publication Preview PDF

(83)

(67)

1 2 3 4 ... 2715 2716 2717 2718