doi:10.22153/kej.2023.06.003

Details

Publication Date

Fri Dec 01 2023

Journal Name

Al-khwarizmi Engineering Journal

Volume

19

Issue Number

4

DOI

10.22153/kej.2023.06.003

Choose Citation Style

Statistics

View publication

2

View original publication

1

Click abstract more

1

Abstract Views

259

Galley Views

153

Statistics

An Overview of Audio-Visual Source Separation Using Deep Learning

Noorulhuda Mudhafar

Ahmed

Mohammed Najah

...Show More Authors

In this article, the research presents a general overview of deep learning-based AVSS (audio-visual source separation) systems. AVSS has achieved exceptional results in a number of areas, including decreasing noise levels, boosting speech recognition, and improving audio quality. The advantages and disadvantages of each deep learning model are discussed throughout the research as it reviews various current experiments on AVSS. The TCD TIMIT dataset (which contains top-notch audio and video recordings created especially for speech recognition tasks) and the Voxceleb dataset (a sizable collection of brief audio-visual clips with human speech) are just a couple of the useful datasets summarized in the paper that can be used to test AVSS systems. In its basic form, this review aims to highlight the growing importance of AVSS in improving the quality of audio signals.

View Publication Preview PDF

Quick Preview PDF

1 2 3 4 ... 2882 2883 2884 2885