Dual Stages of Speech Enhancement Algorithm Based on Super Gaussian Speech Models

Humam Awad Hussein; Shams Moaied Hameed; Basheera M. Mahmmod; Sadiq H. Abdulhussain; Abir Jaafar Hussain

doi:10.31026/j.eng.2023.09.01

Details

Publication Date

Fri Sep 01 2023

Journal Name

Journal Of Engineering

Volume

29

Issue Number

09

DOI

10.31026/j.eng.2023.09.01

Speech Enhancement Algorithms (SEA)

Gaussian speech model

Laplacian speech model

Discrete Tchebichef Transform (DTT)

Discrete Tchebichef-Krawtchouk Transform (DTKT)

Various speech enhancement algorithms (SEAs) have been developed over the last few decades. Each has its advantages and disadvantages, because the speech signal is affected by its acoustic environment. Distortion of speech results in the loss of important features, which makes the signal hard to understand. An SEA aims to improve the intelligibility and quality of speech that has been degraded by different types of noise. In most applications, quality improvement is highly desirable because it can reduce listener fatigue, especially when the listener is exposed to high noise levels for extended periods (e.g., in manufacturing). Because SEAs reduce or suppress background noise to some degree, they are sometimes called noise suppression algorithms. In this research, an SEA based on different speech models (Laplacian or Gaussian) is designed and implemented using two discrete transforms: the Discrete Tchebichef Transform (DTT) and the Discrete Tchebichef-Krawtchouk Transform (DTKT). The proposed estimator consists of two Wiener filter stages that can effectively estimate the clean speech signal. The evaluation measures show that the proposed SEA enhances noisy speech, both in comparison with other speech models and in a self-comparison across different noise types and levels. The improvement ratios of the presented algorithm in terms of average segmental SNR (SNRseg) are 1.96, 2.12, and 2.03 for Buccaneer, White, and Pink noise, respectively.
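As a rough illustration of what a two-stage Wiener estimator can look like, the sketch below applies the classical Wiener gain G = xi / (1 + xi) twice to one frame of real transform coefficients. It is not the authors' estimator: the Laplacian and Gaussian speech priors and the DTT/DTKT transforms are not modeled, the noise variance is assumed to be known, and the second stage (re-estimating the a priori SNR from the first-stage output) is only one possible reading of "dual stages". The names `wiener_gain`, `dual_stage_wiener`, and `xi_min` are hypothetical.

```python
import numpy as np

def wiener_gain(xi):
    """Wiener gain G = xi / (1 + xi) for an a priori SNR xi (elementwise)."""
    return xi / (1.0 + xi)

def dual_stage_wiener(noisy, noise_var, xi_min=1e-3):
    """Two Wiener passes over one frame of real transform coefficients.

    noisy     : coefficients of the noisy frame (any orthogonal transform)
    noise_var : noise variance per coefficient, assumed known here
    xi_min    : hypothetical floor on the a priori SNR estimate
    """
    noisy = np.asarray(noisy, dtype=float)
    noise_var = np.broadcast_to(np.asarray(noise_var, dtype=float), noisy.shape)

    # Stage 1: maximum-likelihood a priori SNR taken from the noisy coefficients.
    xi1 = np.maximum(noisy**2 / noise_var - 1.0, xi_min)
    stage1 = wiener_gain(xi1) * noisy

    # Stage 2: re-estimate the a priori SNR from the stage-1 output, then
    # filter the noisy coefficients again with the refined gain.
    xi2 = np.maximum(stage1**2 / noise_var, xi_min)
    stage2 = wiener_gain(xi2) * noisy
    return stage1, stage2
```

In a full system the function would be called per frame on the transform coefficients, and the enhanced frame would be obtained by the inverse transform; both steps are left out of this sketch.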

Publication Date

Tue Oct 29 2019

Journal Name

Journal Of Engineering

Mobile-based Human Emotion Recognition based on Speech and Heart rate

smartphone

neural network

smartwatch

speech signal

heart rate

Huda Majed

Hamid Mohammed

Mobile-based human emotion recognition is a very challenging subject. Most of the approaches suggested and built in this field use various contexts that can be derived from external sensors and the smartphone, but these approaches suffer from different obstacles and challenges. The proposed system integrates the human speech signal and heart rate in one system to improve the accuracy of human emotion recognition. It is designed to recognize four human emotions: angry, happy, sad, and normal. In this system, the smartphone is used to record the user's speech and send it to a server. The smartwatch, fixed on the user's wrist, is used to measure the user's heart rate while the user is speaking and send it, via Bluetooth,

Publication Date

Thu Apr 25 2019

Journal Name

Engineering And Technology Journal

Improvement of Harris Algorithm Based on Gaussian Scale Space

Difference of Gaussian

Harris corner detection.

Abdul Amir

Rafal

Features are descriptions of image content, such as corners, blobs, or edges. Corners are among the most important features for describing an image, so many corner detection algorithms exist, such as Harris, FAST, and SUSAN. Harris is an efficient and accurate corner detection method. Harris corner detection is rotation invariant, but it is not scale invariant. This paper presents an efficient Harris corner detector that is invariant to scale; the improvement is achieved by using a Gaussian function at different scales. The experimental results show that the Gaussian linear equation is very useful for addressing this weakness of Harris.
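To make the idea of evaluating Harris at several Gaussian scales concrete, here is a minimal NumPy-only sketch. It is a generic multi-scale Harris response, not the method of the paper above: the 0.7 ratio between derivative and integration scale, the scale normalization, and taking the maximum over scales are all assumptions, and the function names are hypothetical.

```python
import numpy as np

def gaussian_kernel(sigma):
    """Normalized 1-D Gaussian kernel truncated at about 3 sigma."""
    radius = max(1, int(3 * sigma + 0.5))
    x = np.arange(-radius, radius + 1)
    k = np.exp(-x**2 / (2.0 * sigma**2))
    return k / k.sum()

def smooth(img, sigma):
    """Separable Gaussian blur; assumes the image is larger than the kernel."""
    k = gaussian_kernel(sigma)
    rows = np.apply_along_axis(np.convolve, 1, img, k, mode="same")
    return np.apply_along_axis(np.convolve, 0, rows, k, mode="same")

def harris_response(img, sigma, k=0.04):
    """Harris response R = det(M) - k * trace(M)^2 at one integration scale."""
    sd = 0.7 * sigma                      # derivative scale (assumed ratio)
    iy, ix = np.gradient(smooth(img, sd))
    # Scale-normalized structure tensor entries, averaged over a Gaussian window.
    a = sd**2 * smooth(ix * ix, sigma)
    b = sd**2 * smooth(ix * iy, sigma)
    c = sd**2 * smooth(iy * iy, sigma)
    return (a * c - b * b) - k * (a + c) ** 2

def multiscale_harris(img, sigmas=(1.0, 1.5, 2.0), k=0.04):
    """Per-pixel maximum of the Harris response over several Gaussian scales."""
    img = np.asarray(img, dtype=float)
    return np.stack([harris_response(img, s, k) for s in sigmas]).max(axis=0)
```

On a synthetic image containing a bright square, the response is positive near the four corners and negative along the straight edges, which is the usual behavior of the Harris measure.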

Publication Date

Tue May 06 2025

Journal Name

Algorithms

Speech Enhancement Algorithms: A Systematic Literature Review

Sally Taha

Basheera M.

A growing and pressing need for Speech Enhancement Algorithms (SEAs) has emerged with the proliferation of hearing devices and mobile devices that aim to improve speech intelligibility without sacrificing speech quality. Recently, a tremendous number of studies have been conducted in the field of speech enhancement. This study aims to map the field of speech enhancement by conducting a systematic literature review to provide comprehensive details of recently proposed SEAs. This systematic review aims to highlight research trends in SEAs and direct researchers to the most important topics published between 2015 and 2024. It attempts to address seven key research questions related to this topic. Moreover, it covers articles available

Publication Date

Sun Jun 20 2021

Journal Name

Baghdad Science Journal

Arabic Speech Classification Method Based on Padding and Deep Learning Neural Network

Arabic alphabet

deep learning

speech classification

COVID-19

spectrogram

Asroni

Ku Ruhana

Cahya

Hasan Basri

Deep learning convolutional neural networks have been widely used to recognize or classify voice. Various techniques have been used together with convolutional neural networks to prepare voice data before the training process in developing the classification model. However, not all models can produce good classification accuracy, as there are many types of voice or speech. Classification of Arabic alphabet pronunciation is one such type, and accurate pronunciation is required when learning to read the Qur’an. Thus, processing the pronunciation and training on the processed data require a specific approach. To overcome this issue, a method based on padding and deep learning convolution neural network is proposed to

Publication Date

Wed Nov 06 2024

Journal Name

2024 17th International Conference On Development In Esystem Engineering (dese)

Speech Enhancement: A Review of Various Approaches, Trends, and challenges

Basheera M.

Sadiq H.

Taghreed Mohammed

Muntadher

Abir

Dhiya

Publication Date

Sat Jun 01 2013

Journal Name

مجلة كلية بغداد للعلوم الاقتصادية الجامعة

Proposed family speech recognition

Speech recognition

Speech Analysis

Speaker Recognition Using Neural Networks

Denoise

Wavelet.

Sawsan

Speech recognition is a very important field with many applications, such as access control for protected areas, banking, transactions over the telephone network, database access services, voice email, investigations, and home control and management. Speech recognition systems can be used in two modes: to identify a particular person or to verify a person’s claimed identity. Family speaker recognition is a recent area within speaker recognition. Members of the same family often have similar voice characteristics, which makes them hard to tell apart. Today, the scope of speech recognition is limited to speech collected from cooperative users in real-world office environments, without adverse microphone or channel impairments.

Publication Date

Mon Dec 31 2012

Journal Name

Al-khwarizmi Engineering Journal

Speech Compression Using Multecirculerletet Transform

Sound

Speech Compression

MCT

DWT

Sulaiman

Ali. K.

Compressing speech reduces data storage requirements and, in turn, the time needed to transmit digitized speech over long-haul links such as the Internet. To obtain the best performance in speech compression, wavelet transforms require filters that combine a number of desirable properties, such as orthogonality and symmetry. The MCT basis functions are derived from the GHM basis functions using 2D linear convolution. The fast computation algorithms introduced here add desirable features to the current transform. We further assess the performance of the MCT in a speech compression application. This paper discusses the effect of using the DWT and the MCT (one- and two-dimensional) on speech compression. DWT and MCT performances in terms of comp

Publication Date

Mon Jun 05 2023

Journal Name

Journal Of Engineering

Isolated Word Speech Recognition Using Mixed Transform

Mixed Transform

Radon Transform

Discrete Wavelet Transform

Discrete Multicircularlet Transform

Dynamic Time Warping

Sadiq Jassim

Shahad Mujeeb

Methods of speech recognition have been the subject of several studies over the past decade, and speech recognition has been one of the most exciting areas of signal processing. The mixed transform is a useful tool for speech signal processing; it was developed for its ability to improve feature extraction. Speech recognition includes three important stages: preprocessing, feature extraction, and classification. Recognition accuracy is strongly affected by the feature extraction stage; therefore, different models of the mixed transform for feature extraction were proposed. The recorded isolated word is a 1-D signal, so each 1-D word is converted into a 2-D form. The second step of the word recognizer requires, the

Publication Date

Sat Oct 01 2022

Journal Name

Al–bahith Al–a'alami

COMMUNICATION SKILLS THROUGH THE LANGUAGE OF SPEECH

language

communication

speech

Raaed

Language is the realistic and sensitive basis for any communication between two or more parties. It is an important workshop that prepares meanings and encodes them according to a linguistic structure governed by agreed rules that speak to and coexist with everyone.

The forms of communication are personal, mediated, and mass, and none of them can move away from language in its dealings and communication patterns. Since each has its own characteristics and skills, it must operate in its fields through verbal and non-verbal symbols and take on the elements of influential language as intended.

It makes the recipient face two things: whether he fails to understand those symbols hence its purpose fail, or he meditates s

Publication Date

Sun Jan 02 2022

Journal Name

Journal Of The College Of Languages (jcl)

Pragmatics and Speech Act- History, Importance and Stages of Development: הפרגמטיקה ופעולת־הדיבור- התולדות, החשיבות ושלבי ההתפתחות (יישמוים בלשון העברית)

Pragmatics

Speech act

Hebrew Language

John Austin

John Searle

Direct and Indirect.

פרגמטיקה

פעולת הדיבור

הלשון העברית

ג'ון אוסטין

ג'ון סירל

פעולות דיבור עקיפות וישירות.

Noor Fadhel

Ghazwan Majeed

The present study stresses two of the most significant aspects of the linguistic approach, “Pragmatics” and “Speech Act Theory”, revealing their importance and their stages and levels of development through an analysis of speech acts in the Hebrew language (including political speech, the Holy Bible, and Hebrew stories).

Chronologically, Pragmatics has always been at the center of linguists’ interests due to its importance in linguistic decryption, particularly through “Speech Act Theory”, which was initiated and developed by the most prominent philosophers and linguists.

The prese
