Isolated Word Speech Recognition Using Mixed Transform

Sadiq Jassim Abou-Loukh; Shahad Mujeeb Abdul-Razzaq

doi:10.31026/j.eng.2013.10.06

Details

Publication Date

Mon Jun 05 2023

Journal Name

Journal Of Engineering

Volume

19

Issue Number

10

DOI

10.31026/j.eng.2013.10.06

Mixed Transform

Radon Transform

Discrete Wavelet Transform

Discrete Multicircularlet Transform

Dynamic Time Warping

Methods of speech recognition have been the subject of several studies over the past decade, and speech recognition has been one of the most exciting areas of signal processing. The mixed transform is a useful tool for speech signal processing; it was developed for its ability to improve feature extraction. Speech recognition includes three important stages: preprocessing, feature extraction, and classification. Recognition accuracy is strongly affected by the feature extraction stage; therefore, different mixed-transform models for feature extraction were proposed. Each recorded isolated word is a 1-D signal, so the first step converts each 1-D word into a 2-D form. The second step of the word recognizer applies the 2-D FFT, the Radon transform, and the 1-D IFFT; the 1-D discrete wavelet transform was then used in the first proposed model, while the discrete multicircularlet transform was used in the second proposed model. The final stage of the proposed models uses the dynamic time warping algorithm for the recognition task. The performance of the proposed systems was evaluated using forty different isolated Arabic words, each recorded fifteen times in a studio, in a speaker-dependent setting. The results show recognition accuracies of 91% and 89% using the discrete wavelet transform of type Daubechies Db1 and Db4, respectively, and an accuracy between 87% and 93% using the discrete multicircularlet transform for 9 sub-bands.
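
As a rough illustration of the last two stages described in the abstract (wavelet-based feature extraction followed by dynamic time warping), a minimal sketch is given here. It is not the authors' implementation: it leaves out the 1-D to 2-D conversion, the 2-D FFT, the Radon transform, and the multicircularlet model, and it assumes a single-level Haar (Db1) decomposition with an absolute-difference local cost. The function names `haar_dwt`, `dtw_distance`, and `recognize`, as well as the toy word templates, are hypothetical.

```python
import math


def haar_dwt(signal):
    """One level of the Haar (Db1) DWT; returns (approximation, detail)."""
    signal = list(signal)
    if len(signal) % 2:
        # Assumption: pad odd-length input by repeating the last sample.
        signal.append(signal[-1])
    s = math.sqrt(2.0)
    approx = [(signal[i] + signal[i + 1]) / s for i in range(0, len(signal), 2)]
    detail = [(signal[i] - signal[i + 1]) / s for i in range(0, len(signal), 2)]
    return approx, detail


def dtw_distance(a, b):
    """Classic DTW cost between two 1-D sequences with |x - y| local cost."""
    inf = float("inf")
    n, m = len(a), len(b)
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = abs(a[i - 1] - b[j - 1])
            # Extend the cheapest of the three allowed predecessor paths.
            cost[i][j] = d + min(cost[i - 1][j], cost[i][j - 1], cost[i - 1][j - 1])
    return cost[n][m]


def recognize(features, templates):
    """Return the label of the template with the smallest DTW distance."""
    return min(templates, key=lambda label: dtw_distance(features, templates[label]))


# Toy data (hypothetical): two reference "words" and a time-stretched query.
templates = {
    "word_a": haar_dwt([0, 0, 1, 1, 2, 2, 1, 1, 0, 0])[0],
    "word_b": haar_dwt([2, 2, 0, 0, 2, 2, 0, 0, 2, 2])[0],
}
query = haar_dwt([0, 0, 0, 0, 1, 1, 2, 2, 1, 1, 0, 0])[0]
print(recognize(query, templates))  # prints: word_a
```

Because DTW allows one sample to align with several, the longer leading silence in the query costs nothing against `word_a`, which is why time warping suits repeated utterances of the same word spoken at different speeds.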

Publication Date

Fri Apr 15 2016

Journal Name

International Journal Of Computer Applications

Hybrid Techniques based Speech Recognition

Hybrid techniques

speech recognition

multi-wavelet transform

wavelet transform

stationary wavelet transform

feature extraction

dynamic time warping.

Ahlam

Zainab

Tariq

Speech recognition is an important application of information processing. In this paper, two hybrid techniques are presented. The first is a 3-level hybrid of the Stationary Wavelet Transform (S) and the Discrete Wavelet Transform (W), and the second is a 3-level hybrid of the Discrete Wavelet Transform (W) and the Multi-wavelet Transform (M). To choose the best 3-level hybrid in each technique, a comparison according to five factors was carried out, and the best results are WWS, WWW, and MWM. Speech recognition is performed on WWS, WWW, and MWM using Euclidean distance (Ecl) and Dynamic Time Warping (DTW). The match performance is 98% using DTW with MWM, while WWS and WWW reach 74% and 78%, respectively, but when using (

Publication Date

Mon Dec 31 2012

Journal Name

Al-khwarizmi Engineering Journal

Speech Compression Using Multecirculerletet Transform

Sound

Speech Compression

MCT

DWT

Sulaiman

Ali. K.

Compressing speech reduces data storage requirements and, in turn, the time needed to transmit digitized speech over long-haul links such as the Internet. To obtain the best performance in speech compression, wavelet transforms require filters that combine a number of desirable properties, such as orthogonality and symmetry. The MCT basis functions are derived from the GHM basis functions using 2-D linear convolution. The fast computation algorithms introduced here add desirable features to the current transform. We further assess the performance of the MCT in a speech compression application. This paper discusses the effect of using DWT and MCT (one- and two-dimensional) on speech compression. DWT and MCT performances in terms of comp

Publication Date

Sat Oct 31 2020

Journal Name

International Journal Of Intelligent Engineering And Systems

Speech Emotion Recognition Using MELBP Variants of Spectrogram Image

Speech emotion

Spectrogram image

Multi-block extended local binary pattern (MELBP)

Deep belief network (DBN)

Short term Fourier transform (STFT)

Suhaila N.

Publication Date

Thu Nov 01 2018

Journal Name

2018 1st Annual International Conference On Information And Sciences (aicis)

Speech Emotion Recognition Using Minimum Extracted Features

Speech emotion recognition

Minimum feature extraction

ZCR

12 MFCC

Random forest

Wisal Hashim

Rafah Shihab

Mohammed Najm

Recognizing speech emotions is an important subject in pattern recognition. This work studies the effect of extracting the minimum possible number of features on a speech emotion recognition (SER) system. In this paper, three experiments were performed to find the approach that gives good accuracy. The first extracts only three features: zero crossing rate (ZCR), mean, and standard deviation (SD) from emotional speech samples; the second extracts only the first 12 Mel frequency cepstral coefficient (MFCC) features; and the last applies feature fusion between the mentioned features. In all experiments, the features are classified using five types of classification techniques, which are the Random Forest (RF),
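
The three-feature experiment mentioned in this abstract (ZCR, mean, and SD) can be sketched as follows. This is only an illustration under stated assumptions, not the paper's code: the function names `zero_crossing_rate` and `basic_features` are hypothetical, the ZCR is taken as the fraction of adjacent sample pairs whose signs differ, and the SD is the population standard deviation. The listing does not say how the authors framed or normalized the signal.

```python
import math


def zero_crossing_rate(samples):
    """Fraction of adjacent sample pairs whose signs differ (zero counts as positive)."""
    if len(samples) < 2:
        return 0.0
    crossings = sum(
        1 for prev, cur in zip(samples, samples[1:]) if (prev >= 0) != (cur >= 0)
    )
    return crossings / (len(samples) - 1)


def basic_features(samples):
    """Return [ZCR, mean, population SD] for a non-empty list of samples."""
    n = len(samples)
    mean = sum(samples) / n
    sd = math.sqrt(sum((x - mean) ** 2 for x in samples) / n)
    return [zero_crossing_rate(samples), mean, sd]


# A signal that alternates sign at every sample crosses zero at every step.
print(basic_features([1.0, -1.0, 1.0, -1.0]))  # prints: [1.0, 0.0, 1.0]
```

A vector like this could then be passed to any classifier; fusing it with other features amounts to concatenating the lists.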

Publication Date

Mon Jan 09 2023

Journal Name

2023 15th International Conference On Developments In Esystems Engineering (dese)

Deep Learning-Based Speech Enhancement Algorithm Using Charlier Transform

Sally Antoin

Hala Jassim

Hayder Saadi Radeaf

Basheera M.

Sadiq H.

Publication Date

Sat Jun 01 2013

Journal Name

مجلة كلية بغداد للعلوم الاقتصادية الجامعة

Proposed family speech recognition

Speech recognition

Speech Analysis

Speaker Recognition Using Neural Networks

Denoise

Wavelet.

Sawsan

Speech recognition is a very important field that can be used in many applications, such as access control for protected areas, banking, transactions over the telephone network, database access services, voice email, investigations, and home control and management. Speech recognition systems can be used in two modes: to identify a particular person or to verify a person's claimed identity. Family speaker recognition is a recent field within speaker recognition. Many speakers from the same family have similar voice characteristics and are hard to tell apart. Today, the scope of speech recognition is limited to speech collected from cooperative users in real-world office environments and without adverse microphone or channel impairments.

Publication Date

Thu Feb 07 2019

Journal Name

Journal Of The College Of Education For Women

SPEECH RECOGNITION OF ARABIC WORDS USING ARTIFICIAL NEURAL NETWORKS

Dr. Sadiq Jassim

Speech recognition has been widely studied by many researchers using different methods to achieve a fast and accurate system. Speech signal recognition is a typical classification problem, which generally includes two main parts: feature extraction and classification. In this paper, a new approach to the speech recognition task is proposed that uses transformation techniques for feature extraction, namely the slantlet transform (SLT) and the discrete wavelet transform (DWT) of type Daubechies Db1 and Db4. Furthermore, a modified artificial neural network (ANN) with the dynamic time warping (DTW) algorithm is developed to train a speech recognition system to be used for classification and recognition purposes. T

Publication Date

Sat Jan 01 2022

Journal Name

Proceedings Of International Conference On Computing And Communication Networks

Speech Gender Recognition Using a Multilayer Feature Extraction Method

Husam

Publication Date

Mon May 01 2023

Journal Name

Indonesian Journal Of Electrical Engineering And Computer Science

Comparison hybrid techniques-based mixed transform using compression and quality metrics

Zainab

Image quality plays a vital role in improving and assessing image compression performance. Image compression converts large image data into a new image with a smaller size suitable for storage and transmission. This paper aims to evaluate the implementation of hybrid techniques based on the tensor product mixed transform. Compression and quality metrics such as compression ratio (CR), rate-distortion (RD), peak signal-to-noise ratio (PSNR), and structural content (SC) are used to evaluate the hybrid techniques. A comparison between the techniques is then made according to these metrics to identify the best technique. The main contribution is to improve the hybrid techniques. The proposed hybrid techniques consist of discrete wavel
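
Three of the metrics named in this abstract can be sketched with their commonly used textbook definitions. The listing does not give the paper's exact formulas, so the definitions here are assumptions: CR as original size over compressed size, PSNR as 10·log10(peak² / MSE), and SC as the ratio of summed squared original pixels to summed squared reconstructed pixels. Images are represented as flat lists of pixel values, and all function names are hypothetical.

```python
import math


def compression_ratio(original_bytes, compressed_bytes):
    """CR: how many times smaller the compressed data is."""
    return original_bytes / compressed_bytes


def psnr(original, reconstructed, peak=255.0):
    """Peak signal-to-noise ratio in dB for equal-length pixel lists."""
    mse = sum((o - r) ** 2 for o, r in zip(original, reconstructed)) / len(original)
    if mse == 0:
        return float("inf")  # identical images: no noise at all
    return 10.0 * math.log10(peak ** 2 / mse)


def structural_content(original, reconstructed):
    """SC: energy of the original relative to the reconstruction (1.0 is ideal)."""
    return sum(o * o for o in original) / sum(r * r for r in reconstructed)


original = [10, 20, 30, 40]
reconstructed = [10, 20, 30, 40]
print(compression_ratio(1000, 250))                 # prints: 4.0
print(psnr(original, reconstructed))                # prints: inf
print(structural_content(original, reconstructed))  # prints: 1.0
```

Higher CR and higher PSNR are both desirable but usually trade off against each other, which is why such comparisons report them together.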

Publication Date

Tue Jul 01 2025

Journal Name

Ain Shams Engineering Journal

Deep neural networks for speech enhancement and speech recognition: A systematic review

Sureshkumar

Syed Abdul

Faisul Arif

Raja

Mohd Khair

Syaril

June Francis

Sadiq H.

Basheera M.

Nurbek

Aigul
