Spin-Image Descriptors for Text-Independent Speaker Recognition

Suhaila N. Mohammed

doi:10.1007/978-3-030-33582-3_21

Details

Publication Date

Sat Nov 02 2019

Journal Name

Advances In Intelligent Systems And Computing

DOI

10.1007/978-3-030-33582-3_21

Choose Citation Style

Statistics

View publication

10

Statistics

(7)

(2)

Spin-Image Descriptors for Text-Independent Speaker Recognition

Suhaila N. Mohammed

...Show More Authors

Building a system to identify individuals through their speech recording can find its application in diverse areas, such as telephone shopping, voice mail and security control. However, building such systems is a tricky task because of the vast range of differences in the human voice. Thus, selecting strong features becomes very crucial for the recognition system. Therefore, a speaker recognition system based on new spin-image descriptors (SISR) is proposed in this paper. In the proposed system, circular windows (spins) are extracted from the frequency domain of the spectrogram image of the sound, and then a run length matrix is built for each spin, to work as a base for feature extraction tasks. Five different descriptors are generated from the run length matrix within each spin and the final feature vector is then used to populate a deep belief network for classification purpose. The proposed SISR system is evaluated using the English language Speech Database for Speaker Recognition (ELSDSR) database. The experimental results were achieved with 96.46 accuracy; showing that the proposed SISR system outperforms those reported in the related current research work in terms of recognition accuracy.

View Publication

Publication Date

Fri Jan 01 2010

Journal Name

Thesis

Design and Implementation proposed Encoding and Hiding Text in an Image

Cryptography

RSA

Steganography

Digital signature

Nada Abdul Aziz

...Show More Authors

NAA Mustafa, University of Sulaimani, Ms. c Thesis, 2010 - Cited by 4

View Publication

Publication Date

Thu Dec 01 2016

Journal Name

2016 Ieee Symposium Series On Computational Intelligence (ssci)

A fusion of time-domain descriptors for improved myoelectric hand control

Rami N.

Ahmed

Ali

Adel

...Show More Authors

View Publication

(40)

(34)

Publication Date

Mon Aug 01 2016

Journal Name

2016 38th Annual International Conference Of The Ieee Engineering In Medicine And Biology Society (embc)

Myoelectric feature extraction using temporal-spatial descriptors for multifunction prosthetic hand control

Rami N.

Ali

Ahmed

Adel

...Show More Authors

View Publication

(11)

(9)

Publication Date

Sun Dec 06 2009

Journal Name

Baghdad Science Journal

Mean-field Solution of the mixed spin-1 and spin-5/2Ising system with different single-ion anisotropies

"mixed-spin Ising model

ferrimagnet

single-ion anisotropy

compensation point. "

Fuad. T.

...Show More Authors

The mixed-spin ferrimagnetic Ising system consists of two-dimensional sublattices A and B with spin values and respectively .By used the mean-field approximation MFA of Ising model to find magnetism( ).In order to determined the best stabile magnetism , Gibbs free energy employ a variational method based on the Bogoliubov inequality .The ground-state (Phase diagram) structure of our system can easily be determined at , we find six phases with different spins values depend on the effect of a single-ion anisotropies .these lead to determined the second , first orders transition ,and the tricritical points as well as the compensation phenomenon .

View Publication Preview PDF

Publication Date

Tue Dec 01 2009

Journal Name

Iraqi Journal Of Physics

Laplacian Operator as Speaker Identification Parameter

Speaker Identification Parameter

S. K.

...Show More Authors

New speaker identification test’s feature, extracted from the differentiated form of the wave file, is presented. Differentiation operation is performed by an operator similar to the Laplacian operator. From the differentiated record’s, two parametric measures have been extracted and used as identifiers for the speaker; i.e. mean-value and number of zero-crossing points.

View Publication Preview PDF

Publication Date

Sun Jun 06 2010

Journal Name

Baghdad Science Journal

Using Neural Network with Speaker Applications

"Speaker recognition

data enhancement

MLP"

Alaa noori

Samira faris

...Show More Authors

In Automatic Speech Recognition (ASR) the non-linear data projection provided by a one hidden layer Multilayer Perceptron (MLP), trained to recognize phonemes, and has previous experiments to provide feature enhancement substantially increased ASR performance, especially in noise. Previous attempts to apply an analogous approach to speaker identification have not succeeded in improving performance, except by combining MLP processed features with other features. We present test results for the TIMIT database which show that the advantage of MLP preprocessing for open set speaker identification increases with the number of speakers used to train the MLP and that improved identification is obtained as this number increases beyond sixty.

View Publication Preview PDF

Publication Date

Sat Dec 30 2023

Journal Name

Traitement Du Signal

Optimizing Acoustic Feature Selection for Estimating Speaker Traits: A Novel Threshold-Based Approach

Umniah

...Show More Authors

View Publication

(1)

Publication Date

Sun Dec 04 2011

Journal Name

Baghdad Science Journal

Modifying Hebbian Network for Text Cipher

Hebbian Network

Neural Network

Text Security.

Noor Adnan

...Show More Authors

The objective of this work is to design and implement a cryptography system that enables the sender to send message through any channel (even if this channel is insecure) and the receiver to decrypt the received message without allowing any intruder to break the system and extracting the secret information. This work modernize the feedforward neural network, so the secret message will be encrypted by unsupervised neural network method to get the cipher text that can be decrypted using the same network to get the original text. The security of any cipher system depends on the security of the related keys (that are used by the encryption and the decryption processes) and their corresponding lengths. In this work, the key is the final weights

View Publication Preview PDF

Publication Date

Thu Jan 01 2015

Journal Name

Journal Of Theoretical And Applied Information Technology

Graph based text representation for document clustering

Text Representation Schemes

Dependency Graph

Document Clustering

Sparsity Problem

Semantic Problem.

Asma Khazaal Abdulsahib

SITI SAKIRA KAMARUDDIN

...Show More Authors

Advances in digital technology and the World Wide Web has led to the increase of digital documents that are used for various purposes such as publishing and digital library. This phenomenon raises awareness for the requirement of effective techniques that can help during the search and retrieval of text. One of the most needed tasks is clustering, which categorizes documents automatically into meaningful groups. Clustering is an important task in data mining and machine learning. The accuracy of clustering depends tightly on the selection of the text representation method. Traditional methods of text representation model documents as bags of words using term-frequency index document frequency (TFIDF). This method ignores the relationship an

Preview PDF

(15)

Publication Date

Sat Jan 01 2022

Journal Name

International Journal Of Nonlinear Analysis And Applications

Human recognition by utilizing voice recognition and visual recognition

Deep learning Convolutional Neural Networks Human Recognition voice recognition visual recognition

Sukaina

Samera

Mahir

...Show More Authors

Audio-visual detection and recognition system is thought to become the most promising methods for many applications includes surveillance, speech recognition, eavesdropping devices, intelligence operations, etc. In the recent field of human recognition, the majority of the research be- coming performed presently is focused on the reidentification of various body images taken by several cameras or its focuses on recognized audio-only. However, in some cases these traditional methods can- not be useful when used alone such as in indoor surveillance systems, that are installed close to the ceiling and capture images right from above in a downwards direction and in some cases people don't look straight the cameras or it cannot be added in some

View Publication Preview PDF

1 2 3 4 ... 663 664 665 666