BEYOND WORDS: HARNESSING SPEECH SOUND FOR SPEAKER AGE AND GENDER DETECTION USING 1D CNN ARCHITECTURE WITH SELF-ATTENTION MECHANISM

Umniah Hameed Jaid

doi:10.5455/jjcit.71-1703265368

Details

Publication Date

Mon Jan 01 2024

Journal Name

Jordanian Journal Of Computers And Information Technology

Beyond the immediate content of speech, the voice can provide rich information about a speaker's demographics, including age and gender. Estimating a speaker's age and gender has a wide range of applications, from voice forensic analysis to personalized advertising, healthcare monitoring, and human-computer interaction. However, estimating exact age remains difficult because of age ambiguity: utterances from speakers of adjacent ages are frequently indistinguishable. To address this, we propose a novel end-to-end approach that uses Wav2Vec2.0 embeddings to transform raw audio from Mozilla's Common Voice dataset into high-quality feature representations. These are then fed into our self-attention-based convolutional neural network (CNN) model. To address age ambiguity, we evaluate the effects of different loss functions, such as focal loss and Kullback-Leibler (KL) divergence loss. Additionally, we evaluate the accuracy of the estimation at different durations of speech. Experimental results on the Common Voice dataset demonstrate the efficacy of our approach, with an accuracy of 87% for male speakers, 91% for female speakers, and 89% overall, and an accuracy of 99.1% for gender prediction.
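
The two loss functions named in the abstract can be illustrated with a minimal, dependency-free sketch. It is not the authors' implementation: the function names, the `gamma` default, and in particular the Gaussian-smoothed soft age label used as the KL target are assumptions chosen for illustration, since the abstract does not say how the target distribution is built.

```python
import math

def softmax(logits):
    """Convert raw class scores into probabilities (numerically stable)."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]

def focal_loss(logits, target, gamma=2.0):
    """Focal loss for one sample: cross-entropy scaled by (1 - p_t)^gamma,
    so confidently classified samples contribute less to the loss."""
    p_t = softmax(logits)[target]
    return -((1.0 - p_t) ** gamma) * math.log(p_t)

def soft_age_label(target, num_classes, sigma=1.0):
    """ASSUMPTION: spread the label over neighbouring age classes with a
    Gaussian, so adjacent ages are penalized less than distant ones."""
    weights = [math.exp(-((k - target) ** 2) / (2 * sigma ** 2))
               for k in range(num_classes)]
    total = sum(weights)
    return [w / total for w in weights]

def kl_divergence_loss(logits, soft_target, eps=1e-12):
    """KL(soft_target || softmax(logits)) for one sample."""
    probs = softmax(logits)
    return sum(q * math.log((q + eps) / (p + eps))
               for q, p in zip(soft_target, probs) if q > 0)

# Hypothetical usage with 8 age classes and true class 3.
scores = [0.2, 0.1, 1.5, 2.0, 1.4, 0.3, 0.0, -0.5]
print(focal_loss(scores, target=3))
print(kl_divergence_loss(scores, soft_age_label(3, num_classes=8)))
```

With `gamma=0` the focal loss reduces to ordinary cross-entropy, which is a convenient sanity check when comparing the two.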

Publication Date

Tue Dec 01 2009

Journal Name

Iraqi Journal Of Physics

Laplacian Operator as Speaker Identification Parameter

Speaker Identification Parameter

S. K.

A new feature for speaker identification tests, extracted from the differentiated form of the wave file, is presented. Differentiation is performed by an operator similar to the Laplacian operator. From the differentiated records, two parametric measures are extracted and used as identifiers for the speaker: the mean value and the number of zero-crossing points.
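
The two measures described above can be sketched in a few lines of Python. The abstract only says the operator is "similar to the Laplacian", so the discrete second difference with kernel [1, -2, 1] used here is an assumption, as are the function names.

```python
def laplacian_like(signal):
    """ASSUMPTION: differentiate with the 1D discrete Laplacian kernel
    [1, -2, 1]; the paper's exact operator is not given in the abstract."""
    return [signal[i - 1] - 2 * signal[i] + signal[i + 1]
            for i in range(1, len(signal) - 1)]

def mean_value(values):
    """Arithmetic mean of the differentiated record."""
    return sum(values) / len(values)

def zero_crossings(values):
    """Count sign changes, skipping samples that are exactly zero."""
    count = 0
    prev = 0
    for v in values:
        if v == 0:
            continue
        if prev != 0 and (v > 0) != (prev > 0):
            count += 1
        prev = v
    return count

# Hypothetical usage on a short toy waveform.
wave = [0, 1, 0, -1, 0, 1, 0, -1, 0]
diff = laplacian_like(wave)
print(mean_value(diff), zero_crossings(diff))
```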

Publication Date

Sun Jun 06 2010

Journal Name

Baghdad Science Journal

Charge density distributions for odd-A of 2s-1d shell nuclei

charge density distributions for odd-A 2s-1d shell nuclei

root mean square charge radii

occupation numbers of higher states for and nuclei

Altaf Abdul Majeed

An analytical expression for the charge density distributions is derived from the occupation numbers of the states and the single-particle wave functions of the harmonic oscillator potential, with size parameters chosen to reproduce the observed root mean square charge radii for all considered nuclei. The derived expression, which is applicable throughout the whole region of 2s-1d shell nuclei, has been employed in calculations of the charge density distributions for odd-A 2s-1d shell nuclei, such as and nuclei. It is found that introducing additional parameters, namely and which reflect the difference of the occupation numbers of the states from the prediction of the simple shell model leads to obtain a remarkabl

Publication Date

Tue May 07 2019

Journal Name

Acm Journal On Emerging Technologies In Computing Systems

Neuromemristive Architecture of HTM with On-Device Learning and Neurogenesis

Abdullah M.

Dhireesha

Hierarchical temporal memory (HTM) is a biomimetic sequence memory algorithm that holds promise for invariant representations of spatial and spatio-temporal inputs. This article presents a comprehensive neuromemristive crossbar architecture for the spatial pooler (SP) and the sparse distributed representation classifier, which are fundamental to the algorithm. There are several unique features in the proposed architecture that tightly link with the HTM algorithm. A memristor that is suitable for emulating the HTM synapses is identified and a new Z-window function is proposed. The architecture exploits the concept of synthetic synapses to enable potential synapses in the HTM. The crossbar for the SP avoids dark spots caused by unutil

Publication Date

Sat Feb 02 2019

Journal Name

Journal Of The College Of Education For Women

Poetical Words

أ.د .داود

Publication Date

Tue Dec 05 2023

Journal Name

Baghdad Science Journal

AlexNet Convolutional Neural Network Architecture with Cosine and Hamming Similarity/Distance Measures for Fingerprint Biometric Matching

Biometric Cryptosystem

Convolutional Neural Network

Cosine Similarity

Fingerprint Matching

Information Security

Ahmed Sabah Ahmed

Huda Kadhim

Abeer

In information security, fingerprint verification is one of the most common recent approaches for verifying human identity through a distinctive pattern. The verification process compares a pair of fingerprint templates and measures the similarity between them. Several studies have used different techniques for the matching process, such as fuzzy vault and image filtering approaches. Yet these approaches still suffer from imprecise representation of the distinctive patterns in the biometrics. Deep learning architectures such as the Convolutional Neural Network (CNN) have been used extensively for image processing and object detection tasks and have shown outstanding performance compare
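
The two measures named in the title can be sketched for a pair of feature vectors as follows. This is an illustrative sketch only: the feature values are made up, and thresholding real-valued features at zero to obtain bit strings for the Hamming distance is an assumption, since the truncated abstract does not describe how templates are encoded.

```python
import math

def cosine_similarity(a, b):
    """Cosine of the angle between two feature vectors (1.0 = same direction)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # convention chosen here for all-zero vectors
    return dot / (norm_a * norm_b)

def binarize(vector, threshold=0.0):
    """ASSUMPTION: turn real-valued features into bits by thresholding."""
    return [1 if x > threshold else 0 for x in vector]

def hamming_distance(bits_a, bits_b):
    """Number of positions at which two equal-length bit strings differ."""
    if len(bits_a) != len(bits_b):
        raise ValueError("bit strings must have the same length")
    return sum(x != y for x, y in zip(bits_a, bits_b))

# Hypothetical feature vectors for two fingerprint templates.
template_a = [0.5, -1.0, 2.0, 0.3]
template_b = [0.1, 3.0, -2.0, 0.4]
print(cosine_similarity(template_a, template_b))
print(hamming_distance(binarize(template_a), binarize(template_b)))
```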

Publication Date

Sat Sep 30 2017

Journal Name

College Of Islamic Sciences

Significant words Collect and study

د.عمار عيسى

The aim of this research is to collect the semantically restricted vocabulary from the general lexicon and arrange it in a single ordered collection with an in-depth study. This study is important for determining the exact meanings of words in the language. Our purpose is to reveal this phenomenon, which shows the precision of Arabic in denoting meanings. The research covers more than sixty-seven words extracted from glossaries and books on language. We ask God for soundness of intent and of judgment.

Publication Date

Fri Mar 29 2024

Journal Name

Iraqi Journal Of Science

Evaluating the Performance and Behavior of CNN, LSTM, and GRU for Classification and Prediction Tasks

Hasanen S.

Nada Hussain

Nada A.Z.

Deep learning (DL) plays a significant role in several tasks, especially classification and prediction. Classification tasks can be handled efficiently by convolutional neural networks (CNN) given a large dataset, while recurrent neural networks (RNN) can perform prediction tasks because of their ability to retain information from time series data. In this paper, three models are evaluated on classification and prediction tasks using four datasets (two for each task). These models are a CNN and two RNN variants: Long Short-Term Memory (LSTM) and Gated Recurrent Unit (GRU). Each model is applied in turn to the two tasks to draw a road map of deep learning mod

Publication Date

Sat Mar 02 2019

Journal Name

Biochem. Cell. Arch.

EVALUATION OF PRIMARY IMPLANTS STABILITY IN IMMEDIATE AND DELAYED TREATMENT PROTOCOLS ACCORDING TO BONE DENSITY, JAWS, GENDER AND AGE UTILIZING PERIOTEST M DEVICE

Sahad

Maha

Saif

Publication Date

Mon Oct 20 2025

Journal Name

Journal Of Baghdad College Of Dentistry

Effect of gender, age and tooth loss on the dimensions of incisive canal, and buccal bone anterior to the canal (Computed Tomography study)

Ryaheen G

Ahlam A

Background: The incisive canal is an anatomical structure with an important location in the anterior maxilla; analyzing this canal and its relation to the bone anterior to it is necessary during dental implant placement. The aim of this study is to evaluate the effect of gender, age, and tooth loss in the area of the maxillary central incisors on the dimensions of the incisive canal and the buccal bone anterior to the canal using spiral computed tomography. Materials and Methods: The sample of this prospective study consisted of 156 subjects of both genders, divided into two groups: a dentate group of 120 (60 male and 60 female) with age ranging from (20-70) and an edentate group of 36 (with missing maxillary central incisors) (18 male and 18 female) with age ranging from (50-70

Publication Date

Wed Mar 15 2023

Journal Name

Journal Of The Turkish-German Gynecological Association

Obstetric and neonatal complications in large for gestational age pregnancy with late gestational diabetes

Hyperglycemia

macrosomia

third-trimester gestational diabetes

Shaymaa Kadhim

Hayder

Zina Ismaiel

Rand
