BEYOND WORDS: HARNESSING SPEECH SOUND FOR SPEAKER AGE AND GENDER DETECTION USING 1D CNN ARCHITECTURE WITH SELF-ATTENTION MECHANISM

Umniah Hameed jaid

doi:10.5455/jjcit.71-1703265368

Details

Publication Date

Mon Jan 01 2024

Journal Name

Jordanian Journal Of Computers And Information Technology

DOI

10.5455/jjcit.71-1703265368

Beyond the immediate content of speech, the voice can provide rich information about a speaker's demographics, including age and gender. Estimating a speaker's age and gender has a wide range of applications, from voice forensic analysis to personalized advertising, healthcare monitoring, and human-computer interaction. However, pinpointing a precise age remains difficult because of age ambiguity: utterances from individuals of adjacent ages are frequently indistinguishable. To address this, we propose a novel end-to-end approach that transforms raw audio from Mozilla's Common Voice dataset into high-quality feature representations using Wav2Vec2.0 embeddings. These are then fed into our self-attention-based convolutional neural network (CNN) model. To address age ambiguity, we evaluate the effects of different loss functions, such as focal loss and Kullback-Leibler (KL) divergence loss. Additionally, we evaluate the accuracy of the estimation at different speech durations. Experimental results on the Common Voice dataset demonstrate the efficacy of our approach, with an accuracy of 87% for male speakers, 91% for female speakers, and 89% overall, and an accuracy of 99.1% for gender prediction.
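The two loss functions named in the abstract can be illustrated with a minimal pure-Python sketch. This is not the authors' implementation. The function names, the number of age bins, the `gamma` value, and the Gaussian-smoothed target distribution are all assumptions made for illustration; the abstract does not say how the KL target distribution is built.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of raw scores."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def focal_loss(logits, target, gamma=2.0):
    """Focal loss for one example: -(1 - p_t)^gamma * log(p_t).

    With gamma = 0 this reduces to ordinary cross-entropy.
    """
    p_t = softmax(logits)[target]
    return -((1.0 - p_t) ** gamma) * math.log(p_t)


def soft_age_target(true_bin, n_bins, sigma=1.0):
    """Hypothetical soft label: a Gaussian centred on the true age bin,
    so that neighbouring age bins receive some probability mass."""
    weights = [math.exp(-0.5 * ((k - true_bin) / sigma) ** 2) for k in range(n_bins)]
    total = sum(weights)
    return [w / total for w in weights]


def kl_divergence_loss(logits, target_dist, eps=1e-12):
    """KL(target || predicted) for one example."""
    pred = softmax(logits)
    return sum(
        t * math.log((t + eps) / (p + eps))
        for t, p in zip(target_dist, pred)
        if t > 0
    )


# Assumed setup: 6 age bins, true age in bin 2.
target = soft_age_target(true_bin=2, n_bins=6)
near_miss = [0.0, 0.0, 0.0, 4.0, 0.0, 0.0]  # model prefers the adjacent bin
far_miss = [0.0, 0.0, 0.0, 0.0, 0.0, 4.0]   # model prefers a distant bin
print(kl_divergence_loss(near_miss, target) < kl_divergence_loss(far_miss, target))
```

With a soft target of this kind, a prediction that lands on an adjacent age bin is penalized less than one that lands far away, which is one plausible way a KL-based loss could tolerate age ambiguity. Focal loss instead down-weights examples the model already classifies confidently.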


Publication Date

Sat Jan 31 2026

Journal Name

International Journal Of Intelligent Engineering And Systems

Low-complexity Deep Learning for Joint Channel-type Identification and SNR Estimation in MIMO-OFDM Using CNN–BRNN with LUT Labels

Deep learning

MIMO-OFDM

Channel estimation

SNR estimation

CNN–BRNN

Alamouti

Space time block code

Quasi-orthogonal STBC (QO-STBC)

Yasmine

Furat N.


Channel estimation (CE) is essential for wireless links but becomes progressively onerous as Fifth Generation (5G) Multi-Input Multi-Output (MIMO) systems and extensive fading expand the search space and increase latency. This study redefines CE support as the process of learning to deduce channel type and signal-to-noise ratio (SNR) directly from per-tone Orthogonal Frequency-Division Multiplexing (OFDM) observations, with blind channel state information (CSI). We trained a dual deep model that combined Convolutional Neural Networks (CNNs) with Bidirectional Recurrent Neural Networks (BRNNs). We used a lookup table (LUT) label for channel type (class indices instead of per-tap values) and ordinal supervision for SNR (0–20 dB, 5-dB steps). T


Publication Date

Mon Jan 01 2024

Journal Name

Fifth International Conference On Applied Sciences: Icas2023

A modified Mobilenetv2 architecture for fire detection systems in open areas by deep learning

Muthanna S.

Nada A. Z.

Amel H.


This research describes a new model, inspired by Mobilenetv2, that was trained on a highly diverse dataset. The goal is to use deep learning to detect fires in open areas, replacing physical sensor-based fire detectors, reducing false fire alarms, and minimizing losses. A diverse fire dataset was created by combining images and videos from several sources. In addition, a self-collected dataset was gathered from the farms of the holy shrine of Al-Hussainiya in the city of Karbala. The model was then trained on the collected dataset. The new model reached a test accuracy of 98.87% on the fire dataset.

Publication Date

Mon Jun 26 2023

Journal Name

International Conference On Scientific Research & Innovation (icsri 2022)

Age and gender profile of coronavirus disease 2019 (COVID 19) in Quarantine Center in Baghdad, Iraq

COVID-19

Age

Gender

Recovered

Deceased

Maysaa



Fadhaa O.

Sinai W.

Hanan J.

Faheema J. Abo

Fadhaa O.


Publication Date

Sun Apr 07 2013

Journal Name

Journal Of Educational And Psychological Researches

Social skills and their relationship to self-concept in kindergarten children aged (4-6) years

social skills

self-concept

kindergarten children

سميرة عبد الحسين كاظم

نجلاء فاضل رحيم


Social skills and self-concept are of great importance to many scholars of education and psychology and have featured prominently in their writings. These scholars hold that social skills training supports self-assertion: the more an individual is able to acquire social skills, the more that individual asserts himself or herself. The research aims to identify social skills and self-concept, and the relationship between them, among kindergarten children aged 4-6 years. The research sample consisted of 200 boys and girls from kindergartens on both sides of the city of Baghdad (Rusafa Second and Karkh Second). To achieve the research objectives, the researcher built two measures of social skills a


Publication Date

Sat Sep 30 2023

Journal Name

Wasit Journal Of Computer And Mathematics Science

Real time handwriting recognition system using CNN algorithms

Maryam


Abstract: The growing use of digital technologies across various sectors and daily activities has made handwriting recognition a popular research topic. Because handwriting remains relevant, people still need to convert handwritten copies into digital versions that can be stored and shared. Handwriting recognition is the computer's ability to identify and understand legible handwritten input from various sources, including documents, photographs, and others. Handwriting recognition poses a complex challenge because handwriting styles differ among individuals, especially in real-time applications. In this paper, an automatic system was designed for handwriting recognition


Publication Date

Sun Mar 01 2015

Journal Name

Baghdad Science Journal

The Effect of Age and Gender on Fetuin-A and Some Biochemical parameters in Blood Sera of Iraqi patients with T2DM: A comparative study

Fetuin-A

HbA1c

Lipid profile

Protein

Albumin

Globulin

Walla E.

Amal R.


The serum protein test measures the level of total protein (albumin and globulin). Fetuin-A is a blood protein made in the liver. It can inhibit the insulin receptor, impair insulin sensitivity, and make individuals more likely to develop type 2 diabetes, followed by disorders in the lipid profile: total cholesterol (TC), low-density lipoprotein cholesterol (LDL-c), high-density lipoprotein cholesterol (HDL-c), triglyceride (TG), and very-low-density lipoprotein cholesterol (VLDL-c). Fetuin-A, total protein, albumin, globulin, HbA1c, and lipid profile were evaluated in 200 adult and elderly Iraqi patients with type 2 diabetes mellitus and compared with 200 healthy control subjects. The laboratory analysis (for patients and


Publication Date

Wed Oct 09 2024

Journal Name

Engineering, Technology & Applied Science Research

Improving Pre-trained CNN-LSTM Models for Image Captioning with Hyper-Parameter Optimization

CNN pre-trained models

LSTM

activation function

hyper-parameters

overfitting

Nuha M.

Nada


Image captioning, the automatic generation of text that describes an image's visual information, has become feasible with developments in object recognition and image classification. Deep learning has received much interest from the scientific community and can be very useful in real-world applications. The proposed image captioning approach uses pre-trained Convolutional Neural Network (CNN) models combined with Long Short-Term Memory (LSTM) to generate image captions. The process includes two stages. The first stage entails training the CNN-LSTM models using baseline hyper-parameters and the second stage encompasses training CNN-LSTM models by optimizing and adjusting the hyper-parameters of


Publication Date

Fri Mar 06 2026

Journal Name

Journal Of Baghdad College Of Dentistry

Computed tomographic measurement of maxillary sinus volume and dimension in correlation to the age and gender (comparative study among individuals with dentate and edentulous maxilla)

Hussein H

Jamal A


Background: Despite development and progress in various diagnostic methods, identification of skeletal remnants and decomposing human parts is still one of the most difficult tasks in forensic medicine. Gender and age estimation is also considered an important problem in the identification of an unknown skull. Aims of the study: To estimate the volume and dimensions of the maxillary sinus in individuals with dentate and edentulous maxillae using CT scans, and to correlate maxillary sinus volume with gender and age. Materials and Methods: This study included 120 patients aged 40-69 years, divided into two groups, a dentate group with a fully dentate maxilla and an edentulous group with a completely edentulous maxilla, and e


Publication Date

Thu May 21 2020

Journal Name

Journal Of Strategic Research In Social Science (josress)

A Pragmatic Study of Gender Differences in the Use of Speech Acts in Selected Suicide Notes

Bushra


DBNRSK Sayed, Journal of Strategic Research in Social Science (JoSReSS), 2020
