Beyond the immediate content of speech, the voice can provide rich information about a speaker's demographics, including age and gender. Estimating a speaker's age and gender offers a wide range of applications, spanning from voice forensic analysis to personalized advertising, healthcare monitoring, and human-computer interaction. However, pinpointing precise age remains intricate due to age ambiguity. Specifically, utterances from individuals at adjacent ages are frequently indistinguishable. Addressing this, we propose a novel, end-to-end approach that deploys Mozilla's Common Voice dataset to transform raw audio into high-quality feature representations using Wav2Vec2.0 embeddings. These are then channeled into our self-attention-based convolutional neural network (CNN) model. To address age ambiguity, we evaluate the effects of different loss functions such as focal loss and Kullback-Leibler (KL) divergence loss. Additionally, we evaluate the accuracy of the estimation at different durations of speech. Experimental results from the Common Voice dataset underscore the efficacy of our approach, showcasing an accuracy of 87% for male speakers, 91% for female speakers and 89% overall accuracy, and an accuracy of 99.1% for gender prediction.
The sound in the cinema and television occupies a large space in the level of use and expression. In addition to the functional aspect of the elements of the sound such as the dialogue, music, effects and silence, in shaping and supporting the narrative structure of the image in the dramatic work, it has today become and in light of the technical developments of the sound, an aesthetic value in the structure and formulation of the contents and ideas presented in the work. The sound also created a variety of forms before the work-factories in the artistic functioning, which enhances the emotional and expressive dimension of the image, and the researcher, as a result of many new developments in the expression o
... Show MoreThe Sound of the letter (ق) in the Contemporary Arabic Dialects
we conclude that Alaotaby in his Designation, with respect to defects in speech or speech pathology, he cited a number of terms function on speech defects in voice and accent like , Aphasia, Alokla, Alaay, Alramz, Alhasr, Alfadm,and Alaghop, and pointed to the sound stop as a result of an accident or a problem or the speech organ deny the will of the speech, which refers to the refrain defect in sound organic.
He also marked the disorders individual sound caused by the bug of sample and disability among individual like Alokla, node and aphasia - which hinders communication as well as other factors such as irregular sound product and not reporting to be into the future toward the Aljamjamah and whispering, and it can be said that he po
Value Engineering is an analytical study on projects or services using a specific procedure and a multidisciplinary working group, works for the identification and classification of the project functions; either for a better perfuming of these functions or to lessen the total project cost or the two together. Value Engineering main aim is on finding innovative alternatives, without effecting the basic requirements of the project, its methodology based on the functional balancing between the three elements of production "performance, quality and cost". This methodology based on the "functional analysis", had shown high possibilities in solving any problem facing the production procedure , achieve better investment for available re
... Show MoreTargeted current research study of the relationship between guilt and self-consciousness and consisted of the research community of students from the open educational college, as has been selected students in the Department of Counseling and psychological science department and the researcher used guilt, prepared Scale (Ansari, 2003), and the measure of selfawareness prepared (Shammari 0.2000), and extracted his Alsekoumtria characteristics, Fastkhrjt alternatives after a presentation to a group of experts and specialists in the field of psychological counseling psychology, education, science and psychological validity and reliability Alvakronbach manner and retesting reaching reliability coefficient of guilt ((0.85) and awarenes
... Show MoreWords in a language do not exist in isolation but in close connection with each other ,teaming up in one way or another known to the Russian semasiology M. M. Pokrovsky , one of the first to realize the systematic nature of the lexicon, wrote about the second half of the nineteenth century : „the Words and their meanings do not live separate from each other life, but are joined together in our minds), regardless of our consciousness to different groups , and the basis for grouping is the similarity or direct contrast in the main value.
Kinetic and mechanism studies of the oxidation of oxalic acid by Cerium sulphate have been carried out in acid medium sulphuric acid. The uv- vis. Spectrophotometric technique was used to follow up the reaction and the selected wavelength to be followed was 320 nm. The kinetic study showed that the order of reaction is first order in Ce(IV) and fractional in oxalic acid. The effect of using different concentration of sulphuric acid on the rate of the reaction has been studied a and it was found that the rate decreased with increasing the acid concentration. Classical organic tests was used to identify the product of the oxidation reaction, the product was just bubbles of CO2.
In recent days, the escalating need to seamlessly transfer data traffic without discontinuities across the Internet network has exerted immense pressure on the capacity of these networks. Consequently, this surge in demand has resulted in the disruption of traffic flow continuity. Despite the emergence of intelligent networking technologies such as software-defined networking, network cloudification, and network function virtualization, they still need to improve their performance. Our proposal provides a novel solution to tackle traffic flow continuity by controlling the selected packet header bits (Differentiated Services Code Point (DSCP)) that govern the traffic flow priority. By setting the DSCP bits, we can determine the appropriate p
... Show More