Beyond the immediate content of speech, the voice carries rich information about a speaker's demographics, including age and gender. Estimating a speaker's age and gender has a wide range of applications, from forensic voice analysis to personalized advertising, healthcare monitoring, and human-computer interaction. However, pinpointing a precise age remains difficult because of age ambiguity: utterances from individuals of adjacent ages are frequently indistinguishable. To address this, we propose a novel end-to-end approach that uses Wav2Vec2.0 embeddings to transform raw audio from Mozilla's Common Voice dataset into high-quality feature representations. These are then fed into our self-attention-based convolutional neural network (CNN) model. To address age ambiguity, we evaluate the effects of different loss functions, such as focal loss and Kullback-Leibler (KL) divergence loss. Additionally, we evaluate the accuracy of the estimation at different durations of speech. Experimental results on the Common Voice dataset demonstrate the efficacy of our approach, with an accuracy of 87% for male speakers, 91% for female speakers, and 89% overall, and an accuracy of 99.1% for gender prediction.
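The two loss functions named in this abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function names, the `gamma` and `sigma` defaults, and the use of Gaussian-smoothed targets over neighbouring age classes are all assumptions chosen for illustration.

```python
import numpy as np

def softmax(logits):
    # Numerically stable softmax over the last axis.
    z = logits - logits.max(axis=-1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)

def focal_loss(logits, labels, gamma=2.0):
    # Multi-class focal loss: cross-entropy scaled by (1 - p_t)^gamma,
    # which down-weights examples the model already classifies well.
    # gamma=2.0 is an assumed default, not a value from the abstract.
    p = softmax(logits)
    p_t = p[np.arange(len(labels)), labels]
    return float(np.mean(-((1.0 - p_t) ** gamma) * np.log(p_t + 1e-12)))

def soft_age_targets(labels, num_classes, sigma=1.0):
    # Hypothetical helper: spread each hard age label over adjacent
    # classes with a Gaussian, one way to encode age ambiguity.
    bins = np.arange(num_classes)
    w = np.exp(-0.5 * ((bins[None, :] - labels[:, None]) / sigma) ** 2)
    return w / w.sum(axis=-1, keepdims=True)

def kl_loss(logits, targets):
    # KL(targets || softmax(logits)), averaged over the batch.
    p = softmax(logits)
    log_ratio = np.log(targets + 1e-12) - np.log(p + 1e-12)
    return float(np.mean(np.sum(targets * log_ratio, axis=-1)))

# Toy batch: 2 utterances, 3 hypothetical age classes.
logits = np.array([[2.0, 0.5, -1.0], [0.1, 0.2, 0.3]])
labels = np.array([0, 2])
targets = soft_age_targets(labels, num_classes=3)
print(focal_loss(logits, labels), kl_loss(logits, targets))
```

With `gamma=0` the focal loss reduces to ordinary cross-entropy, and the KL loss is zero when the predicted distribution equals the soft target.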
Deep learning (DL) plays a significant role in several tasks, especially classification and prediction. Classification tasks can be handled efficiently by convolutional neural networks (CNNs) given a large dataset, while recurrent neural networks (RNNs) can perform prediction tasks thanks to their ability to retain information from time-series data. In this paper, three models are proposed and evaluated on classification and prediction tasks using four datasets (two for each task). The models are a CNN and two RNN variants: Long Short-Term Memory (LSTM) and Gated Recurrent Unit (GRU). Each model is applied in turn to the two tasks to draw a road map of deep learning mod
Dr. Qahtan Al-Madfa’i’s architecture is marked by a feature that may be at once unique and extreme: the use of distinctive three-dimensional structural coverings and the exploitation of structural construction to add an aesthetic touch to the composition of the building, in service of the universal ideas he strongly believed in and defended.
The marked urban decline that the country is now undergoing urges us to compare the beginnings of modern Iraqi architecture, its ascent to its peak, and the periods of its decline until it reached a very
Recommendation systems are now used to address the problem of information overload in several sectors, such as entertainment, social networking, and e-commerce. Although conventional approaches to recommendation have achieved significant success in providing item suggestions, they still face many challenges, including the cold-start problem and data sparsity. Numerous recommendation models have been created to address these difficulties. Nevertheless, including user- or item-specific information has the potential to enhance recommendation performance. The ConvFM model is a novel convolutional neural network architecture that combines the capabilities of deep learning for feature extraction with the effectiveness o
Piracy of phonograms is now, rightly, regarded as the crime of the electronic age. Despite the protection that States have sought to provide for such recordings, whether through national legislation or international agreements and conventions, piracy has been and continues to be a significant threat to the rights of the producers of those recordings, especially as it is a profitable way for hackers to make large sums of money illegally, which is contrary to the rules of legitimate competition. Hence, this research highlights the legal protection of producers of phonograms in light of the Iraqi Copyright Protection Act No. (3) of 1971, as amended.
Two prevalent neurodevelopmental disorders in children are attention deficit hyperactivity disorder (ADHD) and autism spectrum disorder (ASD). The fifth edition of the Diagnostic and Statistical Manual of Mental Disorders describes autism as a condition marked by limitations in social communication as well as restricted, repetitive behavior patterns, while impulsivity, hyperactivity, and lack of concentration are signs of attention deficit hyperactivity disorder. Boys experience it more frequently than girls do. This study sought possible factors that put children at risk for autism and attention deficit hyperactivity disorder, and it investigated the association between neurodevelopmental disorders in children and parental risk factor i
Interest in children with attention deficit hyperactivity disorder has grown because of its prevalence among children of primary-school age, with reported rates ranging from 3% to 20%, most of them boys. It occurs across all social classes of these children's families. Moreover, the problems associated with it do not end with childhood and often extend into adolescence; Weiss & Hechtman (1989) found that there are signs
Automatic speaker recognition can achieve remarkable performance when training and test conditions are matched. Conversely, results drop significantly in mismatched, noisy conditions. Furthermore, feature extraction significantly affects performance. Mel-frequency cepstral coefficients (MFCCs) are the most commonly used features in this field of study. The literature has reported that the conditions for training and testing are highly correlated. Taken together, these facts support strong recommendations for using MFCC features in similar environmental conditions (train/test) for speaker recognition. However, when noise and reverberation are present, MFCC performance is not reliable. To address this, we propose a new feature 'entrocy' for accurate and robu
This paper investigates the effect of the internal curing technique on the properties of self-compacting concrete (SCC). In this study, SCC is produced using silica fume (SF) as a partial replacement of cement at 5% by weight. Sand is partially replaced by volume with saturated fine lightweight aggregate (LWA), namely thermostone chips, as the internal curing material at three percentages (5%, 10% and 15%), under two external curing conditions, water and air. The experimental work was divided into three parts: in the first part, the workability tests of fresh SCC were conducted. The second part included conducting the compressive strength test and the modulus of rupture test at ages of 7, 28 and 90 days. The third part i