Beyond the immediate content of speech, the voice can provide rich information about a speaker's demographics, including age and gender. Estimating a speaker's age and gender offers a wide range of applications, spanning from voice forensic analysis to personalized advertising, healthcare monitoring, and human-computer interaction. However, pinpointing precise age remains intricate due to age ambiguity. Specifically, utterances from individuals at adjacent ages are frequently indistinguishable. Addressing this, we propose a novel, end-to-end approach that deploys Mozilla's Common Voice dataset to transform raw audio into high-quality feature representations using Wav2Vec2.0 embeddings. These are then channeled into our self-attention-based convolutional neural network (CNN) model. To address age ambiguity, we evaluate the effects of different loss functions such as focal loss and Kullback-Leibler (KL) divergence loss. Additionally, we evaluate the accuracy of the estimation at different durations of speech. Experimental results from the Common Voice dataset underscore the efficacy of our approach, showcasing an accuracy of 87% for male speakers, 91% for female speakers and 89% overall accuracy, and an accuracy of 99.1% for gender prediction.
Gender classification is a critical task in computer vision. This task holds substantial importance in various domains, including surveillance, marketing, and human-computer interaction. In this work, the face gender classification model proposed consists of three main phases: the first phase involves applying the Viola-Jones algorithm to detect facial images, which includes four steps: 1) Haar-like features, 2) Integral Image, 3) Adaboost Learning, and 4) Cascade Classifier. In the second phase, four pre-processing operations are employed, namely cropping, resizing, converting the image from(RGB) Color Space to (LAB) color space, and enhancing the images using (HE, CLAHE). The final phase involves utilizing Transfer lea
... Show MoreThe purpose of this study is to explore whether the adoption of Beyond Budgeting (BB) as a management accounting practice (MAP) contributes to developing intellectual capital (IC) and creating value in Iraqi companies. This requires an understanding of the views of the Iraqi managers about the nature of the information provided by this practice, which may be used to determine whether this information is relevant in the management of IC in the context of Iraq. This research aims also to explore the challenges of the adoption of the BB in planning and controlling IC in Iraq. The study adopts a qualitative approach and an interpretive paradigm. It also adopts a semi-structured interview method of collecting data from executive managers
... Show MoreMany problems were encountered during the drilling operations in Zubair oilfield. Stuckpipe, wellbore instability, breakouts and washouts, which increased the critical limits problems, were observed in many wells in this field, therefore an extra non-productive time added to the total drilling time, which will lead to an extra cost spent. A 1D Mechanical Earth Model (1D MEM) was built to suggest many solutions to such types of problems. An overpressured zone is noticed and an alternative mud weigh window is predicted depending on the results of the 1D MEM. Results of this study are diagnosed and wellbore instability problems are predicted in an efficient way using the 1D MEM. Suitable alternative solutions are presented
... Show MoreDr. Qahtan Al-Madfa’i’s architecture has been characterized by a particular characteristic that may be unique and extreme at the same time, that is the use of the distinctive three-dimensional structural coverings and the exploitation of structural construction to give an extra aesthetic touch to the composition of the building, to achieve the application of his universal ideas, which he strongly believed and defended.
In the period of the marked urban decline that the country undergoes now, which urges us toward making a comparison between the beginning of the modern Iraqi architecture and its ascending path up to its peak and the periods of its decline until it reached a very
... Show MoreMicro-perforated panel (MPP) absorber is increasingly gaining popularity as an alternative sound absorber in buildings compared to the well-known synthetic porous materials. A single MPP has a typical feature of a Helmholtz resonator with a high amplitude of absorption but a narrow absorption frequency bandwidth. To improve the bandwidth, a single MPP can be cascaded with another single MPP to form a double-layer MPP. This paper proposes the introduction of inhomogeneous perforation in the double-layer MPP system (DL-iMPP) to enhance the absorption bandwidth of a double-layer MPP. Mathematical models are proposed using the equivalent electrical circuit model and are validated with experiments with good agreement. It is revealed that the DL-
... Show MorePalm vein recognition technology is a one of the most effective biometric technologies for personal identification. Palm acquisition techniques are either contact-based or contactless-based. The contactless-based palm vein system is considered more accurate and efficient when used in modern applications, but it may suffer from problems like pose variations and the delay in the matching process. This paper proposes a contactless-based identification system for palm vein that involves two main steps; First, the central region of the palm is cropped using fast extract region of interest algorithm, then the features are extracted and classified using altered structure of Residual Attention Network, which is a developed version of convolution
... Show MoreElastic electron scattering form factors, charge density distributions and charge,
neutron and matter root mean square (rms) radii for P
24
PMg, P
28
PSi and P
32
PS nuclei are
studied using the effect of occupation numbers. Single-particle radial wave functions
of harmonic-oscillators (HO) potential are used. In general, the results of elastic
charge form factors showed good agreement with experimental data. The occupation
numbers are taken to reproduce the quantities mentioned above. The inclusion of
occupation numbers enhances the form factors to become closer to the data. For the
calculated charge density distributions, the results show good agreement with
experimental data except the fail to
Speech is the first invented way of communication that human used age before the invention of writing. In this paper, proposed method for speech analyses to extract features by using multiwavelet Transform (Repeated Row Preprocessing).The proposed system depends on the Euclidian differences of the coefficients of the multiwavelet Transform to determine the beast features of speech recognition. Each sample value in the reference file is computed by taking the average value of four samples for the same data (four speakers for the same phoneme). The result of the input data to every frame value in the reference file using the Euclidian distance to determine the frame with the minimum distance is said to be the "Best Match". Simulatio
... Show More