When optimizing the performance of neural network-based chatbots, determining the optimizer is one of the most important aspects. Optimizers primarily control the adjustment of model parameters such as weight and bias to minimize a loss function during training. Adaptive optimizers such as ADAM have become a standard choice and are widely used for their invariant parameter updates' magnitudes concerning gradient scale variations, but often pose generalization problems. Alternatively, Stochastic Gradient Descent (SGD) with Momentum and the extension of ADAM, the ADAMW, offers several advantages. This study aims to compare and examine the effects of these optimizers on the chatbot CST dataset. The effectiveness of each optimizer is evaluated based on its sparse-categorical loss during training and BLEU in the inference phase, utilizing a neural generative attention-based additive scoring function. Despite memory constraints that limited ADAMW to ten epochs, this optimizer showed promising results compared to configurations using early stopping techniques. SGD provided higher BLEU scores for generalization but was very time-consuming. The results highlight the importance of finding a balance between optimization performance and computational efficiency, positioning ADAMW as a promising alternative when training efficiency and generalization are primary concerns.
Abstract
Leadership has now become a process for applying methods and techniques that make the Organization at the top of its competitive pyramid a greater market share. Leadership has become a focus for all leaders and managers، and leaders and managers are increasingly seeking to develop their skills and leadership skills. The research started with a clear problem of specific questions to ensure that the general objective of the research is to describe the characteristics of the leader and to clarify the dimensions of empowering the workers and to highlight the role of the leader in empowering the workers. The study examines the relation between the role of the leader in
... Show Morehe planning process is generally aimed at developing the city and making it meet the needs of different citizens. The green areas constitute one of the basic needs of the city and with the rapid and unusual growth in the size of cities, especially in the third world countries, which is often embodied in capitals. Which was achieved as a result of many reasons, including political, economic and social and even enshrined through some of the decisions that were issued and the city of Baghdad, but a clear example of these cities. The city and the environment are inseparable terms. The city is where people spend their lives and their daily experiences, and the environment is the center in w
... Show MoreRecommender Systems are tools to understand the huge amount of data available in the internet world. Collaborative filtering (CF) is one of the most knowledge discovery methods used positively in recommendation system. Memory collaborative filtering emphasizes on using facts about present users to predict new things for the target user. Similarity measures are the core operations in collaborative filtering and the prediction accuracy is mostly dependent on similarity calculations. In this study, a combination of weighted parameters and traditional similarity measures are conducted to calculate relationship among users over Movie Lens data set rating matrix. The advantages and disadvantages of each measure are spotted. From the study, a n
... Show MoreThis study deals with the impact of leadership styles in its three main dimensions (democratic, autocratic, lenient) as an independent variable of the dimensions of functional combustion (emotional stress, inhumanity, personal achievement). The research sought to achieve a set of goals, the most prominent of which are: studying the reality of the researched organization to identify the leadership patterns used in its management and its impact on the phenomenon of functional combustion, Moreover, knowing the extent of support for these established patterns and their contribution to mitigating the phenomenon of functional combustion in the organization's environment, and testing the impact of these leadership patte
... Show MoreThe research aimed to demonstrate the possibility of benefiting from the coordination between real estate and income tax as the independent variable on the tax outcome as the dependent variable as the dependent variable. Which were practiced within rented buildings, as information was obtained from real estate owners, and the annual controls for the year 2021 were relied upon in the process of calculating the tax amounts expected to be obtained. used in the tax inventory process lacks seriousness and continuous updating
Objectives: The aim of this study to assess instructional labor support behaviors among laboring
women in teaching hospitals in Hilla city.
Methodology: A descriptive analytic study was concluded to select a sample purposely of one hundred
multipara laboring women in maternity hospital in Hilla city and data was collected through
questionnaire form during February (1
st to March 30th) 2014. A descriptive statistical method was used
to analyze the data.
Results: The result showed that the highest percentage of study sample was at age (20-24) years, most
of them was house wife, more than third graduate from primary school, and more than half of them
lived in rural area, (86%) of study sample delivered normal deli
A batch adsorption system was applied to study the adsorption of methylene blue from aqueous solution by Iraqi bentonite and treated bentonite with different amount of zinc oxide (ZnO). The adsorption capacities of methylene blue onto bentonite were evaluated. The equilibrium between liquid and solid phase was described by Langmuir model better than the Freundlich model. Langmuir and Freundlich constants have been determined. The separation factor or equilibrium parameter, RL which is used to predict if an adsorption system is favourable or unfavourable was calculated for all cases.
The purpose of present work is to study the relationship of the deformed shape of the nucleus with the radioactivity of nuclei for (Uranium-238 and Thorium-232) series. To achieve our purposes we have been calculated the quadruple deformation parameter (β2) and the eccentricity (e) and compare the radioactive series with the change of the and (e) as indicator for the changing in the nucleus shape with the radioactivity. To obtain the value of quadruple deformation parameter (β2), the adopted value of quadruple transition probability B (E2; 0+ → 2+) was calculated from Global Best fit equation. While the eccentricity (e) was calculated from the values of the minor and major ellipsoid axis’s (a & b). From the results, it is obvi
... Show MoreOne of the unique properties of laser heating applications is its powerful ability for precise pouring of energy on the needed regions in heat treatment applications. The rapid rise in temperature at the irradiated region produces a high temperature gradient, which contributes in phase metallurgical changes, inside the volume of the irradiated material. This article presents a comprehensive numerical work for a model based on experimentally laser heated AISI 1110 steel samples. The numerical investigation is based on the finite element method (FEM) taking in consideration the temperature dependent material properties to predict the temperature distribution within the irradiated material volume. The finite element analysis (FEA) was carried
... Show MoreBeyond the immediate content of speech, the voice can provide rich information about a speaker's demographics, including age and gender. Estimating a speaker's age and gender offers a wide range of applications, spanning from voice forensic analysis to personalized advertising, healthcare monitoring, and human-computer interaction. However, pinpointing precise age remains intricate due to age ambiguity. Specifically, utterances from individuals at adjacent ages are frequently indistinguishable. Addressing this, we propose a novel, end-to-end approach that deploys Mozilla's Common Voice dataset to transform raw audio into high-quality feature representations using Wav2Vec2.0 embeddings. These are then channeled into our self-attentio
... Show More