When optimizing the performance of neural network-based chatbots, determining the optimizer is one of the most important aspects. Optimizers primarily control the adjustment of model parameters such as weight and bias to minimize a loss function during training. Adaptive optimizers such as ADAM have become a standard choice and are widely used for their invariant parameter updates' magnitudes concerning gradient scale variations, but often pose generalization problems. Alternatively, Stochastic Gradient Descent (SGD) with Momentum and the extension of ADAM, the ADAMW, offers several advantages. This study aims to compare and examine the effects of these optimizers on the chatbot CST dataset. The effectiveness of each optimizer is evaluated based on its sparse-categorical loss during training and BLEU in the inference phase, utilizing a neural generative attention-based additive scoring function. Despite memory constraints that limited ADAMW to ten epochs, this optimizer showed promising results compared to configurations using early stopping techniques. SGD provided higher BLEU scores for generalization but was very time-consuming. The results highlight the importance of finding a balance between optimization performance and computational efficiency, positioning ADAMW as a promising alternative when training efficiency and generalization are primary concerns.
Abstract
This paper concerned with study the effect of a graphite micro powder mixed in the kerosene dielectric fluid during powder mixing electric discharge machining (PMEDM) of high carbon high chromium AISI D2 steel. The type of electrode (copper and graphite), the pulse current and the pulse-on time and mixing powder in kerosene dielectric fluid are taken as the process main input parameters. The material removal rate MRR, the tool wear ratio TWR and the work piece surface roughness (SR) are taken as output parameters to measure the process performance. The experiments are planned using response surface methodology (RSM) design procedure. Empirical models are developed for MRR, TWR and SR, using the analysis
... Show MoreThis research highlights the light on the general framework of accounting discloser in the Islamic banks, and show the types and the concepts of Cost Efficiency, In this present study, the sample included Fourteen Islamic banks, where the data was collected from the annual financial reports. Accordingly, the study in order to achieve the aims and access to the results based on the analytical method and the descriptive analysis, and conducted a Simple & Multiple Linear Regression analysis, in order to test hypotheses of the research by using of statistical analysis software (SPSS). The research has arrived to many results such as: the commitment of Islamic banks working in the Kingdome of Bahrain (Wholesale) to the requirements of the
... Show MoreThe Ant System Algorithm (ASA) is a member of the ant colony algorithms family in swarm intelligence methods (part of the Artificial Intelligence field), which is based on the behavior of ants seeking a path and a source of food in their colonies. The aim of This algorithm is to search for an optimal solution for Combinational Optimization Problems (COP) for which is extremely difficult to find solution using the classical methods like linear and non-linear programming methods.
The Ant System Algorithm was used in the management of water resources field in Iraq, specifically for Haditha dam which is one of the most important dams in Iraq. The target is to find out an efficient management system for
... Show MoreThe research aims to demonstrate the impact of the acceptable solvency of the National Insurance Company on the investment activity in it, as the research assumes the existence of a statistically significant relationship between the acceptable solvency variable and the investment variable, and the researcher took the National Insurance Company as a place to conduct the research, as it is the first insurance company Watania was established in the fifties of the last century, and it has a long history in the practice of investment activity, and the company’s financial statements for the period from (2010-2019) were relied on, and the annual reports issued by the company as well as records are tools for gathering information, and
... Show MoreBecause of the fierce competition between service organizations on the one hand and the increasing demands of customers on the other. Therefore, these organizations sought to distinguish their service by taking care of all aspects. One of these important aspects is the service encounter environment and its reflection on customer emotions, so we choose the current research to clarify the importance and impact on customer satisfaction, the problem of research is how the interest of Iraqi restaurants in the service encounter environment and how to care about its elements and whether this interest is sufficient to reflect the satisfaction of the customer. the goal of the current research was to clarify how much the application of the
... Show MoreThe determination of critical micelle concentration of selected non-ionic surfactants (Tween 20,40 and 80) have been investigated using magnetic water(MW)as an aqueous medium.Conductometry technique is used to determine critical micelle concentration.The effect of alcohol addition and temperature variation at the range(293.15 -303.15K) are also pursued. It is concluded that the process of micellization is spontaneous and endothermic because of the observed free energy of micellization (ΔGom) , enthalpy change of micellization (ΔHom), and entropy change of micellization (ΔSom) for the system was also studied.The properties of the non-ionic surfactants were studied, both in absence and presence of
... Show MoreBrowse Iraqi academic journals and research papers
The goal of this work is demonstrating, through the gradient observation of a of type linear ( -systems), the possibility for reducing the effect of any disturbances (pollution, radiation, infection, etc.) asymptotically, by a suitable choice of related actuators of these systems. Thus, a class of ( -system) was developed based on finite time ( -system). Furthermore, definitions and some properties of this concept -system and asymptotically gradient controllable system ( -controllable) were stated and studied. More precisely, asymptotically gradient efficient actuators ensuring the weak asymptotically gradient compensation system ( -system) of known or unknown disturbances are examined. Consequently, under convenient hypo
... Show MoreThe development in the presentation and presentation of the service in order to distinguish them from the same, was one of the most important reasons to choose the current issue to upgrade the level of service, especially in the Iraqi restaurant sector, which has become today of the important sectors successful. The problem of research was to try to answer a range of questions: to what extent are Iraqi restaurants interested in physical service factors? Do Iraqi restaurants apply physical factors in a way that leads to customer satisfaction? Are Iraqi restaurants interested in the satisfaction of their customers? The objective of the current research is to try to determine the extent to which the
... Show More