When optimizing the performance of neural network-based chatbots, the choice of optimizer is one of the most important decisions. Optimizers control how model parameters such as weights and biases are adjusted to minimize a loss function during training. Adaptive optimizers such as ADAM have become a standard choice and are widely used because the magnitude of their parameter updates is invariant to rescaling of the gradient, but they often generalize poorly. Alternatively, Stochastic Gradient Descent (SGD) with Momentum and ADAMW, an extension of ADAM, offer several advantages. This study compares and examines the effects of these optimizers on the chatbot CST dataset. The effectiveness of each optimizer is evaluated based on its sparse categorical loss during training and its BLEU score in the inference phase, using a neural generative attention-based model with an additive scoring function. Despite memory constraints that limited ADAMW to ten epochs, this optimizer showed promising results compared to configurations using early stopping techniques. SGD provided higher BLEU scores for generalization but was very time-consuming. The results highlight the importance of finding a balance between optimization performance and computational efficiency, positioning ADAMW as a promising alternative when training efficiency and generalization are primary concerns.
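The three update rules compared in the abstract can be sketched in a few lines. This is a minimal illustration of the textbook forms of SGD with momentum, ADAM, and ADAMW (decoupled weight decay), not the study's training code; the function names, hyperparameter defaults, and list-of-floats parameter layout are assumptions chosen for readability.

```python
import math

def sgd_momentum_step(w, g, v, lr=0.01, mu=0.9):
    """One SGD-with-momentum step on parameter list w with gradient list g."""
    v = [mu * vi + gi for vi, gi in zip(v, g)]      # accumulate velocity
    w = [wi - lr * vi for wi, vi in zip(w, v)]      # move against velocity
    return w, v

def adam_step(w, g, m, v, t, lr=0.001, b1=0.9, b2=0.999, eps=1e-8,
              weight_decay=0.0, decoupled=False):
    """One ADAM step; decoupled=True applies ADAMW-style weight decay.

    t is the 1-based step count used for bias correction.
    """
    if weight_decay and not decoupled:
        # Plain ADAM with an L2 penalty: the decay is folded into the gradient.
        g = [gi + weight_decay * wi for gi, wi in zip(g, w)]
    m = [b1 * mi + (1 - b1) * gi for mi, gi in zip(m, g)]        # first moment
    v = [b2 * vi + (1 - b2) * gi * gi for vi, gi in zip(v, g)]   # second moment
    new_w = []
    for wi, mi, vi in zip(w, m, v):
        m_hat = mi / (1 - b1 ** t)                  # bias-corrected moments
        v_hat = vi / (1 - b2 ** t)
        step = lr * m_hat / (math.sqrt(v_hat) + eps)
        if decoupled:
            # ADAMW: decay acts on the weight directly, outside the
            # adaptive rescaling.
            step += lr * weight_decay * wi
        new_w.append(wi - step)
    return new_w, m, v

# Rescaling the gradient by 1000 leaves the first ADAM step almost unchanged.
small, _, _ = adam_step([1.0], [0.5], [0.0], [0.0], t=1)
large, _, _ = adam_step([1.0], [500.0], [0.0], [0.0], t=1)
```

The last two lines illustrate the scale invariance mentioned in the abstract: on the first step the update is approximately `lr` times the sign of the gradient, whatever the gradient's magnitude.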
Research Summary
This research highlights the importance of assessing the demand-for-money function in Iraq by examining the relationship between the function and the variables that affect it. It investigates the stability of the function and the extent of these variables' influence on the Iraqi dinar exchange rate, in order to gauge their contribution to the monetary policies of the Iraqi economy. It also studies the behavior of the demand-for-money function in Iraq and analyzes the determinants of the demand for money over the period 1991-2013 and their impact on the demand for money in Iraq.
The problem we face is how to estimate the total demand for money in
Purpose – Cloud computing (CC) and its services have enabled the information centers of organizations to adapt their informatic and technological infrastructure, making it better suited to developing flexible information systems that respond to the informational and knowledge needs of their users. In this context, cloud-data governance has become more complex and dynamic, requiring an in-depth understanding of the data management strategy at these centers in terms of organizational structure and regulations, people, technology, process, and roles and responsibilities. Therefore, our paper discusses these dimensions as challenges facing information centers with regard to their data governance and the impa
In this paper, the fractional Maxwell fluid equation is solved. The solution is expressed in Mittag-Leffler form. The corresponding solutions for the ordinary Maxwell fluid are obtained as a limiting case of the general solutions. Finally, the effects of different parameters on the velocity and shear stress profiles are analyzed by plotting these profiles.
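For reference, the standard definition of the Mittag-Leffler function is given below. The abstract does not state which form the paper uses, so the symbols $\alpha$, $\beta$, and $z$ are generic notation rather than the paper's own.

```latex
% Two-parameter Mittag-Leffler function (standard definition)
E_{\alpha,\beta}(z) = \sum_{k=0}^{\infty} \frac{z^{k}}{\Gamma(\alpha k + \beta)},
\qquad \alpha > 0,\ \beta > 0.

% One-parameter form and its classical limit
E_{\alpha}(z) = E_{\alpha,1}(z), \qquad E_{1}(z) = \sum_{k=0}^{\infty} \frac{z^{k}}{k!} = e^{z}.
```

Because $E_{1}(z) = e^{z}$, solutions written in Mittag-Leffler form reduce to exponential expressions when the fractional order tends to 1, which is consistent with recovering the ordinary Maxwell fluid as a limiting case.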
In this study, a traumatic spinal cord injury (TSCI) classification system is proposed using a convolutional neural network (CNN) technique with automatically learned features from electromyography (EMG) signals for a non-human primate (NHP) model. A comparison between the proposed classification system and a classical classification method (k-nearest neighbors, kNN) is also presented. Developing such an NHP model with a suitable assessment tool (i.e., classifier) is a crucial step in detecting the effect of TSCI using EMG, which is expected to be essential in the evaluation of the efficacy of new TSCI treatments. Intramuscular EMG data were collected from an agonist/antagonist tail muscle pair for the pre- and post-spinal cord lesi
The present work aims to fabricate an n-i-p forward perovskite solar cell (PSC) with the structure (FTO/ compact TiO2/ compact TiO2/ MAPbI3 Perovskite/ hole transport layer/ Au). P3HT, CuI and Spiro-OMeTAD were used as hole transport layers. A 25 nm gold nanofilm was deposited first between the electron transport layer and the perovskite layer, then between the hole transport layer and the perovskite layer. The performance of the forward perovskite solar cell was studied, and the role of each electron transport layer and hole transport layer in the cell was presented. The structural, morphological and electrical properties were studied with X-ray diffractometer, field emission s
The aim of this research is to analyze the effect of changes in (GDA, g, inflation) on unemployment, following a standard econometric approach to model construction. The analysis relies on the SPSS program and on the data available from the Central Bank of Iraq for the period from 2003 to 2018. The equation was estimated using OLS, and the results showed a statistically significant relationship at the 5% significance level. The R2 value of 92.1 indicates that changes in the independent variables explain 92% of the changes in unemployment, while the effects of the independent variables are very limited according to the estimated parameters of the model, which are (0.986, 0.229, -0.060) respectively. The research recommended the necessity to activate the inve
Titanium alloy (Ti-6Al-4V or Gr.23) is widely used as a dental alloy. In the current study, polymerization of eugenol (PE) on Gr.23 titanium alloy was conducted by an electrochemical process before and after treatment by Micro Arc Oxidation (MAO). The formed films were characterized by scanning electron microscopy (SEM), energy-dispersive X-ray spectroscopy (EDS), and X-ray diffraction (XRD). The corrosion behavior of Gr.23 alloy in an artificial saliva environment over a temperature range of 293–323 K was studied and assessed by means of electrochemical polarization and impedance spectroscopy techniques. Three cases were taken into consideration: bare Gr.23, Gr.23 coated by PE, and Gr.23 coated by PE after MAO treatment. The maxi
The current research variables have received increasing attention in recent years because they are among the important issues affecting the future of organizations, as a result of the rapid environmental changes that have greatly affected organizations. To explain the relationships and links between the research variables, this research tests the type and direction of the relationship between strategic foresight capabilities as an independent variable and green creativity as a dependent variable. A set of questions has arisen about the basic research problem, including: what is the nature and level of interest in the research variables (strategic foresight capabilities an
Markov chains are an application of stochastic models in operations research, supporting the analysis and optimization of processes with random events and transitions. The method used to obtain the transient solution of a Markov chain problem is an important part of this process. The present paper introduces a novel Ordinary Differential Equation (ODE) approach to solving the Markov chain problem. It considers the probability distribution at a given time of a continuous-time Markov chain with an infinitesimal generator, which is the solution of the Chapman-Kolmogorov differential equation. This study presents a one-step second-derivative method with better accuracy in solving the first-order Initial Value Problem
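To make the ODE formulation concrete, the sketch below integrates the forward equation p'(t) = p(t) Q for a two-state continuous-time Markov chain and compares the result with the known closed-form solution. It uses the classical fourth-order Runge-Kutta scheme as a stand-in, not the one-step second-derivative method the paper proposes; the generator values and function names are illustrative assumptions.

```python
import math

def forward_rhs(p, Q):
    """Right-hand side of the forward equation: (p Q)_j = sum_i p_i * Q[i][j]."""
    n = len(p)
    return [sum(p[i] * Q[i][j] for i in range(n)) for j in range(n)]

def transient_rk4(p0, Q, t_end, steps):
    """Integrate p' = p Q from 0 to t_end with classical RK4 (stand-in solver)."""
    h = t_end / steps
    p = list(p0)
    for _ in range(steps):
        k1 = forward_rhs(p, Q)
        k2 = forward_rhs([pi + 0.5 * h * ki for pi, ki in zip(p, k1)], Q)
        k3 = forward_rhs([pi + 0.5 * h * ki for pi, ki in zip(p, k2)], Q)
        k4 = forward_rhs([pi + h * ki for pi, ki in zip(p, k3)], Q)
        p = [pi + (h / 6.0) * (a + 2.0 * b + 2.0 * c + d)
             for pi, a, b, c, d in zip(p, k1, k2, k3, k4)]
    return p

def two_state_exact(p00, a, b, t):
    """Closed-form P(state 0 at time t) for rates a (0 -> 1) and b (1 -> 0)."""
    stationary = b / (a + b)
    return stationary + (p00 - stationary) * math.exp(-(a + b) * t)

# Hypothetical generator: leave state 0 at rate 2, leave state 1 at rate 1.
Q = [[-2.0, 2.0],
     [1.0, -1.0]]
p_numeric = transient_rk4([1.0, 0.0], Q, t_end=1.0, steps=1000)
p_exact = two_state_exact(1.0, 2.0, 1.0, 1.0)
```

Each row of Q sums to zero, so the components of `p_numeric` continue to sum to 1 up to rounding error, and `p_numeric[0]` should agree closely with `p_exact`.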
Education, especially higher education, is an essential factor in the progress of any society. Higher education represents the top of the education pyramid; it takes part in developing human resources and provides the human staff needed to raise productive efficiency and improve the social and economic level.
To meet the increasing importance of higher education, substantial capabilities and expenditures must be continuously available, such expe