When optimizing the performance of neural network-based chatbots, determining the optimizer is one of the most important aspects. Optimizers primarily control the adjustment of model parameters such as weight and bias to minimize a loss function during training. Adaptive optimizers such as ADAM have become a standard choice and are widely used for their invariant parameter updates' magnitudes concerning gradient scale variations, but often pose generalization problems. Alternatively, Stochastic Gradient Descent (SGD) with Momentum and the extension of ADAM, the ADAMW, offers several advantages. This study aims to compare and examine the effects of these optimizers on the chatbot CST dataset. The effectiveness of each optimizer is evaluated based on its sparse-categorical loss during training and BLEU in the inference phase, utilizing a neural generative attention-based additive scoring function. Despite memory constraints that limited ADAMW to ten epochs, this optimizer showed promising results compared to configurations using early stopping techniques. SGD provided higher BLEU scores for generalization but was very time-consuming. The results highlight the importance of finding a balance between optimization performance and computational efficiency, positioning ADAMW as a promising alternative when training efficiency and generalization are primary concerns.
Abstract
This research deals with Building A probabilistic Linear programming model representing, the operation of production in the Middle Refinery Company (Dura, Semawa, Najaif) Considering the demand of each product (Gasoline, Kerosene,Gas Oil, Fuel Oil ).are random variables ,follows certain probability distribution, which are testing by using Statistical programme (Easy fit), thes distribution are found to be Cauchy distribution ,Erlang distribution ,Pareto distribution ,Normal distribution ,and General Extreme value distribution . &
... Show MoreAbstract :
The research aims to Estimate the Strength of Strategic Innovation application in terms of application strength , and on the overall level in number of Iraqi Industrial business organizations . After wards determine whether their is differerences among those organizations in application process for the dimensions , and for the overall process .
The Research revealed number of conclusions including that the process of strategic innovation is applied in a good Level , and demonstrates the desier of the industrial companies Leaders to Launch beyond the familiar products , and to provide new products that
... Show MoreThis study deals with the impact of leadership styles in its three main dimensions (democratic, autocratic, lenient) as an independent variable of the dimensions of functional combustion (emotional stress, inhumanity, personal achievement). The research sought to achieve a set of goals, the most prominent of which are: studying the reality of the researched organization to identify the leadership patterns used in its management and its impact on the phenomenon of functional combustion, Moreover, knowing the extent of support for these established patterns and their contribution to mitigating the phenomenon of functional combustion in the organization's environment, and testing the impact of these leadership patte
... Show MoreBeyond the immediate content of speech, the voice can provide rich information about a speaker's demographics, including age and gender. Estimating a speaker's age and gender offers a wide range of applications, spanning from voice forensic analysis to personalized advertising, healthcare monitoring, and human-computer interaction. However, pinpointing precise age remains intricate due to age ambiguity. Specifically, utterances from individuals at adjacent ages are frequently indistinguishable. Addressing this, we propose a novel, end-to-end approach that deploys Mozilla's Common Voice dataset to transform raw audio into high-quality feature representations using Wav2Vec2.0 embeddings. These are then channeled into our self-attentio
... Show MoreThe changes that have occurred in the business environment and scientific and technological progress, as well as the complexity of administrative problems resulting from its practice of various activities, have led to an increase in the responsibilities entrusted to it, and for the purpose of achieving its strategic objectives, which has made the pillars of corporate governance an inevitable matter required by the nature of modern scientific management of the governorate, the success that companies seek is based on the fertile environment and the dialectical relationship between the individual and the company, and to achieve this success there must be a compatible and harmonious audit environment between the internal and external
... Show MoreThe research aimed to demonstrate the possibility of benefiting from the coordination between real estate and income tax as the independent variable on the tax outcome as the dependent variable as the dependent variable. Which were practiced within rented buildings, as information was obtained from real estate owners, and the annual controls for the year 2021 were relied upon in the process of calculating the tax amounts expected to be obtained. used in the tax inventory process lacks seriousness and continuous updating
E-learning is a necessity imposed by the Corona pandemic, which has disrupted various educational institutions in the world, but some of these institutions have not been affected and education has continued with them, due to their flexible educational system that was able to employ technology in the continuity of the educational process in the so-called e-learning, because It has characteristics that make it the most suitable alternative to avoid the consequences of the Corona pandemic and its damage to the educational process, as e-learning is one of the modern methods that contribute to enhancing the effectiveness of the learner, and enabling him to assume greater responsibility compared to traditional education, so the learner becomes
... Show MoreThe aim of this paper is to design suitable neural network (ANN) as an alternative accurate tool to evaluate concentration of Copper in contaminated soils. First, sixteen (4x4) soil samples were harvested from a phytoremediated contaminated site located in Baghdad city in Iraq. Second, a series of measurements were performed on the soil samples. Third, design an ANN and its performance was evaluated using a test data set and then applied to estimate the concentration of Copper. The performance of the ANN technique was compared with the traditional laboratory inspecting using the training and test data sets. The results of this study show that the ANN technique trained on experimental measurements can be successfully applied to the rapid est
... Show MoreSequence covering array (SCA) generation is an active research area in recent years. Unlike the sequence-less covering arrays (CA), the order of sequence varies in the test case generation process. This paper reviews the state-of-the-art of the SCA strategies, earlier works reported that finding a minimal size of a test suite is considered as an NP-Hard problem. In addition, most of the existing strategies for SCA generation have a high order of complexity due to the generation of all combinatorial interactions by adopting one-test-at-a-time fashion. Reducing the complexity by adopting one-parameter- at-a-time for SCA generation is a challenging process. In addition, this reduction facilitates the supporting for a higher strength of cove
... Show MoreThe present research aims to design an electronic system based on cloud computing to develop electronic tasks for students of the University of Mosul. Achieving this goal required designing an electronic system that includes all theoretical information, applied procedures, instructions, orders for computer programs, and identifying its effectiveness in developing Electronic tasks for students of the University of Mosul. Accordingly, the researchers formulated three hypotheses related to the cognitive and performance aspects of the electronic tasks. To verify the research hypotheses, a sample of (91) students is intentionally chosen from the research community, represented by the students of the college of education for humanities and col
... Show More