When optimizing the performance of neural network-based chatbots, determining the optimizer is one of the most important aspects. Optimizers primarily control the adjustment of model parameters such as weight and bias to minimize a loss function during training. Adaptive optimizers such as ADAM have become a standard choice and are widely used for their invariant parameter updates' magnitudes concerning gradient scale variations, but often pose generalization problems. Alternatively, Stochastic Gradient Descent (SGD) with Momentum and the extension of ADAM, the ADAMW, offers several advantages. This study aims to compare and examine the effects of these optimizers on the chatbot CST dataset. The effectiveness of each optimizer is evaluated based on its sparse-categorical loss during training and BLEU in the inference phase, utilizing a neural generative attention-based additive scoring function. Despite memory constraints that limited ADAMW to ten epochs, this optimizer showed promising results compared to configurations using early stopping techniques. SGD provided higher BLEU scores for generalization but was very time-consuming. The results highlight the importance of finding a balance between optimization performance and computational efficiency, positioning ADAMW as a promising alternative when training efficiency and generalization are primary concerns.
A series of batch demulsification runs were carried out to evaluate the final emulsified water content of emulsion samples after the exposure to microwave. An experimental study was conducted to evaluate the effects of a set of operating variables on the demulsification performance. Several microwave irradiation demulsification runs were carried out at different irradiation powers (700, 800, and 900 watt), using water-in-oil emulsion samples containing different water contents (20-80%, 30-70%, and 50-50%) and salt contents (10000, 20000, and 30000 ppm). It was found that the best separation efficiency was obtained at 900watt, 50% water content and 160 s of irradiation time. Experimental results showed that microwave radiation method can
... Show MoreThe objective of this study is to apply Artificial Neural Network for heat transfer analysis of shell-and-tube heat exchangers widely used in power plants and refineries. Practical data was obtained by using industrial heat exchanger operating in power generation department of Dura refinery. The commonly used Back Propagation (BP) algorithm was used to train and test networks by divided the data to three samples (training, validation and testing data) to give more approach data with actual case. Inputs of the neural network include inlet water temperature, inlet air temperature and mass flow rate of air. Two outputs (exit water temperature to cooling tower and exit air temperature to second stage of air compressor) were taken in ANN.
... Show MoreThe administration on the basis of the activities designed to evaluate the performance of activities in terms of cost, time and quality by identifying activities that add value and those that are no add value and enables the administration of making up their own continuous improvement in production, through lower costs and reduce the time and improve the quality and reduce the incidence of spoilage and waste, y based search Ally premise that (the continuous improvement of the adoption of management style on the basis of the activities helps management in decision-making wise to reduce costs) to prove the hypothesis has sought research to achieve its goal of Alkadivh and Alkoppelan &nb
... Show MoreThe efficiency evaluation of the railway lines performance is done through a set of indicators and criteria, the most important are transport density, the productivity of enrollee, passenger vehicle production, the productivity of freight wagon, and the productivity of locomotives. This study includes an attempt to calculate the most important of these indicators which transport density index from productivity during the four indicators, using artificial neural network technology. Two neural networks software are used in this study, (Simulnet) and (Neuframe), the results of second program has been adopted. Training results and test to the neural network data used in the study, which are obtained from the international in
... Show MoreDeep learning convolution neural network has been widely used to recognize or classify voice. Various techniques have been used together with convolution neural network to prepare voice data before the training process in developing the classification model. However, not all model can produce good classification accuracy as there are many types of voice or speech. Classification of Arabic alphabet pronunciation is a one of the types of voice and accurate pronunciation is required in the learning of the Qur’an reading. Thus, the technique to process the pronunciation and training of the processed data requires specific approach. To overcome this issue, a method based on padding and deep learning convolution neural network is proposed to
... Show MoreABSTRACT:
Interest rates are one of the important aspects that affect the banking business directly, which is characterized by unstable dynamic dynamics, which must be viewed on a daily and continuous basis through the macroeconomic view, which directly affects the bank’s income realized from loans as interest received or interest paid on its deposits as an expense. Hence the earnings per share. The relationship between interest rates and between net income and earnings per share was measured and a correlation was found between them, and then the effect between them was measured using regression equations and they were applied and th
... Show MoreAbstract The Object of the study aims to identify the effectiveness of using the 7E’s learning cycle to learn movement chains on uneven bars, for this purpose, we used the method SPSS. On a sample composed (20) students on collage of physical education at the university of Baghdad Chosen as two groups experimental and control group (10) student for each group, and for data collection, we used SPSS After collecting the results and having treated them statistically, we conclude the use 7E’s learning cycle has achieved remarkable positive progress, but it has diverged between to methods, On this basis, the study recommended the necessity of applying 7E’s learning cycle strategy in learning the movement chain on uneven bar
... Show MoreThe research aimed to measure the reality of monetary policy and its role in neutralizing the impact of fluctuations in total domestic oil prices, through the most important monetary policy variable (money supply). An example of this is using a simple technique in the previous example, turning it into a straightforward user interface by (Judd and Kunee). After estimating the impact of the policy with the domestic gross domestic oil prices in Iraq, the effect of fluctuations in the domestic gross domestic oil prices in the simple regression model, while the morale of oil prices was not proven with a negative sign, while the morale of money supply and their impact on the increase of the domestic was proven in the multiple regressio
... Show MoreThis qualitative study was conducted on eight types of commercial baking yeast which available in local markets to estimate their fermentation activity as affecting the Bread industry and the impact of the salt added to DoughLeavening, The results showed a great variation in the fermentation capacity of yeast samples (their role in swelling the dough), most notably the sample value Y3 and least sample Y7 and reached 80% and 20% respectively, and the value of Leavening by using the two types of yeast with addition of three levels of salt (0 , 1 and 2%) have 20.0 , 19.7 and 15.7 of the sample Y3, compared with 10.5 , 10.3 and 8.8 of the sample Y7 for each of the levels of salt respectively, reflect
... Show MoreCadmium sulphide CdS films with 200 nm have been prepared by thermal evaporation technique on glass substrate at substrate room temperature under vacuum of 10-5mbar.In this paper, the effect of Dielectric Barrier Discharge plasma on the optical properties of the CdS film. The prepared films were exposed to different time intervals (0, 3, 5, 8) min. For every sample, the Absorption A, absorption coefficient α , energy gap Eg ,extinction coefficient K and dielectric constant ε were studied. It is found that the energy gap were decreased with exposure time, and absorption , Absorption coefficient, refractive index, extinction coefficient, dielectric constant increased with time of exposure to the plasma. Our study conside
... Show More