When optimizing the performance of neural network-based chatbots, determining the optimizer is one of the most important aspects. Optimizers primarily control the adjustment of model parameters such as weight and bias to minimize a loss function during training. Adaptive optimizers such as ADAM have become a standard choice and are widely used for their invariant parameter updates' magnitudes concerning gradient scale variations, but often pose generalization problems. Alternatively, Stochastic Gradient Descent (SGD) with Momentum and the extension of ADAM, the ADAMW, offers several advantages. This study aims to compare and examine the effects of these optimizers on the chatbot CST dataset. The effectiveness of each optimizer is evaluated based on its sparse-categorical loss during training and BLEU in the inference phase, utilizing a neural generative attention-based additive scoring function. Despite memory constraints that limited ADAMW to ten epochs, this optimizer showed promising results compared to configurations using early stopping techniques. SGD provided higher BLEU scores for generalization but was very time-consuming. The results highlight the importance of finding a balance between optimization performance and computational efficiency, positioning ADAMW as a promising alternative when training efficiency and generalization are primary concerns.
PVA, Starch/PVA, and Starch/PVA/sugar samples of different
concentrations (10, 20, 30 and 40 % wt/wt) were prepared by casting
method. DSC analysis was carried; the results showed only one glass
transition temperature (Tg) for the samples involved, which suggest
that starch/PVA and starch/PVA/sugar blends are miscible. The
miscibility is attributed to the hydrogen bonds between PVA and
starch. This is in a good agreement with (FTIR) results. Tg and Tm
decrease with starch and sugar content compared with that for
(PVA). Systematic decrease in ultimate strength, due to starch and
sugar ratio increase, is attributed to (PVA), which has more hydroxyl
groups that made its ultimate strength higher than that for
Poly vinyl alcohol has been studied for its ability to form crystallites by using annealing method. Semicrystalline films of poly vinyl alcohol (PVA) were prepared by casting 11.5 wt. % and 13 wt. % PVA aqueous solution onto glass slides at annealing temperature range 90 -120°C and duration time 15- 60 minute. This allowed the macromolecules to form crystallites, small regions of folded and compacted chains separated by amorphous regions where single PVA chain may pass through several of these crystallites. Degree of crystallinity of PVA films (hydrogels) was determined by method of density; on the other hand the swelling behavior was conducted by the determination of water uptake, wet degree of crystallinity, gel fraction and solubilit
... Show MoreElectrical Discharge Machining (EDM) is a widespread Nontraditional Machining (NTM) processes for manufacturing of a complicated geometry or very hard metals parts that are difficult to machine by traditional machining operations. Electrical discharge machining is a material removal (MR) process characterized by using electrical discharge erosion. This paper discusses the optimal parameters of EDM on high-speed steel (HSS) AISI M2 as a workpiece using copper and brass as an electrode. The input parameters used for experimental work are current (10, 24 and 42 A), pulse on time (100, 150 and 200 µs), and pulse off time (4, 12 and 25 µs) that have effect on the material removal rate (MRR), electrode wear rate (EWR) and wear ratio (WR). A
... Show MoreTraditional programs and the tedious and financially costly processes they require are no longer the best choice for content makers. The continuous development and development have led to the emergence of competitive software that offers capabilities that are more suitable for aesthetic needs, as it breaks down stereotypical frameworks from the familiar to the unfamiliar to be more suitable for graphic subjects in terms of dealing with the requirements of the digital content industry. Video for communication platforms, as it has more advantages than traditional software and the flexibility and high quality it offers at the level of the final product, All of this contributed to supplementing the image with aesthetic employments with data
... Show MoreAbstract
The aim of the current research is to identify the Effect of the alternative evaluation strategy on the achievement of fourth-grade female students in the subject of biology. The researchers adopted the zero hypothesis to prove the research objectives, which is there is no statistically significant difference at the level (0.05) between the average scores of the experimental group who study according to the alternative evaluation strategy and the average scores of the control group who study in accordance with the traditional method. The researchers selected the experimental partial adjustment design of the experimental and control groups with the post-test. The researchers intentionally selected (Al-fed
... Show MoreThis article proposes a new technique for determining the rate of contamination. First, a generative adversarial neural network (ANN) parallel processing technique is constructed and trained using real and secret images. Then, after the model is stabilized, the real image is passed to the generator. Finally, the generator creates an image that is visually similar to the secret image, thus achieving the same effect as the secret image transmission. Experimental results show that this technique has a good effect on the security of secret information transmission and increases the capacity of information hiding. The metric signal of noise, a structural similarity index measure, was used to determine the success of colour image-hiding t
... Show MoreInformation from 54 Magnetic Resonance Imaging (MRI) brain tumor images (27 benign and 27 malignant) were collected and subjected to multilayer perceptron artificial neural network available on the well know software of IBM SPSS 17 (Statistical Package for the Social Sciences). After many attempts, automatic architecture was decided to be adopted in this research work. Thirteen shape and statistical characteristics of images were considered. The neural network revealed an 89.1 % of correct classification for the training sample and 100 % of correct classification for the test sample. The normalized importance of the considered characteristics showed that kurtosis accounted for 100 % which means that this variable has a substantial effect
... Show MoreIt is believed that culture plays an important role in the ELF classroom activities (Al- Mutawa, & Kilani, 1989:87). It is important for the teacher to recognize potential negative (culturally based) perceptions of their learners. In Iraq, for instance, it is not. Uncommon to meet silent expressionless students that arc supposedly English language learners. It is possible for the beginner to interpret this negatively as a lack of interest in the study of English. This interpretation may play a harmful role in the classroom methodology. An instructor has to be intercultural competent to be an effective teacher. It will be more effective if the instructor adopts a consistent style of instruction to allow learners to adapt within the bounds of
... Show MoreThe evolution in the field of Artificial Intelligent (AI) with its training algorithms make AI very important in different aspect of the life. The prediction problem of behavior of dynamical control system is one of the most important issue that the AI can be employed to solve it. In this paper, a Convolutional Multi-Spike Neural Network (CMSNN) is proposed as smart system to predict the response of nonlinear dynamical systems. The proposed structure mixed the advantages of Convolutional Neural Network (CNN) with Multi -Spike Neural Network (MSNN) to generate the smart structure. The CMSNN has the capability of training weights based on a proposed training algorithm. The simulation results demonstrated that the proposed
... Show More