When optimizing the performance of neural network-based chatbots, determining the optimizer is one of the most important aspects. Optimizers primarily control the adjustment of model parameters such as weight and bias to minimize a loss function during training. Adaptive optimizers such as ADAM have become a standard choice and are widely used for their invariant parameter updates' magnitudes concerning gradient scale variations, but often pose generalization problems. Alternatively, Stochastic Gradient Descent (SGD) with Momentum and the extension of ADAM, the ADAMW, offers several advantages. This study aims to compare and examine the effects of these optimizers on the chatbot CST dataset. The effectiveness of each optimizer is evaluated based on its sparse-categorical loss during training and BLEU in the inference phase, utilizing a neural generative attention-based additive scoring function. Despite memory constraints that limited ADAMW to ten epochs, this optimizer showed promising results compared to configurations using early stopping techniques. SGD provided higher BLEU scores for generalization but was very time-consuming. The results highlight the importance of finding a balance between optimization performance and computational efficiency, positioning ADAMW as a promising alternative when training efficiency and generalization are primary concerns.
Öz
Arzı Kanber/Kamber hikayesi Anadolu, Rumeli, Azerbaycan, Türkmenistan ve Irak gibi Türk dünyasının birçok yerinde birden fazla varyantı bulunan, çok sevilen ve yaygın olarak anlatılan aşk ve dramatik maceralı bir halk hikayesidir. Türk halk hikayelerinin en popüler olanlarından biri sayılan Arzı Kanber/Kamber hikayesi, Anadolu'nun birçok yöresinde bilinmesine rağmen Irak Türkmenleri arasında daha çok sevildiği ve yaygın olarak anlatıldığı tespit edilen birden fazla varyantından da görülebilir. Irak Türkmenleri arasında günümüze kadar hikayenin iki varyantı tespit edilmi
... Show MoreAbstract
Objectives: To find out the association between enhancing learning needs and demographic characteristic of (gender, education level and age).
Methods: This study was conducted on purposive sample was selected to obtain representative and accurate data consisting of (90) patients who are in a peroid of recovering from myocardial infarction at Missan Center for Cardiac Diseases and Surgery, (10) patients were excluded for the pilot study, Data were analyzed using descriptive statistical data analysis approach of frequency, percentage, and analysis of variance (ANOVA).
Results: The study finding shows, there was sign
... Show MoreIn study of effective bioactive compounds, we have synthesized the Co((ІІ), Mn(ІІ), Fe(ІІ), Cu(ІІ), Ni(ІІ), and Zn(ІІ) complexes of the Schiff base derived from trimethoprim and2'-amino-4-chlorobenzophenone and characterized by spectroscopic (NMR, IR, Mass, UV–vis,), analytical, TGA studies and magnetic data .The solution electronic spectral study suggests the stoichiometry of the synthesized complexes and Elemental analysis detected the square planer and octahedral geometry of the compounds. The prepared metal complexes presented promoted efficiency versus the screened bacterial (Escherichia Coli and Staphylococcus aureus) antibacterial efficacy against (Staphylococcus aureus, Salmonella spp., E. coli, Vibrio spp., Pseudomona
... Show MoreTwo Schiff bases, namely, 3-(benzylidene amino) -2-thioxo-6-methyl 2,5-dihydropyrimidine-4(3H)-one (LS])and 3-(benzylidene amino)-6-methyl pyrimidine 4(3H, 5H)-dione(LA)as chelating ligands), were used to prepare some complexes of Cr(III), La(III), and Ce(III)] ions. Standard physico-chemical procedures including metal analysis M%, element microanalysis (C.H.N.S) , magnetic susceptibility, conductometric measurements, FT-IR and UV-visible Spectra were used to identify Metal (III) complexes and Schiff bases (LS) and (LA). According to findings, a [Cr(III) complex] showed six coordinated octahedral geometry, while [La(III), and Ce(III) complexes]were structured with coordination number seven. Schiff's bases a
... Show MoreCommercial graphite (CGT) powder was used as an adsorbent surface for cationic dye, Janus green (JG), from aqueous solutions. This study aims to highlight the practical significance of using inexpensive CGT as an efficient adsorbent for the removal of JG dye from industrial wastewater. CGT was characterized by Fourier transform infrared spectroscopy, scanning electron microscopy, and X-ray diffraction. The adsorption process was investigated by examining parameters like the weight of the adsorbent, contact time, and temperature. Pseudo-second-order kinetic (PSO), pseudo-first-order, and intraparticle diffusion were used for analyzing the kinetic data. JG dye's adsorption kinetics fit the PSO kinetic model well (R2= 0.999). Furthermo
... Show MoreA simple chemistry method approach was used to synthesise new ligand derivate from L-ascorbic acid and its complexes. All of them were water-soluble and are used quite extensively in the medical and pharmaceutical fields. This study synthesised the new ligand derivative from L-ascorbic acid-base using the following steps: A 5,6-O-isopropylidene-L-ascorbic acid was prepared by reacting dry acetone with L-ascorbic acid followed by reacting it with trichloroacetic acid to yield [chloro(carboxylic)methylidene]-5,6-O-isopropylidene-L-ascorbic acid in the second stage. In the third stage, the derivative was reacted with (methyl(6-methyl-2-pyridylmethyl)amine to create a new ligand (ONMILA). This novel ligand was identified using a number
... Show More