Data scarcity is a major challenge when training deep learning (DL) models. DL demands a large amount of data to achieve exceptional performance. Unfortunately, many applications have small or inadequate data to train DL frameworks. Usually, manual labeling is needed to provide labeled data, which typically involves human annotators with a vast background of knowledge. This annotation process is costly, time-consuming, and error-prone. Usually, every DL framework is fed by a significant amount of labeled data to automatically learn representations. Ultimately, a larger amount of data would generate a better DL model and its performance is also application dependent. This issue is the main barrier for many applications dismissing the use of DL. Having sufficient data is the first step toward any successful and trustworthy DL application. This paper presents a holistic survey on state-of-the-art techniques to deal with training DL models to overcome three challenges including small, imbalanced datasets, and lack of generalization. This survey starts by listing the learning techniques. Next, the types of DL architectures are introduced. After that, state-of-the-art solutions to address the issue of lack of training data are listed, such as Transfer Learning (TL), Self-Supervised Learning (SSL), Generative Adversarial Networks (GANs), Model Architecture (MA), Physics-Informed Neural Network (PINN), and Deep Synthetic Minority Oversampling Technique (DeepSMOTE). Then, these solutions were followed by some related tips about data acquisition needed prior to training purposes, as well as recommendations for ensuring the trustworthiness of the training dataset. The survey ends with a list of applications that suffer from data scarcity, several alternatives are proposed in order to generate more data in each application including Electromagnetic Imaging (EMI), Civil Structural Health Monitoring, Medical imaging, Meteorology, Wireless Communications, Fluid Mechanics, Microelectromechanical system, and Cybersecurity. To the best of the authors’ knowledge, this is the first review that offers a comprehensive overview on strategies to tackle data scarcity in DL.
This study was achieved to investigate the accumulation of some heavy metals included: Cadmium, Lead and Nickel in the tissues (gill, intestine, liver, muscles and skin) of Silurus triostegus Heckel, 1843 (Siluriformes, Siluridae) and its larval stage of the nematode Contracaecum sp. (Rhabditida, Anisakidae). As well as to assess the infection patterns of Contracaecum among S. triostegus specimens which were purchased fresh from the local market in Baghdad. One hundred and nine nematodes specimens in larval stage were recovered from the fish host; the overall prevalence of Contracaecum sp. was 38.6%. The sex of the host was not significantly (P ˃ 0.05) associated with the infection of this nematode.
Results showed that the overall me
It is often needed in demographic research to modern statistical tools are flexible and convenient to keep up with the type of data available in Iraq in terms of the passage of the country far from periods of war and economic sanctions and instability of the security for a period of time . So, This research aims to propose the use of style nonparametric splines as a substitute for some of the compounds of analysis within the model Lee-Carter your appreciation rate for fertility detailed variable response in Iraq than the period (1977 - 2011) , and then predict for the period (2012-2031). This goal was achieved using a style nonparametric decomposition of singular value vehicles using the main deltoid , and then estimate the effect of time-s
... Show MoreWe have provided in this research model multi assignment with fuzzy function goal has been to build programming model is correct Integer Programming fogging after removing the case from the objective function data and convert it to real data .Pascal triangular graded mean using Pascal way to the center of the triangular.
The data processing to get rid of the case fogging which is surrounded by using an Excel 2007 either model multi assignment has been used program LNDO to reach the optimal solution, which represents less than what can be from time to accomplish a number of tasks by the number of employees on the specific amount of the Internet, also included a search on some of the
... Show Moren this research, some thermophysical properties of ethylene glycol with water (H2O) and two solvent mixtures dimethylformamide/ water (DMF + H2O) were studied. The densities (ρ) and viscosities (η) of ethylene glycol in water and a mixed solvent dimethylformamide (DMF + H2O) were determined at 298.15 K, t and a range of concentrations from 0.1 to1.0 molar. The ρ and η values were subsequently used to calculate the thermodynamics of mixing including the apparent molar volume (ϕv), partial molar volume (ϕvo) at infinite dilution. The solute-solute interaction is presented by Sv results from the equation ∅_v=ϕ_v^o+S_v √m. The values of viscosity (B) coefficients and Falkenhagen coefficient(A) of the Jone-Dole equation and Gibbs free
... Show MoreThe compound [K1] was synthesized from the reaction of dichloromethane with linear alkyl benzene (Lab9) using ethanol as a solvent, and from(chloro methyl)-4-nonylbenzene) [K1] it was possible to synthesize the compound Z(4-(nonan-3-yl)phenyl) methane amine) [K2] containing the amine group by synthesized from [K2] reaction with appropriate phenolic aldehydes and using Ethanol as a solvent in the preparation of vinyl chloride4-(((4-nonylbenzyl)imino)methyl)phenol-4-(((4-nonylbenzyl)imino methyl)benzene-1,3diol) [K3-K4] bases has been used. Preparation of a number of Phenolic polymers4-(2- hydroxy-3.5-dimethylbenzyl)-2-methyl-6-(((4-4-(2hyroxy-3, 5-dimethylbenzyl)-2-methyl-6(((4 nonylbenzyl) imino) methyl) benzene-phenolnonylbenzyl) imino) me
... Show MoreThe study in duded isolation and identification of microbial isolates from oral cavity to 10 volunteers, diagnosed within the three groups: Staphylococcus aureus, Staphylococcus epidermidis, Streptococcus spp. and Candida albicans . The sensitivity test of all isolates bacteria Streptococcus spp. , S. aureus and S. epidermidis showed high resistance to Ampicillin(100)%,followed Methicillin (88.88)% and Amoxicillin / clavulanic acid(77.77)%, while the resistance for each of Vancomycin and Amoxicillin were (66.66)%, and the resistance to Erythromycin and Pencillin (55.55)% to each of them. The results showed less resistance to Trimethoprim (22.22)% and Cefalotine (11.11)% of all bacteria isolate. Investigation of the pre
... Show MoreThe aim of study was making comparison in some kinematics variables in (100) meter butterfly swimming to first and second ranking in championship 2003 Espana, so noticed there is no such like this study in our country in comparison study for international champions therefore not specific and scientific discovering to these advanced levels, also the researchers depend on group of kinematics variables when the comparison making and it was included (50 meter the first, 50 meter the second, the differences between the first (50) meter and the second , more over basic variables in (100) meter butterfly , after having the results and treat it statistically the researchers reaches to two conclusions which was: • Success the first rank in startin
... Show MoreIn this work, some of new 2-benzylidenehydrazinecarbothioamide derivatives have been prepared by condensation of thiosemicarbazide and different substituted aromatic benzaldehydes in presence of glacial acetic acid to give compounds (1-6), these compounds have characterized by its physical properties and spectroscopic methods. This work also included theoretical study to prove the ability of these compounds as corrosion inhibitors; The program package of Gaussian 09W with its graphical user interface GaussView 5.0 had used for this purpose; the methods of Density Functional Theory (DFT) with basis set of 6-311G (d,p) / hybrid function of B3LYP and semiempirical method of PM3 have been used, the study included theoretical simulation
... Show More