Data scarcity is a major challenge when training deep learning (DL) models. DL demands a large amount of data to achieve exceptional performance. Unfortunately, many applications have small or inadequate datasets for training DL frameworks. Labeled data usually has to be produced by manual labeling, which typically involves human annotators with extensive background knowledge. This annotation process is costly, time-consuming, and error-prone. Every DL framework is usually fed a significant amount of labeled data so that it can learn representations automatically. In general, more data yields a better DL model, although performance is also application dependent. This issue is the main reason many applications dismiss the use of DL. Having sufficient data is the first step toward any successful and trustworthy DL application. This paper presents a holistic survey of state-of-the-art techniques for training DL models that address three challenges: small datasets, imbalanced datasets, and lack of generalization. The survey starts by listing the learning techniques. Next, the types of DL architectures are introduced. After that, state-of-the-art solutions to the lack of training data are listed, such as Transfer Learning (TL), Self-Supervised Learning (SSL), Generative Adversarial Networks (GANs), Model Architecture (MA), Physics-Informed Neural Network (PINN), and Deep Synthetic Minority Oversampling Technique (DeepSMOTE). These solutions are followed by related tips on data acquisition prior to training, as well as recommendations for ensuring the trustworthiness of the training dataset.
The survey ends with a list of applications that suffer from data scarcity, and several alternatives are proposed for generating more data in each application, including Electromagnetic Imaging (EMI), Civil Structural Health Monitoring, Medical Imaging, Meteorology, Wireless Communications, Fluid Mechanics, Microelectromechanical Systems, and Cybersecurity. To the best of the authors’ knowledge, this is the first review that offers a comprehensive overview of strategies to tackle data scarcity in DL.
This is the first record of a new species of cyanobacteria, Westiellopsis akinetica, in the Iraqi environment. Samples were collected in June 2013, and its existence had not been documented before. We isolated and purified this species ten years ago in Iraq, but we could not identify it accurately using any of the taxonomic handbooks, because its features differ from those of the other documented species in the available taxonomic literature. In its morphological characteristics it resembled many species, such as Fischerella muscicola, Fischerella thermalis, Westiellopsis biateralis SA16, Westiellopsis interrupta, Westiellopsis persica SA33, Westiellopsis prolifica and Symphyonema bifilamentata. Describing a new species of the Westiellops...
This research studies the electrical conductivity of topsoil regions within Baghdad City. The research included measuring the Electrical Conductivity (EC) of the aqueous extract of dissolved soil material for the top (0-30 cm) soil layer of the study area. Because EC values in principle increase with the amount of dissolved salts, the aim of this research is to predict the amount and distribution of soil contamination with salts, represented by the Salt Index; this factor was calculated for each representative soil sample taken from the region to a depth of 30 cm. Laboratory EC test values measured using a digital solution EC meter for the ex...
Reliability analysis methods are used to evaluate the safety of reinforced concrete structures by evaluating the limit state function 𝑔(𝑋𝑖). For implicit limit state functions and nonlinear analysis, advanced reliability analysis methods are needed. Monte Carlo simulation (MCS) can be used in this case; however, as the number of input variables increases, the time required for MCS also increases, making it a time-consuming method, especially for complex problems with implicit performance functions. In such cases, MCS-based FORM (First Order Reliability Method) and Artificial Neural Network-based FORM (ANN-FORM) have been proposed as alternatives. However, it is important to note that both MCS-FORM and ANN-FORM can also be time-con...
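As a rough illustration of the crude Monte Carlo approach mentioned in this abstract, the sketch below estimates a failure probability P[g(X) <= 0] by sampling. It is not the authors' model: the linear limit state g = R - S, the normal distributions for resistance R and load effect S, and the names `mc_failure_probability`, `sample_rs` and `g_linear` are all assumptions chosen so that the estimate can be checked against a closed-form value.

```python
import math
import numpy as np

def mc_failure_probability(g, sampler, n_samples=200_000, seed=0):
    """Crude Monte Carlo estimate of P[g(X) <= 0] and its standard error."""
    rng = np.random.default_rng(seed)
    x = sampler(rng, n_samples)          # shape (n_samples, n_variables)
    failures = g(x) <= 0.0               # failure when the limit state is non-positive
    pf = float(failures.mean())
    se = math.sqrt(pf * (1.0 - pf) / n_samples)
    return pf, se

def sample_rs(rng, n):
    """Hypothetical inputs (not from the study): independent normal R and S."""
    r = rng.normal(300.0, 30.0, n)       # resistance, assumed mean 300, std 30
    s = rng.normal(200.0, 40.0, n)       # load effect, assumed mean 200, std 40
    return np.column_stack([r, s])

def g_linear(x):
    """Assumed explicit limit state g = R - S (the abstract's g is implicit)."""
    return x[:, 0] - x[:, 1]

pf, se = mc_failure_probability(g_linear, sample_rs)

# For this linear normal case the reliability index is
# beta = (300 - 200) / sqrt(30^2 + 40^2) = 2, so Pf = Phi(-2).
pf_exact = 0.5 * math.erfc(2.0 / math.sqrt(2.0))

print(f"MCS estimate: {pf:.5f} (standard error {se:.5f}), exact: {pf_exact:.5f}")
```

The standard error shrinks only as the inverse square root of the sample count, and every sample needs one evaluation of g. When g is implicit and each evaluation is a nonlinear structural analysis, that cost is what makes plain MCS slow.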
This study aimed to evaluate the effect of spraying nano chitosan loaded with NPK fertilizer, and nettle leaf and green tea extracts, on the growth and productivity of potato for the spring and fall seasons of 2021. It was conducted at a private farm in Wasit Governorate, Iraq, as a factorial experiment (5 × 5) within a randomized complete block design with three replicates. The first factor included spraying with four concentrations of chitosan nanoparticles loaded with NPK fertilizer (0, 10, 15 and 20%) in addition to a chemical fertilization treatment; the second factor was spraying nettle leaf extract at 25 and 35 g.L-1 and green tea extract at 2 and 4 g.L-1, in addition to the control treatment of spraying with distilled water only. The...
Background: Anterior knee pain is an important chief complaint of patients with knee osteoarthritis due to patellofemoral pathology. Denervation of the pain receptors can be achieved by circumferential denervation of the patellar area using electrocautery.
Objectives: The aim of the current study is to assess pain after total knee arthroplasty (TKA) with patelloplasty, with and without circumferential denervation via electrocautery, at a minimum follow-up of 1 year for each patient.
Type of the study: Cross-sectional study.
Methods: Thirty-five patients, with a mean age of about 62.8 years, were enrolled in this pros...
Endothelin-I (ET-I) is one of the potent vasoconstrictors secreted from endothelial cells when needed. Many studies have revealed elevated serum ET-I in human diabetes and microangiopathies. Since insulin resistance is found in both diabetic and pre-diabetic cases, many risk factors beyond obesity and inflammation have been proposed. The current study aims to demonstrate the association of serum ET-I and asymmetric dimethylarginine (ADMA) with insulin resistance in type 2 diabetes mellitus (T2DM). Sera from 73 subjects aged 40-60 years were included (35 control subjects and 38 with T2DM for more than 7 years), with body mass index (BMI) ≤ 25 for control volunteers and BMI ≥ 25 for obesity and diabetes...
Introduction: Cardiovascular diseases are the main cause of death among type 2 diabetic patients. Higher levels of plasminogen activator urokinase receptor have been found to predict morbidity and mortality across acute and chronic diseases in the general population. This study aims to explore the role of serum plasminogen activator urokinase receptor levels as a cardiometabolic risk factor among type 2 diabetic Iraqi patients. Methods: Seventy type 2 diabetic patients (40 male and 30 female; mean age: 46.20±7.56 years) participated in this study; 35 patients had cardiovascular disease and 35 did not; their age range was 40-55 years. In addition, 30 apparently healthy individuals were selected a...