Data scarcity is a major challenge when training deep learning (DL) models. DL demands a large amount of data to achieve exceptional performance, yet many applications have too little data to train DL frameworks. Labeled data usually has to be produced by manual labeling, which typically involves human annotators with extensive background knowledge. This annotation process is costly, time-consuming, and error-prone. A DL framework normally needs a significant amount of labeled data to learn representations automatically. In general, more data yields a better DL model, although performance is also application dependent. This issue is the main reason many applications dismiss the use of DL. Having sufficient data is the first step toward any successful and trustworthy DL application. This paper presents a holistic survey of state-of-the-art techniques for training DL models that address three challenges: small datasets, imbalanced datasets, and lack of generalization. The survey starts by listing the learning techniques. Next, the types of DL architectures are introduced. After that, state-of-the-art solutions to the lack of training data are listed, such as Transfer Learning (TL), Self-Supervised Learning (SSL), Generative Adversarial Networks (GANs), Model Architecture (MA), Physics-Informed Neural Network (PINN), and Deep Synthetic Minority Oversampling Technique (DeepSMOTE). These solutions are followed by related tips on data acquisition prior to training, as well as recommendations for ensuring the trustworthiness of the training dataset. 
The survey ends with a list of applications that suffer from data scarcity, and several alternatives are proposed for generating more data in each application, including Electromagnetic Imaging (EMI), Civil Structural Health Monitoring, Medical Imaging, Meteorology, Wireless Communications, Fluid Mechanics, Microelectromechanical Systems, and Cybersecurity. To the best of the authors’ knowledge, this is the first review that offers a comprehensive overview of strategies to tackle data scarcity in DL.
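DeepSMOTE, named among the solutions in the abstract above, is a deep-learning variant of the classic SMOTE idea: new minority-class samples are created by interpolating between a minority sample and one of its nearest minority neighbours. The following is a minimal sketch of that interpolation step only. It is not DeepSMOTE itself and not the survey authors' implementation; the function name `smote_like_oversample`, its parameters, and the toy data are assumptions made for illustration.

```python
import math
import random


def smote_like_oversample(minority, n_new, k=3, seed=0):
    """Create n_new synthetic minority samples by interpolating between neighbours."""
    if len(minority) < 2:
        raise ValueError("need at least two minority samples to interpolate")
    rng = random.Random(seed)
    k = min(k, len(minority) - 1)
    synthetic = []
    for _ in range(n_new):
        i = rng.randrange(len(minority))
        base = minority[i]
        # Rank the other minority points by Euclidean distance to the base point.
        others = [p for j, p in enumerate(minority) if j != i]
        others.sort(key=lambda p: math.dist(base, p))
        neighbour = rng.choice(others[:k])
        # Pick a random point on the segment between base and neighbour.
        lam = rng.random()
        synthetic.append([b + lam * (n - b) for b, n in zip(base, neighbour)])
    return synthetic


# Hypothetical 2-D minority-class samples, invented for illustration only.
minority = [[1.0, 2.0], [1.5, 1.8], [1.2, 2.4], [0.9, 2.1]]
new_points = smote_like_oversample(minority, n_new=6)
```

Because each synthetic point lies on a segment between two real minority samples, the new data stays inside the region the minority class already occupies, which is the reason this kind of oversampling is often preferred over simply duplicating samples.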
Abstract:
The current business environment is changing rapidly, and this is reflected in the performance of organizations that wish to survive. A purely reactive style is no longer enough for organizations to deal with their environment, and it quickly began to lose its appeal with the emergence of a contemporary view of the business environment as a set of parts interacting with each other, together with a behavioral concept that covers all dimensions of performance. It is therefore imperative for organizations to adopt a system that influences these variables and interacts with them positively, through the development of strategic plans and the use of implementation and follow-up strategies to ensure the effectiveness of the method for meas
The aim of this study is to investigate the kinetics of copper removal from aqueous solutions using an electromembrane extraction (EME) system. To achieve this, a unique electrochemical cell design was adopted, comprising two glass chambers, a supported liquid membrane (SLM), a graphite anode, and a stainless-steel cathode. The SLM consisted of a polypropylene flat membrane infused with 1-octanol as a solvent and bis(2-ethylhexyl) phosphate (DEHP) as a carrier. The impact of various factors on the kinetic rate constant was outlined, including the applied voltage, the initial pH of the donor phase solution, and the initial copper concentration. The results demonstrated a significant influence of the applied voltage on enhancing the rate of c
Two-dimensional buoyancy-induced flow and heat transfer inside a square enclosure partially occupied by copper metallic foam, subjected to symmetric side cooling and constant heat flux bottom heating, was investigated numerically. The Finite Element Method was employed to solve the governing partial differential equations of the flow field, and the Local Thermal Equilibrium model was used for the energy equation. The system boundaries were defined as a lower wall heated by constant heat flux, cooled lateral walls, and an insulated top wall. The three parameters selected to conduct the study are heater length (7 ≤
The formulas of Ijarah (leasing) and Ijarah ending with ownership are among the investment formulas used in Islamic banks, so this research sheds light on them in order to benefit from the experiences of the banks in the research sample. The research aims to find a reliable way for Iraqi Islamic banks, namely leasing and leasing ending with ownership, to invest their money without usurious interest. The research problem arises from the lack of awareness among Iraqi Islamic banks of how to work with different Islamic financing formulas, and from their inability to invest their money through their administrations' adoption of different formulas, including leasing, which is reflected in the decrease and fluctuation of their profits. Theref
Background: There are many congenital anomalies associated with cleft lip and/or palate. This research studies the prevalence of congenitally missing teeth and supernumerary teeth in this population group. Materials and Method: One hundred eight Iraqi patients with cleft lip and/or palate participated in this study (57 male, 51 female), 3-12 years of age. Twenty-six of them, aged 6-12 years, had orthopantomograms, which were inspected for congenitally missing teeth and supernumerary teeth. Patients aged 3-5 years were checked for congenitally missing teeth by clinical examination, with careful confirmation that the teeth had not been lost due to caries or trauma. Results: There were 19 (73.076%) patients with 41 congenitally missing tee
rhabditid Mesorhabditis franseni Fuchs, 1933 (Family, Mesorhabditidae) and pratylenchid nematode Pratylenchus goodeyi Sher and Allen, 1953 (Family, Pratylenchidae). They were characterized at the molecular level. All specimens of both genera were cultured and reproduced for DNA extraction. M. franseni (IRQ.ZAh2 PP528819.1 isolate) was characterized. P. goodeyi (IRQ.ZAh5 PP535537 isolate) was also characterized. Selected specimens of these two species were molecularly characterized using partial ITS-rRNA gene sequences. The ITS-rRNA sequence of the IRQ.ZAh2 PP528819.1 isolate had 98.62%-100% sequence homology with the ITS-rRNA sequences of M. franseni available in the NCBI database, while the ITS-rRNA sequence of the IRQ.ZAh5 PP535537 isolate h
One of the most common liver diseases worldwide is fatty liver disease, which comprises alcoholic and non-alcoholic fatty liver. Non-Alcoholic Fatty Liver Disease (NAFLD) affects about one-fourth of the general population worldwide. Retinol binding protein 4 (RBP4) is an adipokine, mainly synthesized and secreted by the liver and adipose tissue. RBP4 acts as a transporter that binds specifically to retinol and carries it from the liver to other tissues. Visfatin is an adipocytokine mainly produced by visceral fat tissue, skeletal muscle, and the liver. Vitamin A is absorbed and transported as retinyl esters to the liver, then hydrolyzed to the retinol form and stored in hepatic stellate cells (HSCs) after being re-esterified with rigly
Background: Recurrent Aphthous Stomatitis (RAS) is the most common painful oral mucosal disease, affecting approximately 20% of the population. RAS presents with a wide spectrum of severity, ranging from a minor nuisance to complete debility. Many factors are thought to be involved in its etiology, and these may at the same time have a direct or indirect impact on the oxidant/antioxidant system and trigger free radical production. The aim of this study was to determine the possible association between oxidant/total antioxidant status and RAS. Subjects, materials and methods: The study consisted of thirty patients with recurrent aphthous stomatitis and thirty healthy controls from which saliva and blood samples we
The current research aims to build a training program for chemistry teachers based on the knowledge economy and to examine its impact on the productive thinking of their students. To achieve the objectives of the research, the following hypothesis was formulated:
There is no statistically significant difference at the (0.05) level of significance between the average grades of the students participating in the training program according to the knowledge economy and the average grades of the students who did not participate in the training program in the test of productive thinking. The study sample consisted of (288) second intermediate grade students divided into (152) for the control group
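A null hypothesis of this form, no difference between two group means at the 0.05 level, is commonly checked with a two-sample test. The sketch below uses Welch's t statistic with a normal approximation for the p-value, which is only reasonable for large groups. The abstract does not say which test the authors used, so the choice of test, the function name `welch_test`, and the scores are all assumptions made for illustration.

```python
import math
from statistics import NormalDist, mean, variance


def welch_test(a, b):
    """Return Welch's t statistic, its degrees of freedom, and an approximate two-sided p-value."""
    va, vb = variance(a) / len(a), variance(b) / len(b)
    t = (mean(a) - mean(b)) / math.sqrt(va + vb)
    # Welch-Satterthwaite approximation of the degrees of freedom.
    df = (va + vb) ** 2 / (va ** 2 / (len(a) - 1) + vb ** 2 / (len(b) - 1))
    # Normal approximation to the t distribution (assumes large groups).
    p = 2 * (1 - NormalDist().cdf(abs(t)))
    return t, df, p


# Hypothetical productive-thinking scores, invented for illustration only;
# they are not the study's data and the group sizes are not the study's.
experimental = [15 + (i % 5) for i in range(100)]
control = [13 + (i % 5) for i in range(100)]

t_stat, dof, p_value = welch_test(experimental, control)
reject_null = p_value < 0.05  # True means the null hypothesis is rejected
```

With real data, a library routine that uses the exact t distribution would be preferable to the normal approximation, especially when the groups are small.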