Data scarcity is a major challenge when training deep learning (DL) models. DL demands a large amount of data to achieve exceptional performance. Unfortunately, many applications have small or inadequate data to train DL frameworks. Usually, manual labeling is needed to provide labeled data, which typically involves human annotators with a vast background of knowledge. This annotation process is costly, time-consuming, and error-prone. Usually, every DL framework is fed by a significant amount of labeled data to automatically learn representations. Ultimately, a larger amount of data would generate a better DL model and its performance is also application dependent. This issue is the main barrier for many applications dismissing the use of DL. Having sufficient data is the first step toward any successful and trustworthy DL application. This paper presents a holistic survey on state-of-the-art techniques to deal with training DL models to overcome three challenges including small, imbalanced datasets, and lack of generalization. This survey starts by listing the learning techniques. Next, the types of DL architectures are introduced. After that, state-of-the-art solutions to address the issue of lack of training data are listed, such as Transfer Learning (TL), Self-Supervised Learning (SSL), Generative Adversarial Networks (GANs), Model Architecture (MA), Physics-Informed Neural Network (PINN), and Deep Synthetic Minority Oversampling Technique (DeepSMOTE). Then, these solutions were followed by some related tips about data acquisition needed prior to training purposes, as well as recommendations for ensuring the trustworthiness of the training dataset. The survey ends with a list of applications that suffer from data scarcity, several alternatives are proposed in order to generate more data in each application including Electromagnetic Imaging (EMI), Civil Structural Health Monitoring, Medical imaging, Meteorology, Wireless Communications, Fluid Mechanics, Microelectromechanical system, and Cybersecurity. To the best of the authors’ knowledge, this is the first review that offers a comprehensive overview on strategies to tackle data scarcity in DL.
Field experiments were carried out for the autumn season 2022- 2021 in the field of College of Agricultural Engineering Sciences - University of Baghdad - Jadiriyah Complex –Station A- to study a combination of organic fertilizer (Vermicompost) and cow manure as well as a control treatment (soil only) intertwined with Spraying with silicon, calcium and distilled water (control) in the growth and production of three cultivars of beet (Cylindra, Dark Red, Red) within the design of Completely Randomized Block Design at three replications, The number of treatments was 9 for each replicate. The means were compared according to the least significant difference (L.S.D) at a probability lev
The second half of the last century witnessed a great scientific revolution that was able to bring about wide changes in various fields, including the field of physical education, which plays a fundamental role in the process of change for the better, and which knocked all the doors of modern science in various aspects and from this perspective we see that students have different capabilities And interests and motives, which require providing a differentiated education, and this depends on the necessity of knowing each student and on the school’s ability to know appropriate strategies for teaching each student so there is no single way to teach so the research problem comes in experimenting with an educational method that works on
... Show MoreThe research aimed at designing teaching sessions using the self-scheduling strategy with a competitive style in learning handball as well as identifying differences between pre and post tests in both groups in learning short and long passes in handball. The researchers used the experimental method on 2nd-grade secondary school students. The researchers concluded using the self-scheduling strategy due to its positive effect on learning short and long handball passes in handball. Finally, the researchers recommended applying strategies and styles in teaching different school levels as well as making similar studies using teaching strategies and styles for learning handball skills in students.
Recommender Systems are tools to understand the huge amount of data available in the internet world. Collaborative filtering (CF) is one of the most knowledge discovery methods used positively in recommendation system. Memory collaborative filtering emphasizes on using facts about present users to predict new things for the target user. Similarity measures are the core operations in collaborative filtering and the prediction accuracy is mostly dependent on similarity calculations. In this study, a combination of weighted parameters and traditional similarity measures are conducted to calculate relationship among users over Movie Lens data set rating matrix. The advantages and disadvantages of each measure are spotted. From the study, a n
... Show MoreAdministrative procedures in various organizations produce numerous crucial records and data. These
records and data are also used in other processes like customer relationship management and accounting
operations.It is incredibly challenging to use and extract valuable and meaningful information from these data
and records because they are frequently enormous and continuously growing in size and complexity.Data
mining is the act of sorting through large data sets to find patterns and relationships that might aid in the data
analysis process of resolving business issues. Using data mining techniques, enterprises can forecast future
trends and make better business decisions.The Apriori algorithm has bee
The aim of this research is to study the effect of welded joint design (Butt joint and lap joint) on thejoint strength during tension and fatigue loading with different current of welding (40,50,60,70,80) ^per, and different type of wire welding. The result of this research is showed that the effect of fatigue loading on the type of joint is more than the effect of tension loading on it. And the butt joint welding is better than the lap joint welding during the fatigue loaded.The experimental results of the effect of W'elding current showed that more increasing and more decreasing the value of the heat input, during the welding was found to produce mechanical brittleness on the buttjoint welding during the static and dynamic loading. Also i
... Show Moremixtures of cyclohexane + n-decane and cyclohexane + 1-pentanol have been measured at 298.15, 308.15, 318.15, and 328.15 K over the whole mole fraction range. From these results, excess molar volumes, VE , have been calculated and fitted to the Flory equations. The VE values are negative and positive over the whole mole fraction range and at all temperatures. The excess refractive indices nE and excess viscosities ?E have been calculated from experimental refractive indices and viscosity measurements at different temperature and fitted to the mixing rules equations and Heric – Coursey equation respectively to predict theoretical refractive indices, we found good agreement between them for binary mixtures in this study. The variation of th
... Show More