Data scarcity is a major challenge when training deep learning (DL) models. DL demands a large amount of data to achieve exceptional performance, yet many applications have too little data, or data of inadequate quality, to train DL frameworks. Labeled data usually has to be produced by manual labeling, which typically involves human annotators with extensive domain knowledge. This annotation process is costly, time-consuming, and error-prone. DL frameworks are usually fed a significant amount of labeled data so that they can learn representations automatically. In general, more data yields a better DL model, although performance is also application dependent. This issue is the main barrier that leads many applications to dismiss the use of DL. Having sufficient data is the first step toward any successful and trustworthy DL application. This paper presents a holistic survey of state-of-the-art techniques for training DL models under three challenges: small datasets, imbalanced datasets, and lack of generalization. The survey starts by listing the learning techniques. Next, the types of DL architectures are introduced. After that, state-of-the-art solutions to the lack of training data are listed, such as Transfer Learning (TL), Self-Supervised Learning (SSL), Generative Adversarial Networks (GANs), Model Architecture (MA), Physics-Informed Neural Network (PINN), and Deep Synthetic Minority Oversampling Technique (DeepSMOTE). These solutions are followed by related tips on data acquisition prior to training, as well as recommendations for ensuring the trustworthiness of the training dataset.
The survey ends with a list of applications that suffer from data scarcity, including Electromagnetic Imaging (EMI), Civil Structural Health Monitoring, Medical Imaging, Meteorology, Wireless Communications, Fluid Mechanics, Microelectromechanical Systems, and Cybersecurity; for each application, several alternatives are proposed to generate more data. To the best of the authors’ knowledge, this is the first review that offers a comprehensive overview of strategies to tackle data scarcity in DL.
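The oversampling idea behind the SMOTE family mentioned above can be sketched briefly. The example below is a minimal illustration of classic SMOTE-style interpolation in raw feature space, not DeepSMOTE itself (which the survey lists and which interpolates in a learned latent space). The function name `smote_like_oversample`, its parameters, and the toy data are all hypothetical and are not taken from the paper.

```python
import random


def smote_like_oversample(minority, n_new, k=3, seed=0):
    """Create n_new synthetic minority samples (illustrative sketch only).

    Each synthetic point lies on the segment between a randomly chosen
    minority sample and one of its k nearest minority neighbours.
    """
    if len(minority) < 2:
        raise ValueError("need at least two minority samples")
    rng = random.Random(seed)
    k = min(k, len(minority) - 1)

    def sq_dist(a, b):
        # Squared Euclidean distance between two feature vectors.
        return sum((x - y) ** 2 for x, y in zip(a, b))

    synthetic = []
    for _ in range(n_new):
        i = rng.randrange(len(minority))
        base = minority[i]
        # k nearest neighbours of `base` among the other minority samples.
        others = [p for j, p in enumerate(minority) if j != i]
        neighbours = sorted(others, key=lambda p: sq_dist(base, p))[:k]
        neighbour = rng.choice(neighbours)
        gap = rng.random()  # interpolation factor in [0, 1)
        synthetic.append([b + gap * (n - b) for b, n in zip(base, neighbour)])
    return synthetic


# Toy minority class with two features (made-up values).
minority = [[1.0, 1.0], [1.2, 0.9], [0.8, 1.1], [1.1, 1.3]]
new_points = smote_like_oversample(minority, n_new=6)
```

Because every synthetic point is a convex combination of two real minority samples, the new points stay inside the range spanned by the original minority class, which is why this family of methods is used to rebalance small, imbalanced datasets without inventing out-of-range values.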
Tight reservoirs have attracted the interest of the oil industry in recent years owing to their significant impact on global oil production. Producing from these reservoirs poses several challenges because of their low to extra-low permeability and very narrow pore throat radius. Selecting a development strategy for these reservoirs, including horizontal well placement, hydraulic fracture design, well completion, smart production programs, and wellbore stability, requires accurate characterization of their geomechanical parameters. Geomechanical properties, including uniaxial compressive strength (UCS), static Young’s modulus (Es), and Poisson’s ratio (υs), were measured experimentally using both static and dynamic met
In this work, two different laser dye solutions were used to host highly pure silicon nitride nanoparticles as scattering centers to fabricate random gain media. The laser dye was dissolved in three different solvents (ethanol, methanol, and acetone), and the final results were obtained for methanol only. The silicon nitride nanoparticles were synthesized by the dc reactive magnetron sputtering technique with an average particle size of 35 nm. The random gain medium was made as a solid rod with high spectral efficiency and low production cost. Optical emission with narrow linewidth was detected at 532-534 nm when 9 mg of silicon nitride nanoparticles were added to the 10^-5 M dye solution. The FWHM of 0.3 and 3.52 nm was determined for Rhodamine B and
The distribution of light intensity in a flat photobioreactor for microalgae cultivation, as a design step toward the production of bio-renewable energy, was addressed in the current study. Five sizes of bioreactors at specific distances from the main light source were adopted as independent variables in the experimental design model. The results showed that the bioreactor’s location relative to the light source determines the nature of the light intensity distribution in the reactor body. However, the cross-sectional area plays an important role in determining the suitable location of the reactor to achieve the required light homogeneity. This area could even change the expected response of the light passing through the reactor if the Beer-Lambert law is adopted.
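For reference, the Beer-Lambert law mentioned in this abstract predicts an exponential decay of light intensity with depth, I = I0 * exp(-a * d). The sketch below evaluates that relation under the simplifying assumption of a homogeneous culture with a single lumped attenuation coefficient. The function name and all numeric values are hypothetical illustrations, not data or methods from the study.

```python
import math


def transmitted_intensity(i0, attenuation_coeff, path_length):
    """Beer-Lambert law: I = I0 * exp(-a * d).

    i0                -- incident intensity at the reactor surface
    attenuation_coeff -- lumped attenuation coefficient a, in 1/m (assumed constant)
    path_length       -- depth d travelled through the culture, in m
    """
    return i0 * math.exp(-attenuation_coeff * path_length)


# Hypothetical values: 100 intensity units at the surface, a = 20 1/m.
depths_m = [0.0, 0.05, 0.10]
profile = [transmitted_intensity(100.0, 20.0, d) for d in depths_m]
```

Under this idealized model, intensity depends only on the optical path length, so any dependence on cross-sectional area or reactor position, as the abstract reports, would appear as a deviation from the computed profile.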
This work determines the cyclic decomposition (c.d.) for the groups ), where the prime is greater than 9. We also compute the Artin characters (A.ch.) and the Artin indicator (A.ind.) for the same groups, after first computing the conjugacy classes, the cyclic subgroups, the ordinary character table (o.ch.ta.), and the rational-valued character table for each group.
Ag nanoparticles were prepared from Ag metal in distilled water using an Nd:YAG laser at two laser energies (100 and 600 mJ) with 200 pulses. The effect of the preparation conditions on the structural characteristics of the nanoparticles was studied, followed by the effect of the nanoparticles on the killing rate of two types of bacteria (Staph and E. coli). The goal is to prepare nanoparticles that can be used effectively to kill bacteria.
Mobile wireless sensor networks (WSNs) have attracted great interest recently because they provide good, low-cost solutions in multiple fields. The Internet of Things (IoT) connects different technologies such as sensing, communication, networking, and cloud computing. It can be used in monitoring, health care, and smart cities. The most suitable infrastructure for IoT applications is the wireless sensor network. One of the main challenges of WSNs is the power limitation of the sensor node. Clustering is an effective way to reduce the power consumed during the transmission of the sensed data to a central point called a Base Station (BS). In this paper, efficient clustering protocols are proposed to prolong network lifetime. A kern