A survey on deep learning tools dealing with data scarcity: definitions, challenges, solutions, tips, and applications

Ali H. Al-Timemy

doi:10.1186/s40537-023-00727-2

Details

Publication Date

Fri Apr 14 2023

Journal Name

Journal Of Big Data

Volume

10

DOI

10.1186/s40537-023-00727-2

Choose Citation Style

Statistics

View publication

27

View pdf

1

Statistics

(784)

(766)

A survey on deep learning tools dealing with data scarcity: definitions, challenges, solutions, tips, and applications

Ali H. Al-Timemy

...Show More Authors

Abstract<p>Data scarcity is a major challenge when training deep learning (DL) models. DL demands a large amount of data to achieve exceptional performance. Unfortunately, many applications have small or inadequate data to train DL frameworks. Usually, manual labeling is needed to provide labeled data, which typically involves human annotators with a vast background of knowledge. This annotation process is costly, time-consuming, and error-prone. Usually, every DL framework is fed by a significant amount of labeled data to automatically learn representations. Ultimately, a larger amount of data would generate a better DL model and its performance is also application dependent. This issue is the main barrier for many applications dismissing the use of DL. Having sufficient data is the first step toward any successful and trustworthy DL application. This paper presents a holistic survey on state-of-the-art techniques to deal with training DL models to overcome three challenges including small, imbalanced datasets, and lack of generalization. This survey starts by listing the learning techniques. Next, the types of DL architectures are introduced. After that, state-of-the-art solutions to address the issue of lack of training data are listed, such as Transfer Learning (TL), Self-Supervised Learning (SSL), Generative Adversarial Networks (GANs), Model Architecture (MA), Physics-Informed Neural Network (PINN), and Deep Synthetic Minority Oversampling Technique (DeepSMOTE). Then, these solutions were followed by some related tips about data acquisition needed prior to training purposes, as well as recommendations for ensuring the trustworthiness of the training dataset. The survey ends with a list of applications that suffer from data scarcity, several alternatives are proposed in order to generate more data in each application including Electromagnetic Imaging (EMI), Civil Structural Health Monitoring, Medical imaging, Meteorology, Wireless Communications, Fluid Mechanics, Microelectromechanical system, and Cybersecurity. To the best of the authors’ knowledge, this is the first review that offers a comprehensive overview on strategies to tackle data scarcity in DL.</p>

View Publication Preview PDF

Quick Preview PDF

Publication Date

Sun Nov 14 2021

Journal Name

Palarch's Journal Of Archaeology Of Egypt/egyptology

Blended Learning in Teaching English to University Students

Majid Rasim

...Show More Authors

QJ Rashid, IH Abdul-Abbas, MR Younus, PalArch's Journal of Archaeology of Egypt/Egyptology, 2021 - Cited by 4

View Publication

Publication Date

Wed Jun 01 2011

Journal Name

Journal Of Al-nahrain University Science

Breaking Knapsack Cipher Using Population Based Incremental Learning

Nasreen J.

...Show More Authors

View Publication

Publication Date

Mon Oct 30 2023

Journal Name

Iraqi Journal Of Science

Machine Learning Approach for Facial Image Detection System

Hind Moutaz

...Show More Authors

HM Al-Dabbas, RA Azeez, AE Ali, Iraqi Journal of Science, 2023

View Publication

(8)

Publication Date

Wed May 12 2021

Journal Name

Annals Of The Romanian Society For Cell Biology

Contribution Ratio of Cognitive Learning Outcome in the Performance of the Two Skills of Mastering by Parallel Spherical Standing and Equilibrium on the Balance Beam

Cognitive learning outcome.

Huda

...Show More Authors

The purpose of this paper is to identify the statistical indicators of the searched variables and identify the relationship between the cognitive learning outcome and the performance of the two mastering skills by parallel spherical standing and equilibrium on the balance beam. And the identification of the percentage of the cognitive learning outcome contribution to the performance of the two mastering skills by parallel spherical standing and the equilibrium on the balance beam. The two researchers used the descriptive approach in the survey method and the correlational relations, being the most appropriate to the nature of the research problem. The research community for the second stage students in the College of Physical Education and

View Publication Preview PDF

Publication Date

Mon Feb 04 2019

Journal Name

Journal Of The College Of Education For Women

From Learning for Living to Lifelong Learning “Seek knowledge from the cradle to the grave” Prophet Mohammed’s saying: نجاة احمد الجبوري

Nejat Ahmed

...Show More Authors

ملخص البحث
تبحث الدراسھ عن تنفیذ افضل لمفھوم التعلم مدى الحیاة كھیكل موجھ للسیاسة التربویة في العراق بشكل عام وفي
التعلیم العالي بشكل خاص. تحدد الدراسة استراتجیات التعلم مدى الحیاة وتناقش اھمیتھ وسماتھ الرئیسیة لتسھیل
الوصول الى فرص تعلم متمیز و ملائم لحاجات الطلبة مدى الحیاة، كما تناقش دور الجامعة في تحقیق ھذا الھدف.

View Publication Preview PDF

Publication Date

Sun Jan 19 2020

Journal Name

Journal Of Accounting And Financial Studies ( Jafs )

Effect flexibility of the Strategic human resources in improvement operational performance: "Survey study of views of a sample of managers, engineers and technicians in the Baghdad south 2/Station directorate of Electricity"

أثير عبد الله

...Show More Authors

The research attempts to diagnose the level of the effect of human resources flexibility (employees skills flexibility, employees behaviors flexibility, and human resource practice flexibility) in the south al-rusafa directorate of a power station one of the formations and the Ministry of Electricity, and impact of a range of variables related to the performance operational, namely, (efficiency, effectiveness)recognizing the importance of the subjects studied,& because of the importance of expected results of the field under consideration,researcher selected a sample of size (121) engineers and technicians of workers in the directorate. Was my hypotheses the major search of a relationship and impact between human resources flex

View Publication Preview PDF

Publication Date

Mon Oct 04 2021

Journal Name

Journal Of Petroleum Exploration And Production Technology

Perforation location optimization through 1-D mechanical earth model for high-pressure deep formations

Nagham

...Show More Authors

Optimum perforation location selection is an important study to improve well production and hence in the reservoir development process, especially for unconventional high-pressure formations such as the formations under study. Reservoir geomechanics is one of the key factors to find optimal perforation location. This study aims to detect optimum perforation location by investigating the changes in geomechanical properties and wellbore stress for high-pressure formations and studying the difference in different stress type behaviors between normal and abnormal formations. The calculations are achieved by building one-dimensional mechanical earth model using the data of four deep abnormal wells located in Southern Iraqi oil fields. The magni

Publication Date

Wed May 03 2023

Journal Name

Periodicals Of Engineering And Natural Sciences (pen)

Enhancing smart home energy efficiency through accurate load prediction using deep convolutional neural networks

Suaad M.

Geehan Sabah

...Show More Authors

The method of predicting the electricity load of a home using deep learning techniques is called intelligent home load prediction based on deep convolutional neural networks. This method uses convolutional neural networks to analyze data from various sources such as weather, time of day, and other factors to accurately predict the electricity load of a home. The purpose of this method is to help optimize energy usage and reduce energy costs. The article proposes a deep learning-based approach for nonpermanent residential electrical ener-gy load forecasting that employs temporal convolutional networks (TCN) to model historic load collection with timeseries traits and to study notably dynamic patterns of variants amongst attribute par

View Publication

Publication Date

Mon Jul 15 2024

Journal Name

2024 46th Annual International Conference Of The Ieee Engineering In Medicine And Biology Society (embc)

Automatic COVID-19 Detection from Chest X-ray using Deep MobileNet Convolutional Neural Network

Noor Kamal

Alaa A.

Sawal Hamid Bin Mohd

Siti Anom

...Show More Authors

View Publication

(5)

(4)

Publication Date

Sat Jun 01 2019

Journal Name

Journal Of Economics And Administrative Sciences

Evaluation Among Choices Of Water Quality Improvement By Using Some Of Total Quality Management Tools Applied Research In Baghdad Governorate Water Directorate

Product Quality

Quality Improvement

Brainstorming

Nominal Group Technique

Matrix Data Analysis.

جودة المنتوج

تحسين الجودة

العصف الذهني

تقنية المجموعة الأسمية

تحليل بيانات المصفوفة.

مها كامل

عادل ستار

...Show More Authors

Many managers in geometrical and technical organizations prefer to deal with quantitative values to choose between the available options and choose the best alternative to avoid randomization and bias in decision making. One of them Baghdad Water Department, which seeks to develop the quality of its product (drinking water) and achieve its objectives under increasing growing population and the demand for water, Some of TQM tools, especially the statistical, have this ability because there is chance to use historical data and experiment of employees in Application . Two statistical tools were applied: the nominal group technique, matrix data analysis technique as well as the brainstorming tool to search for the best o

View Publication Preview PDF

1 2 ... 141 142 143 144 ... 2329 2330