A survey on deep learning tools dealing with data scarcity: definitions, challenges, solutions, tips, and applications

Ali H. Al-Timemy

doi:10.1186/s40537-023-00727-2

Details

Publication Date

Fri Apr 14 2023

Journal Name

Journal Of Big Data

Volume

10

DOI

10.1186/s40537-023-00727-2

Choose Citation Style

Statistics

View publication

25

View pdf

1

Statistics

(608)

(605)

A survey on deep learning tools dealing with data scarcity: definitions, challenges, solutions, tips, and applications

Ali H. Al-Timemy

...Show More Authors

Abstract<p>Data scarcity is a major challenge when training deep learning (DL) models. DL demands a large amount of data to achieve exceptional performance. Unfortunately, many applications have small or inadequate data to train DL frameworks. Usually, manual labeling is needed to provide labeled data, which typically involves human annotators with a vast background of knowledge. This annotation process is costly, time-consuming, and error-prone. Usually, every DL framework is fed by a significant amount of labeled data to automatically learn representations. Ultimately, a larger amount of data would generate a better DL model and its performance is also application dependent. This issue is the main barrier for many applications dismissing the use of DL. Having sufficient data is the first step toward any successful and trustworthy DL application. This paper presents a holistic survey on state-of-the-art techniques to deal with training DL models to overcome three challenges including small, imbalanced datasets, and lack of generalization. This survey starts by listing the learning techniques. Next, the types of DL architectures are introduced. After that, state-of-the-art solutions to address the issue of lack of training data are listed, such as Transfer Learning (TL), Self-Supervised Learning (SSL), Generative Adversarial Networks (GANs), Model Architecture (MA), Physics-Informed Neural Network (PINN), and Deep Synthetic Minority Oversampling Technique (DeepSMOTE). Then, these solutions were followed by some related tips about data acquisition needed prior to training purposes, as well as recommendations for ensuring the trustworthiness of the training dataset. The survey ends with a list of applications that suffer from data scarcity, several alternatives are proposed in order to generate more data in each application including Electromagnetic Imaging (EMI), Civil Structural Health Monitoring, Medical imaging, Meteorology, Wireless Communications, Fluid Mechanics, Microelectromechanical system, and Cybersecurity. To the best of the authors’ knowledge, this is the first review that offers a comprehensive overview on strategies to tackle data scarcity in DL.</p>

View Publication Preview PDF

Quick Preview PDF

Publication Date

Fri Jun 08 2018

Journal Name

Advances In Intelligent Systems And Computing

Improve Memory for Alzheimer Patient by Employing Mind Wave on Virtual Reality with Deep Learning

Marwan Kadhim

Gao Tian

...Show More Authors

View Publication

(1)

(2)

Publication Date

Sat Jun 06 2020

Journal Name

Journal Of The College Of Education For Women

Image classification with Deep Convolutional Neural Network Using Tensorflow and Transfer of Learning

Convolutional Neural Network (CNN)

Synthetic Aperture Radar (SAR)

TensorFlow

Transfer learning

Visual Geometry Group (VGG16)

Aseel Sami

MatheelEmaduldin

...Show More Authors

The deep learning algorithm has recently achieved a lot of success, especially in the field of computer vision. This research aims to describe the classification method applied to the dataset of multiple types of images (Synthetic Aperture Radar (SAR) images and non-SAR images). In such a classification, transfer learning was used followed by fine-tuning methods. Besides, pre-trained architectures were used on the known image database ImageNet. The model VGG16 was indeed used as a feature extractor and a new classifier was trained based on extracted features.The input data mainly focused on the dataset consist of five classes including the SAR images class (houses) and the non-SAR images classes (Cats, Dogs, Horses, and Humans). The Conv

View Publication Preview PDF

(1)

Publication Date

Sun Jun 20 2021

Journal Name

Baghdad Science Journal

Arabic Speech Classification Method Based on Padding and Deep Learning Neural Network

Arabic alphabet

deep learning

speech classification

COVID-19

spectrogram

Asroni

Ku Ruhana

Cahya

Hasan Basri

...Show More Authors

Deep learning convolution neural network has been widely used to recognize or classify voice. Various techniques have been used together with convolution neural network to prepare voice data before the training process in developing the classification model. However, not all model can produce good classification accuracy as there are many types of voice or speech. Classification of Arabic alphabet pronunciation is a one of the types of voice and accurate pronunciation is required in the learning of the Qur’an reading. Thus, the technique to process the pronunciation and training of the processed data requires specific approach. To overcome this issue, a method based on padding and deep learning convolution neural network is proposed to

View Publication Preview PDF

(20)

(6)

Publication Date

Tue Oct 01 2019

Journal Name

Ieee Xplore

The Internet of Everything Based Smart Systems: Applications and Challenges

Zaid M.

Haider K.

...Show More Authors

Smart systems are the trend for modern organizations and should meet the quality of services that expect to produce. Internet of Everything (IoE) helped smart systems to adopt microcontrollers for improving the performance. Analyzing and controlling data in such a system are critical issues. In this study, a survey of IoE systems conducted to show how to apply a suitable model that meets such system requirements. The analysis of some microcontroller boards is explored based on known features. Factors for applying IoE devices have been defined such as connectivity, power consumption, compatibility, and cost. Different methods have been explained as an overview of applying IoE systems. Further, different approaches for applying IoE technology

(5)

(4)

Publication Date

Fri Jul 01 2016

Journal Name

Indonesian Journal Of Electrical Engineering And Computer Science

Survey Smoothly Fiber-Wireless (FiWi) Accessing Wireless Networks: Convergence and Challenges

Naseer Hwaidi

Raed

Dheyaa Jasim

...Show More Authors

<p> Traditionally, wireless networks and optical fiber Networks are independent of each other. Wireless networks are designed to meet specific service requirements, while dealing with weak physical transmission, and maximize system resources to ensure cost effectiveness and satisfaction for the end user. In optical fiber networks, on the other hand, search efforts instead concentrated on simple low-cost, future-proofness against inheritance and high services and applications through optical transparency. The ultimate goal of providing access to information when needed, was considered significantly. Whatever form it is required, not only increases the requirement sees technology convergence of wireless and optical networks but

View Publication

(1)

Publication Date

Sat Sep 28 2024

Journal Name

Journal Of Physical Education

Special exercises using tools and their effect on learning the skill of landing with Salto backward tucked to stand on the horizontal bar

Roaa

Shaima

Jamal

...Show More Authors

Special exercises in individual games are an important pillar in learning their basic skills. The aim of the research is to prepare special exercises using tools and their effect on learning the skill of landing with Salto backward tucked to stand - knowing the effect of special exercises using tools and their effect on learning the skill of landing with Salto backward tucked to stand on the horizontal bar. Either the research assumes the existence of significant differences in the pre- and post-tests in learning the skill of landing with Salto backward tucked to stand on the horizontal bar in favor of the post-test. The researchers used the experimental method with a single sample design to suit the research problem, as the researc

View Publication

Publication Date

Wed Mar 08 2023

Journal Name

Sensors

A Critical Review of Remote Sensing Approaches and Deep Learning Techniques in Archaeology

Israa

Fanar M.

...Show More Authors

To date, comprehensive reviews and discussions of the strengths and limitations of Remote Sensing (RS) standalone and combination approaches, and Deep Learning (DL)-based RS datasets in archaeology have been limited. The objective of this paper is, therefore, to review and critically discuss existing studies that have applied these advanced approaches in archaeology, with a specific focus on digital preservation and object detection. RS standalone approaches including range-based and image-based modelling (e.g., laser scanning and SfM photogrammetry) have several disadvantages in terms of spatial resolution, penetrations, textures, colours, and accuracy. These limitations have led some archaeological studies to fuse/integrate multip

View Publication

(18)

(13)

Publication Date

Thu Dec 01 2022

Journal Name

Journal Of Engineering

Deep Learning-Based Segmentation and Classification Techniques for Brain Tumor MRI: A Review

Brain Tumor

Magnetic Resonance Imaging (MRI)

Convolutional Neural Network (CNN)

Classification

Segmentation

Feature Extraction.

Noor Mohammed

Nassir H.

...Show More Authors

Early detection of brain tumors is critical for enhancing treatment options and extending patient survival. Magnetic resonance imaging (MRI) scanning gives more detailed information, such as greater contrast and clarity than any other scanning method. Manually dividing brain tumors from many MRI images collected in clinical practice for cancer diagnosis is a tough and time-consuming task. Tumors and MRI scans of the brain can be discovered using algorithms and machine learning technologies, making the process easier for doctors because MRI images can appear healthy when the person may have a tumor or be malignant. Recently, deep learning techniques based on deep convolutional neural networks have been used to analyze med

View Publication Preview PDF

(11)

Publication Date

Sun Jun 01 2025

Journal Name

Al-khwarizmi Engineering Journal

Recent Tools of Software-Defined Networking Traffic Generation and Data Collection

Tabarak

Omar

...Show More Authors

أثبتت الشبكات المحددة بالبرمجيات (SDN) تفوقها في معالجة مشاكل الشبكة العادية مثل قابلية التوسع وخفة الحركة والأمن. تأتي هذه الميزة من SDN بسبب فصل مستوى التحكم عن مستوى البيانات. على الرغم من وجود العديد من الأوراق والدراسات التي تركز على إدارة SDN، والرصد، والتحكم، وتحسين QoS، إلا أن القليل منها يركز على تقديم ما يستخدمونه لتوليد حركة المرور وقياس أداء الشبكة. كما أن المؤلفات تفتقر إلى مقارنات بين الأدوات والأ

View Publication

(1)

(2)

Publication Date

Fri Sep 01 2023

Journal Name

Journal Of Engineering

Iraqi Sentiment and Emotion Analysis Using Deep Learning

Emotion analysis

Sentiment analysis

CNN

GRU

Iraqi dialect

Anwar Abdul-Razzaq

Nada A. Z.

...Show More Authors

Analyzing sentiment and emotions in Arabic texts on social networking sites has gained wide interest from researchers. It has been an active research topic in recent years due to its importance in analyzing reviewers' opinions. The Iraqi dialect is one of the Arabic dialects used in social networking sites, characterized by its complexity and, therefore, the difficulty of analyzing sentiment. This work presents a hybrid deep learning model consisting of a Convolution Neural Network (CNN) and the Gated Recurrent Units (GRU) to analyze sentiment and emotions in Iraqi texts. Three Iraqi datasets (Iraqi Arab Emotions Data Set (IAEDS), Annotated Corpus of Mesopotamian-Iraqi Dialect (ACMID), and Iraqi Arabic Dataset (IAD)) col

View Publication Preview PDF

(6)

1 2 3 4 ... 2227 2228 2229 2230