A survey on deep learning tools dealing with data scarcity: definitions, challenges, solutions, tips, and applications

Ali H. Al-Timemy

doi:10.1186/s40537-023-00727-2

Details

Publication Date

Fri Apr 14 2023

Journal Name

Journal Of Big Data

Volume

10

DOI

10.1186/s40537-023-00727-2

Choose Citation Style

Statistics

View publication

25

View pdf

1

Statistics

(506)

(500)

A survey on deep learning tools dealing with data scarcity: definitions, challenges, solutions, tips, and applications

Ali H. Al-Timemy

...Show More Authors

Abstract<p>Data scarcity is a major challenge when training deep learning (DL) models. DL demands a large amount of data to achieve exceptional performance. Unfortunately, many applications have small or inadequate data to train DL frameworks. Usually, manual labeling is needed to provide labeled data, which typically involves human annotators with a vast background of knowledge. This annotation process is costly, time-consuming, and error-prone. Usually, every DL framework is fed by a significant amount of labeled data to automatically learn representations. Ultimately, a larger amount of data would generate a better DL model and its performance is also application dependent. This issue is the main barrier for many applications dismissing the use of DL. Having sufficient data is the first step toward any successful and trustworthy DL application. This paper presents a holistic survey on state-of-the-art techniques to deal with training DL models to overcome three challenges including small, imbalanced datasets, and lack of generalization. This survey starts by listing the learning techniques. Next, the types of DL architectures are introduced. After that, state-of-the-art solutions to address the issue of lack of training data are listed, such as Transfer Learning (TL), Self-Supervised Learning (SSL), Generative Adversarial Networks (GANs), Model Architecture (MA), Physics-Informed Neural Network (PINN), and Deep Synthetic Minority Oversampling Technique (DeepSMOTE). Then, these solutions were followed by some related tips about data acquisition needed prior to training purposes, as well as recommendations for ensuring the trustworthiness of the training dataset. The survey ends with a list of applications that suffer from data scarcity, several alternatives are proposed in order to generate more data in each application including Electromagnetic Imaging (EMI), Civil Structural Health Monitoring, Medical imaging, Meteorology, Wireless Communications, Fluid Mechanics, Microelectromechanical system, and Cybersecurity. To the best of the authors’ knowledge, this is the first review that offers a comprehensive overview on strategies to tackle data scarcity in DL.</p>

View Publication Preview PDF

Quick Preview PDF

Publication Date

Sun Mar 15 2020

Journal Name

Journal Of The College Of Education For Women

Second Language Learning and Its Relationship with the Third Language Learning: Statistical Study: China

Statistical Studies

Second and Third Languages

Learning Methods

Yanglu

...Show More Authors

The exchanges in various fields,like economics, science, culture, etc., have been enhanced unceasingly among different countries around the world in the twenty-first century, thus, the university graduate who masters one foreign language does not meet the need of the labor market in most countries.So, many universities began to develop new programs to cultivate students who can use more foreign languages to serve the intercultural communication. At the same time, there is more scientific research emerged which is related to the relationship between the second and third languages. This humble research seeks to explain the relevant concepts and analyze the real data collected from Shanghai International Studies University in China, to expl

View Publication Preview PDF

Publication Date

Sat Aug 31 2024

Journal Name

International Review On Modelling And Simulations (iremos)

Prosthetic Hand: a Brief Survey of Design and Actuation Technologies

Abdul-Rasool Kareem

Majid Habeeb

...Show More Authors

View Publication

Publication Date

Thu Aug 01 2024

Journal Name

Electronics

A Survey: Security Vulnerabilities and Protective Strategies for Graphical Passwords

Zena M.

Ahmed T.

Omar Z.

Alaa K.

...Show More Authors

As technology advances and develops, the need for strong and simple authentication mechanisms that can help protect data intensifies. The contemporary approach to giving access control is through graphical passwords comprising images, patterns, or graphical items. The objective of this review was to determine the documented security risks that are related to the use of graphical passwords, together with the measures that have been taken to prevent them. The review was intended to present an extensive literature review of the subject matter on graphical password protection and to point toward potential future research directions. Many attacks, such as shoulder surfing attacks, SQL injection attacks, and spyware attacks, can easily ex

View Publication

(7)

(3)

Publication Date

Tue Jan 01 2019

Journal Name

Advances On Computational Intelligence In Energy

A Theoretical Framework for Big Data Analytics Based on Computational Intelligent Algorithms with the Potential to Reduce Energy Consumption

H.

U. A.

I. A. T. Hashem

Y.

R. D.

M. M.

G. E.

S.

...Show More Authors

Within the framework of big data, energy issues are highly significant. Despite the significance of energy, theoretical studies focusing primarily on the issue of energy within big data analytics in relation to computational intelligent algorithms are scarce. The purpose of this study is to explore the theoretical aspects of energy issues in big data analytics in relation to computational intelligent algorithms since this is critical in exploring the emperica aspects of big data. In this chapter, we present a theoretical study of energy issues related to applications of computational intelligent algorithms in big data analytics. This work highlights that big data analytics using computational intelligent algorithms generates a very high amo

View Publication

(1)

Publication Date

Wed Jan 15 2025

Journal Name

International Journal Of Cloud Computing And Database Management

Deep video understanding based on language generation

CNNS

3D CNN (RESNET-101)

RNNS

LSTMS

video understanding

language generation

Liqaa M

...Show More Authors

Vol. 6, Issue 1 (2025)

View Publication Preview PDF

Publication Date

Tue Aug 01 2023

Journal Name

Baghdad Science Journal

Digital Data Encryption Using a Proposed W-Method Based on AES and DES Algorithms

Advanced Encryption Standard AES

Data Encryption Standard DES

Decryption

Encryption

Keys Encryption.

Dr. Wisam

Luheb Kareem

Ahmed

...Show More Authors

This paper proposes a new encryption method. It combines two cipher algorithms, i.e., DES and AES, to generate hybrid keys. This combination strengthens the proposed W-method by generating high randomized keys. Two points can represent the reliability of any encryption technique. Firstly, is the key generation; therefore, our approach merges 64 bits of DES with 64 bits of AES to produce 128 bits as a root key for all remaining keys that are 15. This complexity increases the level of the ciphering process. Moreover, it shifts the operation one bit only to the right. Secondly is the nature of the encryption process. It includes two keys and mixes one round of DES with one round of AES to reduce the performance time. The W-method deals with

View Publication Preview PDF

(7)

(4)

Publication Date

Tue Aug 03 2021

Journal Name

Journal Of Global Trends In Pharmaceutical Sciences

A REVIEW: CEFPODOXIME PROXETIL (DOXEF. PROXETIL) DISCOVERY, PREPARATION, APPLICATIONS AND COMPARISON WITH CEFPODOXIME- CLAVULANIC ACID IN ACTIVITY

Sahar B.

Nedaa A. Hameed

Anwar A.

...Show More Authors

Publication Date

Sun Sep 03 2023

Journal Name

Al-mansour Journal

Biometrics Systems Challenges in a Post-COVID-19 Pandemic World: A review

COVID-19 pandemic

prevention

biometrics

contact-based

contactless

Suhaila N.

Fatin S.

...Show More Authors

One of the most serious health disasters in recent memory is the COVID-19 epidemic. Several restriction rules have been forced to reduce the virus spreading. Masks that are properly fitted can help prevent the virus from spreading from the person wearing the mask to others. Masks alone will not protect against COVID-19; they must be used in conjunction with physical separation and avoidance of direct contact. The fast spread of this disease, as well as the growing usage of prevention methods, underscore the critical need for a shift in biometrics-based authentication schemes. Biometrics systems are affected differently depending on whether are used as one of the preventive techniques based on COVID-19 pandemic rules. This study provides an

View Publication Preview PDF

Publication Date

Sun May 01 2011

Journal Name

Information Sciences

Design and implementation of a t-way test data generation strategy with automated execution tool support

Kamal Z.

Mohammad F.J.

Mohammed I.

Nor Ashidi Mat

Rusli

...Show More Authors

View Publication

(66)

(52)

Publication Date

Fri Jan 01 2021

Journal Name

Ieee Access

6G Wireless Communications Networks: A Comprehensive Survey

Muntadher

Marwah Abdulrazzaq

Basheera M.

Sadiq H.

Mohammad R.

Ahmed

Nor K.

Sadiq M.

Khaled A.

Fazirul

...Show More Authors

View Publication

(401)

(391)

1 2 ... 17 18 19 20 ... 2184 2185