Training and Testing Data Division Influence on Hybrid Machine Learning Model Process: Application of River Flow Forecasting

Hai Tao; Ali Omran Al-Sulttani; Ameen Mohammed Salih Ameen; Zainab Hasan Ali; Nadhir Al-Ansari; Sinan Q. Salih; Reham R. Mostafa

doi:10.1155/2020/8844367

Details

Publication Date

Thu Oct 29 2020

Journal Name

Complexity

Volume

2020

The hydrological process has a dynamic nature characterised by randomness and complex phenomena. The application of machine learning (ML) models in forecasting river flow has grown rapidly, owing to their capacity to simulate the complex phenomena associated with hydrological and environmental processes. Four different ML models were developed for forecasting the flow of a river located in a semiarid region of Iraq. The influence of data division on the ML modelling process was investigated. Three data division scenarios were inspected: 70%–30%, 80%–20%, and 90%–10%. Several statistical indicators were computed to verify the performance of the models. The results revealed the potential of the hybridized support vector regression model with a genetic algorithm (SVR-GA) over the other ML forecasting models for monthly river flow forecasting using the 90%–10% data division. In addition, it was found to improve the accuracy in forecasting high flow events. The architecture of the developed SVR-GA provides a robust learning process because the GA optimizer tunes the internal parameters of the SVR model. This has made it more efficient in forecasting stochastic river flow behaviour compared to the other developed hybrid models.
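The three data division scenarios can be illustrated with a minimal sketch. It assumes that the first figure in each scenario is the training share and that the series is split chronologically, without shuffling. The synthetic monthly flow series, the function names, and the persistence baseline are illustrative assumptions only; they are not the study's data, its SVR-GA model, or its exact indicators.

```python
import math


def chronological_split(series, train_fraction):
    """Split a time-ordered series into training and testing parts without shuffling."""
    if not 0.0 < train_fraction < 1.0:
        raise ValueError("train_fraction must be strictly between 0 and 1")
    cut = int(round(len(series) * train_fraction))
    return series[:cut], series[cut:]


def rmse(observed, predicted):
    """Root mean square error between two equally long sequences."""
    squared = [(o - p) ** 2 for o, p in zip(observed, predicted)]
    return math.sqrt(sum(squared) / len(squared))


def mae(observed, predicted):
    """Mean absolute error between two equally long sequences."""
    return sum(abs(o - p) for o, p in zip(observed, predicted)) / len(observed)


# Hypothetical data: ten years of monthly flow with a seasonal cycle.
flow = [100.0 + 50.0 * math.sin(2.0 * math.pi * m / 12.0) for m in range(120)]

results = {}
for fraction in (0.7, 0.8, 0.9):
    train, test = chronological_split(flow, fraction)
    # Persistence baseline (a stand-in for a trained model):
    # each month is forecast with the previous month's observed flow.
    forecast = [train[-1]] + test[:-1]
    results[fraction] = {
        "n_train": len(train),
        "n_test": len(test),
        "rmse": rmse(test, forecast),
        "mae": mae(test, forecast),
    }

for fraction, row in results.items():
    print(fraction, row)
```

Replacing the persistence baseline with a model fitted on `train` would allow the same loop to compare scenarios for any forecasting model. Note that a smaller testing share means the indicators are computed on fewer months, which is worth keeping in mind when comparing scenarios.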

Publication Date

Sun Jan 14 2024

Journal Name

Journal of Al-Rafidain University College for Sciences (Print ISSN: 1681-6870, Online ISSN: 2790-2293)

Using Nonparametric Procedure to Develop an OCMT Estimator for Big Data Linear Regression Model with Application Chemical Pollution in the Tigris River

Munaf

Chemical pollution is a serious problem that affects public health today and the health of future generations. It must therefore be studied in order to find suitable models that describe it and predict its behaviour in the coming years. Chemical pollution data in Iraq are extensive and come from many sources and in many types, which qualifies them as Big Data that need to be studied using novel statistical methods. The research aims to use a proposed nonparametric procedure (NP method) to develop an OCMT test procedure for estimating the parameters of a linear regression model with a large volume of data (Big Data) that comprises many indicators associated with chemi

Publication Date

Sun Sep 03 2023

Journal Name

Wireless Personal Communications

Application of Healthcare Management Technologies for COVID-19 Pandemic Using Internet of Things and Machine Learning Algorithms

Mohammed

Publication Date

Sat Oct 19 2024

Journal Name

Iraqi Statisticians Journal

Forecasting Gold prices by hybrid ANFIS-based algorithm

Munaf Yousif

Ahmed A.

In this article, the accuracy and effectiveness of forecasting global gold prices are verified using a hybrid machine learning algorithm that combines an Adaptive Neuro-Fuzzy Inference System (ANFIS) model with Particle Swarm Optimization (PSO) and the Grey Wolf Optimizer (GWO). The hybrid approach proved successful enough to be a good strategy for practical use. The ARIMA-ANFIS hybrid methodology was used to forecast global gold prices. The ARIMA model is fitted to real data, and its nonlinear residuals are then predicted by ANFIS, ANFIS-PSO, and ANFIS-GWO. The results indicate that the hybrid models improve on the forecasting accuracy of the single ARIMA and ANFIS models. Finally, a comparison was made between the hybrid foreca

Publication Date

Thu Apr 30 2026

Journal Name

Journal Of Economics And Administrative Sciences

Combined Hybrid ARDL-GARCH-BIGRU Model in Analyzing and Forecasting Currency in Circulation Issued by the Central Bank of Iraq

Keywords

Currency in circulation; ARDL model; GARCH model; BIGRU; Hybrid time series models; Deep learning

Abdulrazzaq Tallal

Omar Abdulmohsin

Publication Date

Wed Mar 20 2024

Journal Name

Journal Of Petroleum Research And Studies

Advanced Machine Learning application for Permeability Prediction for (M) Formation in an Iraqi Oil Field

Noor alhuda K.

Ghanim M.

Permeability estimation is a vital step in reservoir engineering because of its effect on reservoir characterization, perforation planning, and the economic efficiency of reservoirs. Core data and well-logging data are the main sources for measuring and calculating permeability, respectively. There are multiple methods to predict permeability, such as classic, empirical, and geostatistical methods. In this research, two statistical approaches, Multiple Linear Regression and Random Forest, were applied and compared for permeability prediction in the (M) reservoir interval of the (BH) Oil Field in the northern part of Iraq. The dataset was separated into two subsets: Training and Testing in order to cross-validate the accuracy

Publication Date

Sun Jan 22 2023

Journal Name

Mesopotamian Journal Of Big Data

Parallel Machine Learning Algorithms

Qusay

To expedite the learning process, a group of algorithms known as parallel machine learning algorithms can be executed simultaneously on several computers or processors. As data grows in both size and complexity, and as businesses seek efficient ways to mine that data for insights, algorithms like these will become increasingly crucial. Data parallelism, model parallelism, and hybrid techniques are just some of the methods described in this article for speeding up machine learning algorithms. We also cover the benefits and threats associated with parallel machine learning, such as data splitting, communication, and scalability. We compare how well various methods perform on a variety of machine learning tasks and datasets, and we talk abo

Publication Date

Mon Mar 31 2025

Journal Name

The Iraqi Geological Journal

Evaluation of Machine Learning Techniques for Missing Well Log Data in Buzurgan Oil Field: A Case Study

Usama

The investigation of machine learning techniques for addressing missing well-log data has garnered considerable interest recently, especially as the oil and gas sector pursues novel approaches to improve data interpretation and reservoir characterization. Conversely, for wells that have been in operation for several years, conventional measurement techniques frequently encounter challenges related to availability, including the lack of well-log data, cost considerations, and precision issues. This study's objective is to enhance reservoir characterization by automating well-log creation using machine-learning techniques. Among the methods are multi-resolution graph-based clustering and the similarity threshold method. By using cutti

Publication Date

Wed Aug 01 2018

Journal Name

Journal Of Economics And Administrative Sciences

Comparison Some Estimation Methods Of GM(1,1) Model With Missing Data and Practical Application

Keywords

GM(1,1); LS; WLS; TLS; DS; HFO; D.O.

فراس احمد

نور

This paper presents the first-order, one-variable grey model GM(1,1), which is the basis of grey system theory. The research deals with the properties of the grey model and a set of methods for estimating the parameters of GM(1,1): the least squares method (LS), the weighted least squares method (WLS), the total least squares method (TLS), and the gradient descent method (DS). These methods were compared using two criteria: mean square error (MSE) and mean absolute percentage error (MAPE). After the comparison by simulation, the best method was applied to real data representing the consumption rate of two types of fuel, heavy fuel oil (HFO) and diesel oil (D.O), and several tests were applied to

Publication Date

Mon Jan 01 2024

Journal Name

AIP Conference Proceedings

A multivariate Bayesian model using Gibbs sampler with real data application

Keywords

Bayesian model; Gibbs sampler; Multivariate regression; Posterior distribution

Muntaha K.

Ghadeer J.

Hayder Abdul Hussein

Bayesian models are commonly used in recent research across many scientific fields. This research presents a new Bayesian model for estimating parameters and forecasting using the Gibbs sampler algorithm. Posterior distributions are generated using the inverse gamma distribution and the multivariate normal distribution as prior distributions. The new method was used to investigate and summarise the posterior distribution. The theory and derivation of the posterior distribution are explained in detail in this paper. The proposed approach is applied to three simulated datasets with sample sizes of 100, 300, and 500. The procedure was also extended to a real dataset called the rock intensity dataset. The actual dataset is collecte

Publication Date

Mon Jan 01 2024

Journal Name

BIO Web of Conferences

Concepts of statistical learning and classification in machine learning: An overview

Amer F.A.H.

Tasnim H.K.

Statistical learning theory serves as the foundation of machine learning (ML), which in turn represents the backbone of artificial intelligence, ushering in innovative solutions for real-world challenges. Its origins can be traced to the point where statistics and computing meet, from which it evolved into a distinct scientific discipline. Machine learning can be distinguished by its fundamental branches: supervised learning, unsupervised learning, semi-supervised learning, and reinforcement learning. Among these, supervised learning takes center stage and is divided into two fundamental forms: classification and regression. Regression is tailored for continuous outcomes, while classification specializes in c
