Multi-Resolution Hierarchical Structure for Efficient Data Aggregation and Mining of Big Data

Safaa Alwajidi

doi:10.1109/ICACTM.2019.8776717

Details

Publication Date

Mon Apr 01 2019

Journal Name

2019 International Conference On Automation, Computational And Technology Management (icactm)

DOI

10.1109/ICACTM.2019.8776717

Choose Citation Style

Statistics

View publication

9

Statistics

(4)

(2)

Multi-Resolution Hierarchical Structure for Efficient Data Aggregation and Mining of Big Data

Safaa Alwajidi

...Show More Authors

Big data analysis is essential for modern applications in areas such as healthcare, assistive technology, intelligent transportation, environment and climate monitoring. Traditional algorithms in data mining and machine learning do not scale well with data size. Mining and learning from big data need time and memory efficient techniques, albeit the cost of possible loss in accuracy. We have developed a data aggregation structure to summarize data with large number of instances and data generated from multiple data sources. Data are aggregated at multiple resolutions and resolution provides a trade-off between efficiency and accuracy. The structure is built once, updated incrementally, and serves as a common data input for multiple mining and learning algorithms. Data mining algorithms are modified to accept the aggregated data as input. Hierarchical data aggregation serves as a paradigm under which novel …

View Publication

Publication Date

Tue Jul 01 2025

Journal Name

Mastering The Minds Of Machines

Recurrent Neural Networks and its Applications in Time Series Data

Muhannad Akram

Nada Khalil

Canan Batur

Gang

Putra

Vaclav

Fatma A.

Aseel

Laith

...Show More Authors

View Publication

Publication Date

Mon Apr 03 2023

Journal Name

Journal Of Electronics,computer Networking And Applied Mathematics

Comparison of Some Estimator Methods of Regression Mixed Model for the Multilinearity Problem and High – Dimensional Data

Thaer Hashim Abdul

...Show More Authors

In order to obtain a mixed model with high significance and accurate alertness, it is necessary to search for the method that performs the task of selecting the most important variables to be included in the model, especially when the data under study suffers from the problem of multicollinearity as well as the problem of high dimensions. The research aims to compare some methods of choosing the explanatory variables and the estimation of the parameters of the regression model, which are Bayesian Ridge Regression (unbiased) and the adaptive Lasso regression model, using simulation. MSE was used to compare the methods.

View Publication

Publication Date

Wed Aug 01 2012

Journal Name

International Journal Of Geographical Information Science

Assessing similarity matching for possible integration of feature classifications of geospatial data from official and informal sources

Maythm

David

...Show More Authors

View Publication

(66)

(54)

Publication Date

Fri Jan 01 2016

Journal Name

Statistics And Its Interface

Search for risk haplotype segments with GWAS data by use of finite mixture models

ALI

Jian

...Show More Authors

The region-based association analysis has been proposed to capture the collective behavior of sets of variants by testing the association of each set instead of individual variants with the disease. Such an analysis typically involves a list of unphased multiple-locus genotypes with potentially sparse frequencies in cases and controls. To tackle the problem of the sparse distribution, a two-stage approach was proposed in literature: In the first stage, haplotypes are computationally inferred from genotypes, followed by a haplotype coclassification. In the second stage, the association analysis is performed on the inferred haplotype groups. If a haplotype is unevenly distributed between the case and control samples, this haplotype is labeled

View Publication

Publication Date

Sat Jan 01 2022

Journal Name

Journal Of Petroleum Science And Engineering

Performance evaluation of analytical methods in linear flow data for hydraulically-fractured gas wells

Hydraulic fracturing

Shale gas reservoirs

Tight gas reservoirs

Unconventional reservoirs

Fracture half-length

Petroleum engineering

Atheer

Huda

Dheiaa

Omar

Mofazzal

...Show More Authors

View Publication

(11)

(8)

Publication Date

Sat Jul 22 2023

Journal Name

Journal Of Engineering

Use of Gis for Creating a Project Management Data Base in Baghdad Al-Rissfa

Sawsan Rasheed

Ayad abdulamer

...Show More Authors

The main objective of resources management is to supply and support the site operation with necessary resources in a way to achieve the required timing in handing over the work as well as to achieve the cost-realism within the budget estimated. The research aims to know the advantage of using GIS in management of resources as one of the new tools that keep pace with the evolution in various countries around the world also collect the vast amount of spatial data resources in one environment easily to handled and accessed quickly and this help to make the right decision regarding management of resources in various construction projects. The process of using GIS in the management and identification of resources is of extreme importance in t

View Publication Preview PDF

Publication Date

Sat Dec 30 2023

Journal Name

Journal Of Economics And Administrative Sciences

The Cluster Analysis by Using Nonparametric Cubic B-Spline Modeling for Longitudinal Data

البيانات الطولية

نموذج الشرائح B-spline التكعيبية اللامعلمية

التحليل العنقودي

طريقة الاتجاه المتناوب لخوارزمية المضاعف ADMM.

Longitudinal Data

Nonparametric Cubic B-Spline

Cluster Analysis

The Alternating Direction Method for Multiplier Algorithm ADMM.

Noor

Suhail

...Show More Authors

Longitudinal data is becoming increasingly common, especially in the medical and economic fields, and various methods have been analyzed and developed to analyze this type of data.

In this research, the focus was on compiling and analyzing this data, as cluster analysis plays an important role in identifying and grouping co-expressed subfiles over time and employing them on the nonparametric smoothing cubic B-spline model, which is characterized by providing continuous first and second derivatives, resulting in a smoother curve with fewer abrupt changes in slope. It is also more flexible and can pick up on more complex patterns and fluctuations in the data.

The longitudinal balanced data profile was compiled into subgroup

View Publication Preview PDF

Publication Date

Thu Jun 01 2023

Journal Name

Bulletin Of Electrical Engineering And Informatics

A missing data imputation method based on salp swarm algorithm for diabetes disease

Geehan Sabah Hassan

Noora Jamal Ali

Asma Khazaal Abdulsahib

Farah Jasim Mohammed

...Show More Authors

Most of the medical datasets suffer from missing data, due to the expense of some tests or human faults while recording these tests. This issue affects the performance of the machine learning models because the values of some features will be missing. Therefore, there is a need for a specific type of methods for imputing these missing data. In this research, the salp swarm algorithm (SSA) is used for generating and imputing the missing values in the pain in my ass (also known Pima) Indian diabetes disease (PIDD) dataset, the proposed algorithm is called (ISSA). The obtained results showed that the classification performance of three different classifiers which are support vector machine (SVM), K-nearest neighbour (KNN), and Naïve B

View Publication

(10)

(2)

Publication Date

Fri Jul 21 2023

Journal Name

Journal Of Engineering

A Modified 2D-Checksum Error Detecting Method for Data Transmission in Noisy Media

Checksum

one dimentional parity

2Dimensional parity

Error Detecting

Ammar Osamah

...Show More Authors

In data transmission a change in single bit in the received data may lead to miss understanding or a disaster. Each bit in the sent information has high priority especially with information such as the address of the receiver. The importance of error detection with each single change is a key issue in data transmission field.
The ordinary single parity detection method can detect odd number of errors efficiently, but fails with even number of errors. Other detection methods such as two-dimensional and checksum showed better results and failed to cope with the increasing number of errors.
Two novel methods were suggested to detect the binary bit change errors when transmitting data in a noisy media.Those methods were: 2D-Checksum me

View Publication Preview PDF

Publication Date

Sun Dec 01 2019

Journal Name

Journal Of Economics And Administrative Sciences

Contemporary Challenges for Cloud Computing Data Governance in Information Centers: An analytical study

حوكمة البيانات

الحوسبة السحابية

تحديات حوكمة بيانات الحوسبة السحابية

فاعلية نظم المعلومات

مراكز الحاسوب والانترنت

الجامعات العراقية العامة والخاصة.

data governance

cloud computing

cloud governance challenges

information systems effectiveness

the Internet and computer centers

public and private Iraqi universities.

عامر عبد الرزاق

احمد زهير

...Show More Authors

Purpose – The Cloud computing (CC) and its services have enabled the information centers of organizations to adapt their informatic and technological infrastructure and making it more appropriate to develop flexible information systems in the light of responding to the informational and knowledge needs of their users. In this context, cloud-data governance has become more complex and dynamic, requiring an in-depth understanding of the data management strategy at these centers in terms of: organizational structure and regulations, people, technology, process, roles and responsibilities. Therefore, our paper discusses these dimensions as challenges that facing information centers in according to their data governance and the impa

View Publication Preview PDF

(1)

1 2 ... 16 17 18 19 ... 3000 3001