Fuzzy C means Based Evaluation Algorithms For Cancer Gene Expression Data Clustering

Omar Al-Janabee; Basad Al-Sarray

doi:https://doi.org/10.52866/ijcsm.2022.02.01.004

Details

Publication Date

Mon Feb 21 2022

Journal Name

Iraqi Journal For Computer Science And Mathematics

DOI

https://doi.org/10.52866/ijcsm.2022.02.01.004

Choose Citation Style

Statistics

View publication

7

Statistics

(1)

Fuzzy C means Based Evaluation Algorithms For Cancer Gene Expression Data Clustering

Omar Al-Janabee

Basad Al-Sarray

...Show More Authors

The influx of data in bioinformatics is primarily in the form of DNA, RNA, and protein sequences. This condition places a significant burden on scientists and computers. Some genomics studies depend on clustering techniques to group similarly expressed genes into one cluster. Clustering is a type of unsupervised learning that can be used to divide unknown cluster data into clusters. The k-means and fuzzy c-means (FCM) algorithms are examples of algorithms that can be used for clustering. Consequently, clustering is a common approach that divides an input space into several homogeneous zones; it can be achieved using a variety of algorithms. This study used three models to cluster a brain tumor dataset. The first model uses FCM, which is used to cluster genes. FCM allows an object to belong to two or more clusters with a membership grade between zero and one and the sum of belonging to all clusters of each gene is equal to one. This paradigm is useful when dealing with microarray data. The total time required to implement the first model is 22.2589 s. The second model combines FCM and particle swarm optimization (PSO) to obtain better results. The hybrid algorithm, i.e., FCM–PSO, uses the DB index as objective function. The experimental results show that the proposed hybrid FCM–PSO method is effective. The total time of implementation of this model is 89.6087 s. The third model combines FCM with a genetic algorithm (GA) to obtain better results. This hybrid algorithm also uses the DB index as objective function. The experimental results show that the proposed hybrid FCM–GA method is effective. Its total time of implementation is 50.8021 s. In addition, this study uses cluster validity indexes to determine the best partitioning for the underlying data. Internal validity indexes include the Jaccard, Davies Bouldin, Dunn, Xie–Beni, and silhouette. Meanwhile, external validity indexes include Minkowski, adjusted Rand, and percentage of correctly categorized pairings. Experiments conducted on brain tumor gene expression data demonstrate that the techniques used in this study outperform traditional models in terms of stability and biological significance.

View Publication

Publication Date

Sun Jul 09 2023

Journal Name

Journal Of Engineering

Analysis of Mosul and Haditha Dam Flow Data

Time series

Reservoir operation

Tigris –Euphrates River

Dam

and Forecasting.

Ahmaed Mohammed

Maryam Naeem

...Show More Authors

The expansion in water projects implementations in Turkey and Syria becomes of great concern to the workers in the field of water resources management in Iraq. Such expansion with the absence of bi-lateral agreement between the three riparian countries of Tigris and Euphrates Rivers; Turkey, Syria and Iraq, is expected to lead to a substantially reduction of water inflow to the territories of Iraq. Accordingly, this study consists of two parts: first part is aiming to study the changes of the water inflow to the territory of Iraq, at Turkey and Syria borders, from 1953 to 2009; the results indicated that the annual mean inflow in Tigris River was decreased from 677 m³/sec to 526 m³/sec, after operating Turkey reserv

View Publication Preview PDF

(3)

Publication Date

Tue Mar 01 2016

Journal Name

Journal Of Engineering

Analysis of Recorded Inflow Data of Ataturk Reservoir

Time series

Reservoir operation

Euphrates River

Ataturk Dam

and Forecasting.

Ahmed Mohammed

Zienh Sami

...Show More Authors

Since the beginning of the last century, the competition for water resources has intensified dramatically, especially between countries that have no agreements in place for water resources that they share. Such is the situation with the Euphrates River which flows through three countries (Turkey, Syria, and Iraq) and represents the main water resource for these countries. Therefore, the comprehensive hydrologic investigation needed to derive optimal operations requires reliable forecasts. This study aims to analysis and create a forecasting model for data generation from Turkey perspective by using the recorded inflow data of Ataturk reservoir for the period (Oct. 1961 - Sep. 2009). Based on 49 years of real inflow data

View Publication Preview PDF

Publication Date

Fri Apr 12 2019

Journal Name

Journal Of Economics And Administrative Sciences

Accounting Mining Data Using Neural Networks (Case study)

التنقيب المحاسبي

الشبكات العصبية الاصطناعية

شبكة (Multilayer Perceptron)

تدريب الشبكة العصبية

Accounting Mining

Neural Networks

Multilayer Perceptron Network

Training Neural Networks

وحيد محمود

...Show More Authors

Business organizations have faced many challenges in recent times, most important of which is information technology, because it is widely spread and easy to use. Its use has led to an increase in the amount of data that business organizations deal with an unprecedented manner. The amount of data available through the internet is a problem that many parties seek to find solutions for. Why is it available there in this huge amount randomly? Many expectations have revealed that in 2017, there will be devices connected to the internet estimated at three times the population of the Earth, and in 2015 more than one and a half billion gigabytes of data was transferred every minute globally. Thus, the so-called data mining emerged as a

View Publication Preview PDF

(1)

Publication Date

Mon Jan 01 2018

Journal Name

Matec Web Of Conferences

Carbon-13 Characterization and Modelling for Temperature Measurement-Based Proton Frequency

Abdullah M.A.

Thar M Badri Albarody

Alaa Raad

...Show More Authors

The physical substance at high energy level with specific circumstances; tend to behave harsh and complicated, meanwhile, sustaining equilibrium or non-equilibrium thermodynamic of the system. Measurement of the temperature by ordinary techniques in these cases is not applicable at all. Likewise, there is a need to apply mathematical models in numerous critical applications to measure the temperature accurately at an atomic level of the matter. Those mathematical models follow statistical rules with different distribution approaches of quantities energy of the system. However, these approaches have functional effects at microscopic and macroscopic levels of that system. Therefore, this research study represents an innovative of a wi

View Publication

Publication Date

Tue Jan 01 2019

Journal Name

International Journal Of Advanced Computer Science And Applications

Achieving Flatness: Honeywords Generation Method for Passwords based on user behaviours

Omar Z

Ann

G.

H.

...Show More Authors

View Publication

(3)

Publication Date

Sun Jul 01 2018

Journal Name

2018 2nd International Conference On Imaging, Signal Processing And Communication (icispc)

Analogy-based Common-Sense Knowledge for Opinion-Target Identification and Aggregation

Feature extraction

Task analysis

Semantics

Sentiment analysis

Aggregates

Encyclopedias

Omar Mustafa

Nurul Hashimah Ahamed Hassain

Yu-N

...Show More Authors

The development of Web 2.0 has improved people's ability to share their opinions. These opinions serve as an important piece of knowledge for other reviewers. To figure out what the opinions is all about, an automatic system of analysis is needed. Aspect-based sentiment analysis is the most important research topic conducted to extract reviewers-opinions about certain attribute, for instance opinion-target (aspect). In aspect-based tasks, the identification of the implicit aspect such as aspects implicitly implied in a review, is the most challenging task to accomplish. However, this paper strives to identify the implicit aspects based on hierarchical algorithm incorporated with common-sense knowledge by means of dimensionality reduction.

View Publication Preview PDF

(3)

(2)

Publication Date

Tue Jul 01 2014

Journal Name

Computer Engineering And Intelligent Systems

Static Analysis Based Behavioral API for Malware Detection using Markov Chain

M.

Lafta

...Show More Authors

Researchers employ behavior based malware detection models that depend on API tracking and analyzing features to identify suspected PE applications. Those malware behavior models become more efficient than the signature based malware detection systems for detecting unknown malwares. This is because a simple polymorphic or metamorphic malware can defeat signature based detection systems easily. The growing number of computer malwares and the detection of malware have been the concern for security researchers for a large period of time. The use of logic formulae to model the malware behaviors is one of the most encouraging recent developments in malware research, which provides alternatives to classic virus detection methods. To address the l

Publication Date

Fri Jan 01 2016

Journal Name

Journal Of Engineering

Enhanced Chain-Cluster Based Mixed Routing Algorithm for Wireless Sensor Networks

wireless sensor networks

energy efficiency

cluster routing algorithm

chain routing algorithm.

Husam Kareem

...Show More Authors

Energy efficiency is a significant aspect in designing robust routing protocols for wireless sensor networks (WSNs). A reliable routing protocol has to be energy efficient and adaptive to the network size. To achieve high energy conservation and data aggregation, there are two major techniques, clusters and chains. In clustering technique, sensor networks are often divided into non-overlapping subsets called clusters. In chain technique, sensor nodes will be connected with the closest two neighbors, starting with the farthest node from the base station till the closest node to the base station. Each technique has its own advantages and disadvantages which motivate some researchers to come up with a hybrid routing algorit

View Publication Preview PDF

Publication Date

Sun Aug 06 2023

Journal Name

Journal Of Economics And Administrative Sciences

Probit and Improved Probit Transform-Based Kernel Estimator for Copula Density

Copula function

Probit transformation

Kernel copula function

Improved probit transformation

Mirror reflection

Boundary bias

Fatimah Hashim

Munaf Yousif

...Show More Authors

Copula modeling is widely used in modern statistics. The boundary bias problem is one of the problems faced when estimating by nonparametric methods, as kernel estimators are the most common in nonparametric estimation. In this paper, the copula density function was estimated using the probit transformation nonparametric method in order to get rid of the boundary bias problem that the kernel estimators suffer from. Using simulation for three nonparametric methods to estimate the copula density function and we proposed a new method that is better than the rest of the methods by five types of copulas with different sample sizes and different levels of correlation between the copula variables and the different parameters for the function. The

Publication Date

Sun Feb 13 2022

Journal Name

Petroleum & Coal

Laboratory-Based Correlations to Estimate Geomechanical Properties for Carbonate Tight Reservoir.

Nagham

...Show More Authors

Rock mechanical properties are critical parameters for many development techniques related to tight reservoirs, such as hydraulic fracturing design and detecting failure criteria in wellbore instability assessment. When direct measurements of mechanical properties are not available, it is helpful to find sufficient correlations to estimate these parameters. This study summarized experimentally derived correlations for estimating the shear velocity, Young's modulus, Poisson's ratio, and compressive strength. Also, a useful correlation is introduced to convert dynamic elastic properties from log data to static elastic properties. Most of the derived equations in this paper show good fitting to measured data, while some equations show scatters

1 2 ... 89 90 91 92 ... 872 873