Text documents are unstructured and high dimensional. Effective feature selection is required to select the most important and significant feature from the sparse feature space. Thus, this paper proposed an embedded feature selection technique based on Term Frequency-Inverse Document Frequency (TF-IDF) and Support Vector Machine-Recursive Feature Elimination (SVM-RFE) for unstructured and high dimensional text classificationhis technique has the ability to measure the feature’s importance in a high-dimensional text document. In addition, it aims to increase the efficiency of the feature selection. Hence, obtaining a promising text classification accuracy. TF-IDF act as a filter approach which measures features importance of the text documents at the first stage. SVM-RFE utilized a backward feature elimination scheme to recursively remove insignificant features from the filtered feature subsets at the second stage. This research executes sets of experiments using a text document retrieved from a benchmark repository comprising a collection of Twitter posts. Pre-processing processes are applied to extract relevant features. After that, the pre-processed features are divided into training and testing datasets. Next, feature selection is implemented on the training dataset by calculating the TF-IDF score for each feature. SVM-RFE is applied for feature ranking as the next feature selection step. Only top-rank features will be selected for text classification using the SVM classifier. Based on the experiments, it shows that the proposed technique able to achieve 98% accuracy that outperformed other existing techniques. In conclusion, the proposed technique able to select the significant features in the unstructured and high dimensional text document.
A study of taxonomic quality of soil algae was conducted with some environmental variables in three sites of local gardens (Kadhimiya, Adhamiya and Dora) within the governorate of Baghdad for the period from October 2016 to March 2017. The study identified 28 species belonging to 16 species in which the predominance of blue green algae (18 species) Followed by Bacillarophyta algae (7 species) and three types of Chlorophyta. The study showed an increase in species of Oscillatoria. The results showed no significant differences between sites in temperature, pH and relative humidity, while there were clear differences between sites for salinity and nutrient The study showed a difference of irrigation water quality and use of different fertilize
... Show MoreLinear discriminant analysis and logistic regression are the most widely used in multivariate statistical methods for analysis of data with categorical outcome variables .Both of them are appropriate for the development of linear classification models .linear discriminant analysis has been that the data of explanatory variables must be distributed multivariate normal distribution. While logistic regression no assumptions on the distribution of the explanatory data. Hence ,It is assumed that logistic regression is the more flexible and more robust method in case of violations of these assumptions.
In this paper we have been focus for the comparison between three forms for classification data belongs
... Show More
The idea of this research is the basis of the role exercised by the dimensions of performance management (Performance Planning- performance evaluation- improve the performance and development- feedback - Review and Performance Monitor) In order to achieve the success Organizational Is through the (strategic vision- the operational activity- development of the company- selection of personnel- the company's culture- Leadership and Management- Personal Development - Assessment and Review).And The research aims to identify the extent of the responsibility of performance management in achieving success Organizational through main hypotheses branched out by the sub-hypotheses to knowing out the&nbs
... Show Morethe study considers the optical classification of cervical nodal lymph cells and is based on research into the development of a Computer Aid Diagnosis (CAD) to detect the malignancy cases of diseases. We consider 2 sets of features one of them is the statistical features; included Mode, Median, Mean, Standard Deviation and Maximum Probability Density and the second set are the features that consist of Euclidian geometrical features like the Object Perimeter, Area and Infill Coefficient. The segmentation method is based on following up the cell and its background regions as ranges in the minimum-maximum of pixel values. The decision making approach is based on applying of Minimum Dista
Accurate detection of Electro Cardio Graphic (ECG) features is an important demand for medical purposes, therefore an accurate algorithm is required to detect these features. This paper proposes an approach to classify the cardiac arrhythmia from a normal ECG signal based on wavelet decomposition and ID3 classification algorithm. First, ECG signals are denoised using the Discrete Wavelet Transform (DWT) and the second step is extract the ECG features from the processed signal. Interactive Dichotomizer 3 (ID3) algorithm is applied to classify the different arrhythmias including normal case. Massachusetts Institute of Technology-Beth Israel Hospital (MIT-BIH) Arrhythmia Database is used to evaluate the ID3 algorithm. The experimental resul
... Show MoreEvaluation of Dot. ELISA test for Diagnosis Visceral Leishmaniasis in Infected Children
For the first time in Iraq, this study was conducted to evaluate the usefulness of Dot.ELISA, for detecting anti - Leishmania donovani antibodies in serum samples from suspected patient (children under 8 years ) with Visceral Leishmaniasis V.L.. Sera from 73 V.L. , 60 Healthy controls, and 57 patient with other parasitic diseases other than V.L. (Amoebiasis, Giardiasis , Toxoplasmosis, Schistosomiasis , Hydatidosis, Ascariasis , Lupus Erythromatosus , Viral Hepatitis, and Cutaneous Leishmaniasis) were examined. Anti Leishmania donovani antibodies detected in 71 out of 73 suspected Visceral Leishmaniasis . Data of this study showed that infection in male group was more than female group. Result o
... Show More